PAC Reinforcement Learning Algorithm for General-Sum Markov Games

This paper presents a theoretical framework for probably approximately correct (PAC) multi-agent reinforcement learning (MARL) algorithms for Markov games. Using the idea of delayed Q-learning, the paper extends the well-known Nash Q-learning algorithm to buil...