MACRPO: Multi-agent cooperative recurrent policy optimization

This work considers the problem of learning cooperative policies in multi-agent settings with partially observable and non-stationary environments without a communication channel. We focus on improving information sharing between agents and propose a new multi... ...

请注册登录后继续浏览