首页 正文

MACRPO: Multi-agent cooperative recurrent policy optimization

{{output}}
This work considers the problem of learning cooperative policies in multi-agent settings with partially observable and non-stationary environments without a communication channel. We focus on improving information sharing between agents and propose a new multi... ...