Skip to yearly menu bar Skip to main content


Poster

MA22E: Addressing Partial Observability in Multi-Agent Reinforcement Learning with Masked Auto-Encoder

Sehyeok Kang · Yongsik Lee · Gahee Kim · Song Chong · Se-Young Yun

Hall 3 + Hall 2B #402
[ ]
Thu 24 Apr midnight PDT — 2:30 a.m. PDT

Abstract: Centralized Training and Decentralized Execution (CTDE) is a widely adopted paradigm to solve cooperative multi-agent reinforcement learning (MARL) problems. Despite the successes achieved with CTDE, partial observability still limits cooperation among agents. While previous studies have attempted to overcome this challenge through communication, direct information exchanges could be restricted and introduce additional constraints. Alternatively, if an agent can infer the global information solely from local observations, it can obtain a global view without the need for communication. To this end, we propose the Multi-Agent Masked Auto-Encoder (MA22E), which utilizes the masked auto-encoder architecture to infer the information of other agents from partial observations. By employing masking to learn to reconstruct global information, MA22E serves as an inference module for individual agents within the CTDE framework. MA22E can be easily integrated into existing MARL algorithms and has been experimentally proven to be effective across a wide range of environments and algorithms.

Live content is unavailable. Log in and register to view live content