Skip to yearly menu bar Skip to main content


Poster Sat, Apr 25, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 4 P4-#4608

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs

Huining Yuan ⋅ Zelai Xu ⋅ Zheyue Tan ⋅ Xiangmin Yi ⋅ Mo Guang ⋅ Kaiwen Long ⋅ Haojia Hui ⋅ BOXUN LI ⋅ Xinlei Chen ⋅ Bo Zhao ⋅ Xiao-Ping Zhang ⋅ Chao Yu ⋅ Yu Wang

Abstract

Log in and register to view live content