Poster Thu, Apr 23, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 3 P3-#1501

Group Verification-based Policy Optimization for Interactive Coding Agents

Silong Dai ⋅ Changzhi Sun ⋅ Haolun Wu ⋅ Huanran Zheng ⋅ Tao Ji ⋅ Junchi Yan ⋅ Yuanbin Wu ⋅ Dell Zhang ⋅ Xiaoling Wang ⋅ Xuelong Li

Abstract
