Skip to yearly menu bar Skip to main content


Poster Fri, Apr 24, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#4712

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Yulei Qin ⋅ Xiaoyu Tan ⋅ Zhengbao He ⋅ Gang Li ⋅ Haojia Lin ⋅ Zongyi Li ⋅ Zihan Xu ⋅ Yuchen Shi ⋅ Siqi Cai ⋅ Renting Rui ⋅ Shaofei Cai ⋅ Yuzheng Cai ⋅ Xuan Zhang ⋅ Sheng Ye ⋅ Ke Li ⋅ Xing Sun

Abstract

Log in and register to view live content