Skip to yearly menu bar Skip to main content


Poster Sat, Apr 25, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 3 P3-#1417

TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning

Shenzhi Yang ⋅ Guangcheng Zhu ⋅ Haobo Wang ⋅ Xing Zheng ⋅ Yingfan MA ⋅ Zhongqi Chen ⋅ Bowen Song ⋅ Weiqiang Wang ⋅ Junbo Zhao ⋅ Gang Chen

Abstract

Log in and register to view live content