Skip to yearly menu bar Skip to main content


Poster Sat, Apr 25, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 3 P3-#507

Incentivizing LLM Reasoning via Reinforcement Learning with Functional Monte Carlo Tree Search

Kongcheng Zhang ⋅ QI YAO ⋅ Baisheng Lai ⋅ Jiaxing Huang ⋅ Wenkai Fang ⋅ Dacheng Tao ⋅ Mingli Song ⋅ Shunyu Liu

Abstract

Log in and register to view live content