Poster

Reshaping Reasoning in LLMs: A Theoretical Analysis of RL Training Dynamics through Pattern Selection

Xingwu Chen · Tianle Li · Difan Zou

Abstract
