Skip to yearly menu bar Skip to main content


LogicReward: Incentivizing LLM Reasoning via Step-Wise Logical Supervision

Jundong Xu ⋅ Hao (Scofield) Fei ⋅ Huichi Zhou ⋅ Xin Quan ⋅ Qijun Huang ⋅ Shengqiong Wu ⋅ William Wang ⋅ Mong-Li Lee ⋅ Wynne Hsu

Abstract

Chat is not available.