Poster Sat, Apr 25, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#4814

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Yuqian Fu ⋅ Tinghong Chen ⋅ Jiajun Chai ⋅ Xihuai Wang ⋅ Songjun Tu ⋅ Guojun Yin ⋅ Wei Lin ⋅ Qichao Zhang ⋅ Yuanheng Zhu ⋅ Dongbin Zhao

Abstract
