Skip to yearly menu bar Skip to main content


Poster

RAIN-Merging: A Gradient-Free Method to Enhance Instruction Following in Large Reasoning Models with Preserved Thinking Format

Zhehao Huang · Yuhang Liu · Baijiong Lin · Yixin Lou · Zhengbao He · Hanling Tian · Tao Li · Xiaolin Huang

Abstract

Log in and register to view live content