Skip to yearly menu bar Skip to main content


Rethinking Fine-tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning

Feng Chen ⋅ Allan Raventos ⋅ Nan Cheng ⋅ Surya Ganguli ⋅ Shaul Druckmann

Abstract

Chat is not available.