Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Runze Liu ⋅ Junqi Gao ⋅ Jian Zhao ⋅ Kaiyan Zhang ⋅ Xiu Li ⋅ Biqing Qi ⋅ Wanli Ouyang ⋅ Bowen Zhou

Abstract
