Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet

James Xu Zhao ⋅ Bryan Hooi ⋅ See-Kiong Ng

Abstract
