

Scaling Test-Time Compute Without Verification or RL is Suboptimal

Amrith Setlur ⋅ Nived Rajaraman ⋅ Sergey Levine ⋅ Aviral Kumar

Abstract
