Skip to yearly menu bar Skip to main content


Poster Session #2
in
Workshop: Workshop on Scaling Post-training for LLMs (SPOT)
Mon, Apr 27, 2026 • 10:30 AM – 11:10 AM PDT

Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes

Amrith Setlur ⋅ Zijian Wang ⋅ Andrew Cohen ⋅ Paria Rashidinejad ⋅ Sang Michael Xie

Abstract

Chat is not available.