Skip to yearly menu bar Skip to main content


Poster Session #2
in
Workshop: Workshop on Scaling Post-training for LLMs (SPOT)
Mon, Apr 27, 2026 • 10:30 AM – 11:10 AM PDT

Reinforcement Learning via Self-Distillation

Jonas Hübotter ⋅ Frederike Lübeck ⋅ Lejs Behric ⋅ Anton Baumann ⋅ Marco Bagatella ⋅ Daniel Marta ⋅ Ido Hakimi ⋅ Idan Shenfeld ⋅ Thomas Kleine Buening ⋅ Carlos Guestrin ⋅ Andreas Krause

Abstract

Chat is not available.