

Poster in Workshop: Lifelong Agents: Learning, Aligning, Evolving
Sun, Apr 26, 2026 • 11:00 AM – 12:00 PM PDT

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Siyan Zhao ⋅ Zhihui Xie ⋅ Mengchen Liu ⋅ Jing Huang ⋅ Guan Pang ⋅ Feiyu Chen ⋅ Aditya Grover

Abstract
