Skip to yearly menu bar Skip to main content


Poster

Probing to Refine: Reinforcement Distillation of LLM Reasoners via Explanatory Inversion

Zhen Tan · Chengshuai Zhao · Song Wang · Jundong Li · Tianlong Chen · huan liu

Abstract

Log in and register to view live content