Skip to yearly menu bar Skip to main content


A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior

Harry Mayne ⋅ Justin Kang ⋅ Dewi Gould ⋅ Kannan Ramchandran ⋅ Adam Mahdi ⋅ Noah Y Siegel

Abstract

Log in and register to view live content