

Poster

Towards Understanding Sycophancy in Language Models

Mrinank Sharma ⋅ Meg Tong ⋅ Tomek Korbak ⋅ David Duvenaud ⋅ Amanda Askell ⋅ Sam Bowman ⋅ Esin Durmus ⋅ Zac Hatfield-Dodds ⋅ Scott Johnston ⋅ Shauna Kravec ⋅ Timothy Maxwell ⋅ Sam McCandlish ⋅ Kamal Ndousse ⋅ Oliver Rausch ⋅ Nicholas Schiefer ⋅ Da Yan ⋅ Miranda Zhang ⋅ Ethan Perez
2024 Poster

Abstract

