Skip to yearly menu bar Skip to main content


Sycophantic Anchors: Localizing and Quantifying User Agreement in Reasoning Models

Jacek Duszenko

Abstract

Log in and register to view live content