Blog Track Poster Thu, Apr 23, 2026 • 11:15 AM – 1:45 PM PDT

Loneliness as a Case Study for Social Reward Misalignment

Samantha Adorno ⋅ Akshata Kishore Moharir ⋅ Ratna Kandala

[ OpenReview]

Abstract

The goal of this work is to use loneliness as a clear case study of proxy-reward misalignment in RL. We introduce a simulation where loneliness drifts over time and repeated short-term comfort increases an accumulated harm variable, then compare agents trained on engagement versus long-term well-being. We show that optimizing engagement leads to policies that prioritize immediate relief without improving the underlying state, motivating reward inference or well-being objectives over engagement proxies.

Video

Chat is not available.