Reward Hacking in Self-Improving Code Agents

Bingchen Zhao ⋅ Dhruv Srikanth ⋅ Yuxiang Wu ⋅ Zhengyao Jiang

Abstract
