

Poster

RefactorBench: Evaluating Stateful Reasoning in Language Agents Through Code

Dhruv Gautam ⋅ Spandan Garg ⋅ Jinu Jang ⋅ Neel Sundaresan ⋅ Roshanak Zilouchian Moghaddam
2025 Poster
