Skip to yearly menu bar Skip to main content


Agent Psychometrics: Task-Level Performance Prediction in Agentic Coding Benchmarks

Chris Ge ⋅ Daria Kryvosheieva ⋅ Daniel Fried ⋅ Uzay Girit ⋅ Kaivalya Hariharan

Abstract

Chat is not available.