Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Lifelong Agents: Learning, Aligning, Evolving
Sun, Apr 26, 2026 • 11:00 AM – 12:00 PM PDT

Verifying the Verifiers: Failure Attribution for Agentic Benchmark Diagnostics and Training Data Curation

Jesse Hu ⋅ Pratyush Shukla ⋅ Ke Huang

Abstract

Chat is not available.