Poster Fri, Apr 24, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 3 P3-#420

AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite

Jonathan Bragg ⋅ Mike D'Arcy ⋅ Nishant Balepur ⋅ Dan Bareket ⋅ Bhavana Dalvi Mishra ⋅ Sergey Feldman ⋅ Dany Haddad ⋅ Jena Hwang ⋅ Peter Jansen ⋅ Varsha Kishore ⋅ Bodhisattwa Prasad Majumder ⋅ Aakanksha Naik ⋅ Sigal Rahamimov ⋅ Kyle Richardson ⋅ Amanpreet Singh ⋅ Harshit Surana ⋅ Aryeh Tiktinsky ⋅ Rosni Vasu ⋅ Guy Wiener ⋅ Chloe Anastasiades ⋅ Stefanus Candra ⋅ Jason Dunkelberger ⋅ Daniel Emery ⋅ Rob Evans ⋅ Malachi Hamada ⋅ Regan Huff ⋅ Rodney Kinney ⋅ Matt Latzke ⋅ Jaron Lochner ⋅ Ruben Lozano-Aguilera ⋅ Ngoc-Uyen Nguyen ⋅ Smita Rao ⋅ Amber Tanaka ⋅ Brooke Vlahos ⋅ Peter Clark ⋅ Doug Downey ⋅ Yoav Goldberg ⋅ Ashish Sabharwal ⋅ Daniel Weld

Abstract
