Skip to yearly menu bar Skip to main content


Poster Fri, Apr 24, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 3 P3-#1408

ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists

Jie Ruan ⋅ Inderjeet Nair ⋅ Shuyang Cao ⋅ Amy Liu ⋅ Sheza Munir ⋅ Micah Pollens-Dempsey ⋅ Yune-Ting Chiang ⋅ Lucy Kates ⋅ Nicholas David ⋅ Sihan Chen ⋅ Ruxin Yang ⋅ Yuqian Yang ⋅ Jihyun Gump ⋅ Tessa Bialek ⋅ Vivek Sankaran ⋅ Margo Schlanger ⋅ Lu Wang

Abstract

Log in and register to view live content