

Poster

metabench - A Sparse Benchmark of Reasoning and Knowledge in Large Language Models

Alex Kipnis ⋅ Konstantinos Voudouris ⋅ Luca Schulze Buschoff ⋅ Eric Schulz
2025 Poster

Abstract
