

Poster

metabench - A Sparse Benchmark of Reasoning and Knowledge in Large Language Models

Alex Kipnis · Konstantinos Voudouris · Luca Schulze Buschoff · Eric Schulz
2025 Poster

Abstract

