Skip to yearly menu bar Skip to main content


Poster

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Jun Shern Chan · Neil Chowdhury · Oliver Jaffe · James Aung · Dane Sherburn · Evan Mays · Giulio Starace · Kevin Liu · Leon Maksin · Tejal Patwardhan · Aleksander Madry · Lilian Weng
2025 Poster

Abstract

Video

Chat is not available.