Oral

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Jun Shern Chan · Neil Chowdhury · Oliver Jaffe · James Aung · Dane Sherburn · Evan Mays · Giulio Starace · Kevin Liu · Leon Maksin · Tejal Patwardhan · Aleksander Madry · Lilian Weng
2025 Oral

Abstract
