MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering / Artificial Intelligence, Research / By hi@aiweekly.co.in We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.