Benchmark for measuring how well AI agents perform at ML engineering

1 point, posted a year ago
by zerojames

No comments yet