Hackernews
new
show
ask
jobs
Benchmark for measuring how well AI agents perform at ML engineering
1 points
posted 13 hours ago
by zerojames
(github.com)
No comments yet