Hacker News
Agent evals should feel like real work
2 points, posted 13 hours ago by zed_labs_dev (zohaib.cc)
No comments yet