Hackernews
new
show
ask
jobs
Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering
2 points
posted 10 hours ago
by wek
(arxiv.org)
No comments yet