Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering

1 pointsposted 10 hours ago
by popey

1 Comments

pqtr2

6 hours ago

Couldn't agree more. Coding benchmarks are just a score. Benchmark the harness.