Hackernews
new
show
ask
jobs
We Benchmarked Frontier Reasoning Models on the Atlantic's Bracket City
4 points
posted 11 hours ago
by brgross
(redspring.xyz)
No comments yet