Hackernews
new
show
ask
jobs
Macro Evals for Agentic Systems
2 points
posted 12 hours ago
by gmays
(developers.openai.com)
No comments yet