Hackernews
Task-Specific LLM Evals That Do and Don't Work
1 point
posted 11 hours ago
by eigenBasis
(eugeneyan.com)
No comments yet