Hackernews
Confidence estimation is a better metric than agreement for LLM judges
3 points by rapiddev, posted 7 hours ago (arxiv.org)
No comments yet