Confidence estimation is a better metric than agreement for LLM judges

3 pointsposted 7 hours ago
by rapiddev

No comments yet