PostTrainBench: Measuring how well AI agents can post-train language models

1 point, posted 18 hours ago
by frozenseven

No comments yet