PostTrainBench: Measuring how well AI agents can post-train language models

1 pointsposted a month ago
by frozenseven

No comments yet