Hackernews
new
show
ask
jobs
Bluffbench is near saturation: LLMs can interpret counterintuitive plots
2 points
posted 14 hours ago
by ionychal
(opensource.posit.co)
No comments yet