Hackernews
new
show
ask
jobs
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
7 points
posted 11 hours ago
by Anon84
(nature.com)
No comments yet