Hackernews
new
show
ask
jobs
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
9 points
posted 5 months ago
by Anon84
(nature.com)
No comments yet