Hackernews
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
3 points
posted 11 hours ago
by giuliomagnifico
(nature.com)