jerojero

5 months ago

I think one of the most interesting parts here, and something we've been seeing with other models too, is how these capabilities can be passed down to smaller models to make them better.

Personally, I'm really interested in on-device models. Our phones have gotten pretty good, and I think for a lot of things it should be possible to have these capable but not amazing little ants running around in our phones doing things.

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
