Researchers find why larger language models pick up skills that small ones miss

4 pointsposted 8 hours ago
by maxloh

1 Comments

masking

5 hours ago

I for one can’t wait to see a 1.2b, 2b, or 3b model use skills. It would democratize the agentic AI landscape. My 4b model is to slow on my M1Pro. Need quicker access to standardized skills and tool usage.