Hackernews
Fastest small LLM at 1 KB context is the slowest at 1 MB
1 point
posted 7 hours ago
by mmoustafa
(blog.0xmmo.co)