Hacker News
Unweight: We compressed an LLM 22% without sacrificing quality
4 points by jgrahamc, posted 6 hours ago (blog.cloudflare.com)
No comments yet