Hackernews
new
show
ask
jobs
Show HN: QKV Core – Run 7B LLMs on 4GB VRAM via surgical memory alignment
1 points
posted 13 hours ago
by broxytr
(github.com)
No comments yet