Show HN: A Zero-Copy 1.58-bit LLM Engine hitting 117 Tokens/s on single CPU core

4 pointsposted 8 hours ago
by dhilipsiva

No comments yet