Hackernews
new
show
ask
jobs
QuantumLeap: 2.3× faster MoE inference with intelligent expert caching
1 points
posted 7 hours ago
by ikharoz
(github.com)
No comments yet