Hackernews
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
8 points
posted 2 days ago
by sarkory
(github.com)
1 comment
zexinwu
2 days ago
[dead]