Hackernews
new
show
ask
jobs
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
8 points
posted 10 months ago
by sarkory
(github.com)
1 Comments
zexinwu
10 months ago
[dead]