Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

8 points, posted 10 months ago
by sarkory

1 Comment