Topic

#Mooncake

1 article exploring Mooncake. Expert insights and analysis from our editorial team.

Showing 1โ€“1 of 1 articles

Articles

Newest first
AI Infrastructure

Prefill-Decode Disaggregation: The Architecture Shift Redefining LLM Serving at Scale

Prefill-decode disaggregation separates compute-bound prefill from memory-bound decode onto dedicated hardware, eliminating phase interference.

ยท 9 min read