The Local LLM Index / Inference Engines / #107
HJCheng0602/nanoPD
by HJCheng0602 · Inference Engines · updated 1mo ago
A from-scratch Prefill/Decode disaggregation inference engine for LLMs
57
momentum
156
stars
27
forks
#107
rank
decodeinferenceprefill
View on GitHub →