The Local LLM Index / Inference Engines / #221
lumia431/photon_infer
by lumia431 · Inference Engines · updated 5mo ago
A High-Performance LLM Inference Engine with vLLM-Style Continuous Batching
26
momentum
111
stars
7
forks
#221
rank
ai-infracontinuous-batchinginference-enginellm-inferencemodern-cpppaged-attentionvllm
View on GitHub →