lumia431/photon_infer

by lumia431 · Inference Engines · updated 5mo ago

A High-Performance LLM Inference Engine with vLLM-Style Continuous Batching

Momentum: 26 · Stars: 111 · Forks: 7 · Rank: #221
ai-infra · continuous-batching · inference-engine · llm-inference · modern-cpp · paged-attention · vllm