NEO-MLSys25/NEO

by NEO-MLSys25 · Inference Engines · updated 12mo ago

NEO is an LLM inference engine built to ease the GPU memory crisis through CPU offloading.


Rank: #227
