The Local LLM Index / Inference Engines / #54

alibaba/rtp-llm

by alibaba · Inference Engines · updated today

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

67
momentum
1,220
stars
212
forks
#54
rank
gptinferencellamallmllm-servingllmopsmodel-serving
View on GitHub →