The Local LLM Index / Local UIs & Apps / #211

modelscope/dash-infer

by modelscope · Local UIs & Apps · updated 10mo ago

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

27
momentum
273
stars
28
forks
#211
rank
cpucudaguided-decodingllmllm-inferencenative-engine
View on GitHub →