The Local LLM Index / Quantization & Formats / #216

inferflow/inferflow

by inferflow · Quantization & Formats · updated 2y ago

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

27
momentum
251
stars
24
forks
#216
rank
baichuan2bloomdeepseekfalcongemmainternlmllama2llamacppllm-inferencem2m100minicpmmistral
View on GitHub →