inferflow/inferflow
by inferflow · Quantization & Formats · updated 2y ago
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
27 momentum · 251 stars · 24 forks · rank #216
baichuan2 · bloom · deepseek · falcon · gemma · internlm · llama2 · llamacpp · llm-inference · m2m100 · minicpm · mistral