Picovoice/picollm

by Picovoice · Quantization & Formats · updated 2d ago

On-device LLM Inference Powered by X-Bit Quantization

60 momentum · 312 stars · 25 forks · rank #95
compression · efficient-inference · gemma · generative-ai · language-model · language-models · large-language-model · llama · llama2 · llama3 · llm · llm-inference