The Local LLM Index / Quantization & Formats / #201
dusty-nv/NanoLLM
by dusty-nv · Quantization & Formats · updated 1y ago
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
29
momentum
376
stars
65
forks
#201
rank
edge-aillm-inferencemultimodalragspeechvector-databasevision-transformer
View on GitHub →