dusty-nv/NanoLLM

by dusty-nv · Quantization & Formats · updated 1y ago

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

29 momentum · 376 stars · 65 forks · rank #201
edge-ai · llm-inference · multimodal · rag · speech · vector-database · vision-transformer