The Local LLM Index / Quantization & Formats / #15
cactus-compute/cactus
by cactus-compute · Quantization & Formats · updated today
Low-latency AI engine for mobile devices & wearables
74
momentum
5,339
stars
428
forks
#15
rank
aiandroidarmedgeedge-aiframeworkiosllamacppllmllm-inferencellmsmobile
View on GitHub →