cactus-compute/cactus

by cactus-compute · Quantization & Formats · updated today

Low-latency AI engine for mobile devices & wearables

74 momentum · 5,339 stars · 428 forks · rank #15
Tags: ai, android, arm, edge, edge-ai, framework, ios, llamacpp, llm, llm-inference, llms, mobile