The Local LLM Index / Quantization & Formats / #180
mit-han-lab/TinyChatEngine
by mit-han-lab · Quantization & Formats · updated 1y ago
TinyChatEngine: On-Device LLM Inference Library
34
momentum
954
stars
99
forks
#180
rank
armccppcuda-programmingdeep-learningedge-computinglarge-language-modelson-device-aiquantizationx86-64
View on GitHub →