The Local LLM Index / Quantization & Formats / #180

mit-han-lab/TinyChatEngine

by mit-han-lab · Quantization & Formats · updated 1y ago

TinyChatEngine: On-Device LLM Inference Library

34
momentum
954
stars
99
forks
#180
rank
armccppcuda-programmingdeep-learningedge-computinglarge-language-modelson-device-aiquantizationx86-64
View on GitHub →