SqueezeAILab/SqueezeLLM

by SqueezeAILab · Quantization & Formats · updated 1y ago

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

32 momentum · 722 stars · 50 forks · rank #187
efficient-inference · large-language-models · llama · llm · localllm · model-compression · natural-language-processing · post-training-quantization · quantization · small-models · text-generation · transformer