SqueezeAILab/SqueezeLLM

by SqueezeAILab · Quantization & Formats · updated 1y ago

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

722 stars · rank #193

efficient-inference · large-language-models · llama · llm · localllm · model-compression · natural-language-processing · post-training-quantization · quantization · small-models · text-generation · transformer
