OpenGVLab/OmniQuant

by OpenGVLab · Quantization & Formats · updated 6mo ago

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Momentum: 33 · Stars: 899 · Forks: 82 · Rank: #184
large-language-models · llm · quantization