Thireus/GGUF-Tool-Suite

by Thireus · Quantization & Formats · updated 1d ago

Produce your own Dynamic 3.0 quants. Given a target size, the toolchain generates a GGUF recipe tuned to your hardware within seconds, aiming for the lowest achievable perplexity/KLD at that size. Built for GGUF enthusiasts who want flexible model sizing and precise, automated dynamic quant production.

56 momentum · 137 stars · 18 forks · rank #111