gpustack/gguf-parser-go
by gpustack · Quantization & Formats · updated 8d ago
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
Momentum: 58 · Stars: 270 · Forks: 24 · Rank: #101
Tags: gguf · go · llama-box · llama-cpp · stable-diffusion-cpp