The Local LLM Index / Quantization & Formats / #101

gpustack/gguf-parser-go

by gpustack · Quantization & Formats · updated 8d ago

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

58
momentum
270
stars
24
forks
#101
rank
ggufgollama-boxllama-cppstable-diffusion-cpp
View on GitHub →