HaujetZhao/Qwen3-TTS-GGUF

by HaujetZhao · Quantization & Formats · updated 1mo ago

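The fastest Qwen3-TTS inference solution. It exports the LLM part of Qwen3-TTS to GGUF and runs it with llama.cpp for accelerated inference. llama.cpp supports Vulkan and CUDA acceleration.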

168 stars · rank #143