The Local LLM Index / Runners / #24
raullenchai/Rapid-MLX
by raullenchai · Runners · updated today
The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.
72
momentum
2,773
stars
341
forks
#24
rank
apple-siliconclaude-codecursordeepseekfastapihacktoberfestinferencellmlocal-llmm1m2m3
View on GitHub →