The Local LLM Index / Runners / #24

raullenchai/Rapid-MLX

by raullenchai · Runners · updated today

The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.

72
momentum
2,773
stars
341
forks
#24
rank
apple-siliconclaude-codecursordeepseekfastapihacktoberfestinferencellmlocal-llmm1m2m3
View on GitHub →