raullenchai/Rapid-MLX

by raullenchai · Runners · updated today

The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.

momentum

3,438 stars · 397 forks · rank #21

apple-silicon · claude-code · cursor · deepseek · fastapi · hacktoberfest · inference · llm · local-llm · m1 · m2 · m3

More in Runners