The Local LLM Index / Inference Engines / #48

LiangSu8899/FlashRT

by LiangSu8899 · Inference Engines · updated today

FlashRT is a high-performance realtime inference engine for small-batch, latency-sensitive AI workloads. The flagship integration is production VLA control for Pi0, Pi0.5, GROOT N1.6, and Pi0-FAST. Also support llm e.g, qwen3.6-27B

68
momentum
328
stars
39
forks
#48
rank
cudacuda-kernelsgr00tgr00t-n1-6-3bjetsonjetson-orinjetson-thormotuspipi05qwenqwen3-6
View on GitHub →