LiangSu8899/FlashRT

by LiangSu8899 · Inference Engines · updated today

FlashRT is a high-performance real-time inference engine for small-batch, latency-sensitive AI workloads. The flagship integration is production VLA control for Pi0, Pi0.5, GR00T N1.6, and Pi0-FAST. It also supports LLMs, e.g., Qwen3.6-27B.

328 stars · rank #48

cuda · cuda-kernels · gr00t · gr00t-n1-6-3b · jetson · jetson-orin · jetson-thor · motus · pi · pi05 · qwen · qwen3-6
