DFlash Demo on TPU
Real-time inference comparison, Qwen3-4B on TPU v5p
DONE
Autoregressive (Baseline)
0
tok/s
0%
progress
DONE
DFlash
--
0
tok/s
0%
progress
Play
Reset
0.0s