DFlash Demo on TPU

Real-time inference comparison, Qwen3-4B on TPU v5p
DONE
Autoregressive (Baseline)
0
tok/s
0%
progress
DONE
DFlash--
0
tok/s
0%
progress
0.0s