Gemma 4 MTP on GB10 — Inference 1.83x faster on 26B, 3.52x on 31B Dense
An independent measurement of Google's Gemma 4 MTP drafter on an NVIDIA GB10 (GX10): 1.83x faster inference on 26B-A4B and 3.52x on 31B Dense. No public independent benchmark of 31B Dense with MTP on a single GPU could be found, so this appears to be the first such measurement. The 31B Dense result reproduces and exceeds Google's 'up to 3x' claim on real hardware.