GB10 | Analysis | SUZUKI SHOTEN

GB10

1 analysis tagged with "GB10"

Gemma 4 MTP on GB10 — Inference 1.83x faster on 26B, 3.52x on 31B Dense

Independent measurement of Google's Gemma 4 MTP drafter on NVIDIA GB10 (GX10). 1.83x speedup on 26B-A4B and 3.52x on 31B Dense. No public independent benchmark of 31B Dense + 1 GPU + MTP could be found, making this the first such measurement. Reproduces and exceeds Google's 'up to 3x' claim on real hardware.

2026-05-07 Read →