CROWDSOURCED BENCHMARKS
Real measured speeds
Tokens/sec that real people measured on their own machines, next to our estimate. Numbers from 1–2 reports are marked prelim until a third confirms them. Ran a model yourself? Add your benchmark →
🔬 3 benchmarks · 2 devices · 1 model
· 2 rows
| MacBook Pro M2 Max (32GB) | Llama 3.1 8B Q4_K_M | 49.3prelim | 48 | +3% | 2 |
| NVIDIA RTX 4090 (24GB) | Llama 3.1 8B Q4_K_M | 130.2prelim | 125 | +4% | 1 |