can-i-run-this-llm

CROWDSOURCED BENCHMARKS

Real measured speeds

Tokens/sec that real people measured on their own machines, next to our estimate. Numbers from 1–2 reports are marked prelim until a third confirms them. Ran a model yourself? Add your benchmark →

🔬 3 benchmarks · 2 devices · 1 model

· 2 rows
MacBook Pro M2 Max (32GB)Llama 3.1 8B Q4_K_M49.3prelim48+3%2
NVIDIA RTX 4090 (24GB)Llama 3.1 8B Q4_K_M130.2prelim125+4%1

Most wanted — no benchmarks yet