can-i-run-this-llm

Rig report

NVIDIA RTX 4070 (12GB)

12.0 GB memory · 504 GB/s · 8 of 18 tracked models fit

Best model it can run

Gemma 4 12B · Runs great

Q3_K_M · 5.1 GB · 42–63 tok/s · capability 68/100

Fastest that fits: Llama 3.2 1B at 103–154 tok/s



Have an NVIDIA RTX 4070 (12GB)? Add your real benchmark to sharpen these numbers for everyone.

Estimates are derived from device specs and model architectures, then refined by community benchmarks. See the methodology.
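
As a rough illustration of how figures like these can be derived from device specs, here is a minimal sketch. It is not the site's actual methodology. It assumes that a model "fits" when its quantized weights plus a fixed overhead stay within GPU memory, and that decode speed is bounded by memory bandwidth divided by model size. The function names, the 1.5 GB overhead, and the 0.4 to 0.65 efficiency factors are hypothetical values chosen for illustration only.

```python
# Hypothetical sketch: NOT the calculator's real method.
# Assumption: decoding one token reads every weight once, so
# throughput is capped at (memory bandwidth / model size).


def fits(model_gb: float, vram_gb: float, overhead_gb: float = 1.5) -> bool:
    """Return True if weights plus an assumed overhead fit in GPU memory.

    overhead_gb is an illustrative allowance for KV cache and runtime
    buffers; real usage depends on context length and backend.
    """
    return model_gb + overhead_gb <= vram_gb


def decode_ceiling_tok_s(bandwidth_gb_s: float, model_gb: float) -> float:
    """Bandwidth-bound upper limit on decode speed, in tokens per second."""
    return bandwidth_gb_s / model_gb


def decode_range_tok_s(
    bandwidth_gb_s: float,
    model_gb: float,
    eff_low: float = 0.4,    # assumed pessimistic efficiency
    eff_high: float = 0.65,  # assumed optimistic efficiency
) -> tuple[float, float]:
    """Scale the ceiling by assumed efficiency factors to get a range."""
    ceiling = decode_ceiling_tok_s(bandwidth_gb_s, model_gb)
    return ceiling * eff_low, ceiling * eff_high


if __name__ == "__main__":
    # Figures from the report above: 12.0 GB, 504 GB/s, 5.1 GB model.
    print("fits:", fits(5.1, 12.0))
    print("ceiling tok/s:", round(decode_ceiling_tok_s(504, 5.1), 1))
    low, high = decode_range_tok_s(504, 5.1)
    print(f"estimated range: {low:.0f} to {high:.0f} tok/s")
```

With the report's numbers, the bandwidth ceiling works out to about 99 tok/s (504 / 5.1), so the listed 42–63 tok/s sits well under it, which is consistent with a bandwidth-bound estimate scaled by a real-world efficiency factor.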