can-i-run-this-llm

GPU comparison

Mac Studio M2 Ultra (64GB) vs NVIDIA RTX Pro 6000 Blackwell (96GB) for local LLMs

How the two stack up for running open-source LLMs locally — memory, bandwidth, price, and how many of the 18 tracked models each one can run.

| | Mac Studio M2 Ultra (64GB) | NVIDIA RTX Pro 6000 Blackwell (96GB) |
| --- | --- | --- |
| Memory | 64.0 GB | 96.0 GB |
| Bandwidth | 800 GB/s | 1792 GB/s |
| Price (approx) | $3,999 | $8,500 |
| LLMs it runs | 16 of 18 | 16 of 18 |
| Best model it runs | Gemma 4 31B · 15–23 tok/s | Gemma 4 31B · 18–28 tok/s |
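
To put the raw specs on a common footing, the short sketch below derives price per GB of memory and the bandwidth ratio from the figures listed above. The dictionary layout and the helper name `usd_per_gb` are illustrative choices, not something the site provides, and the prices are the approximate ones quoted here.

```python
# Figures copied from the comparison above; prices are approximate.
specs = {
    "Mac Studio M2 Ultra (64GB)": {
        "memory_gb": 64.0, "bandwidth_gb_s": 800.0, "price_usd": 3999.0,
    },
    "NVIDIA RTX Pro 6000 Blackwell (96GB)": {
        "memory_gb": 96.0, "bandwidth_gb_s": 1792.0, "price_usd": 8500.0,
    },
}


def usd_per_gb(spec: dict) -> float:
    """Approximate dollars paid per GB of memory (hypothetical helper)."""
    return spec["price_usd"] / spec["memory_gb"]


mac = specs["Mac Studio M2 Ultra (64GB)"]
nvidia = specs["NVIDIA RTX Pro 6000 Blackwell (96GB)"]

# How many times more memory bandwidth the NVIDIA card has.
bandwidth_ratio = nvidia["bandwidth_gb_s"] / mac["bandwidth_gb_s"]

for name, spec in specs.items():
    print(f"{name}: ${usd_per_gb(spec):.2f} per GB")
print(f"Bandwidth ratio (NVIDIA / Mac): {bandwidth_ratio:.2f}x")
```

On these numbers the Mac Studio costs less per GB of memory, while the NVIDIA card offers 2.24 times the bandwidth.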

Both run 16 of the 18 tracked models, but the NVIDIA RTX Pro 6000 Blackwell (96GB) has more memory (96.0 GB vs 64.0 GB), so it can hold larger models, or the same model at a higher-quality quant. It also has more memory bandwidth (1792 GB/s vs 800 GB/s), so it generates tokens faster for the same model and quant.
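
The link between these specs and what a machine can run can be sketched with two common rules of thumb: weights take roughly parameters × bits ÷ 8 bytes, and token generation cannot go faster than memory bandwidth divided by the bytes read per token. This is a rough illustration, not the method the site uses for its figures. The function names, the 4 GB `overhead_gb` allowance, and the dense 31-billion-parameter example are all assumptions, and the ceiling ignores compute limits, KV-cache reads, and runtime overhead, so real speeds are lower.

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float, overhead_gb: float = 4.0) -> bool:
    """True if weights plus an assumed fixed overhead fit in memory.

    overhead_gb is a placeholder for KV cache and runtime buffers,
    not a measured value.
    """
    return weight_gb(params_billion, bits_per_weight) + overhead_gb <= memory_gb


def decode_ceiling_tok_s(params_billion: float, bits_per_weight: float,
                         bandwidth_gb_s: float) -> float:
    """Upper bound on tokens/s if every token reads all weights once."""
    return bandwidth_gb_s / weight_gb(params_billion, bits_per_weight)


# Hypothetical dense 31B-parameter model at two quantization levels.
for bits in (4, 16):
    for name, mem, bw in (("Mac Studio M2 Ultra (64GB)", 64.0, 800.0),
                          ("NVIDIA RTX Pro 6000 Blackwell (96GB)", 96.0, 1792.0)):
        ok = fits(31, bits, mem)
        ceiling = decode_ceiling_tok_s(31, bits, bw)
        print(f"{bits}-bit on {name}: fits={ok}, ceiling={ceiling:.1f} tok/s")
```

Under these assumptions a 16-bit copy of a 31B model (about 62 GB of weights) fits in 96 GB but not in 64 GB, and at any given quant the bandwidth ceiling scales with the 1792 to 800 GB/s ratio.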
