can-i-run-this-llm

GPU comparison

MacBook Pro M4 Pro (48GB) vs NVIDIA RTX 4090 (24GB) for local LLMs

How the two stack up for running open-source LLMs locally — memory, bandwidth, price, and how many of the 18 tracked models each one can run.

MacBook Pro M4 Pro (48GB)NVIDIA RTX 4090 (24GB)
Memory48.0 GB24.0 GB
Bandwidth273 GB/s1008 GB/s
Price (approx)$2,399$1,599
LLMs it runs15 of 1815 of 18
Best model it runsGemma 4 31B · 9–13 tok/sGemma 4 31B · 33–50 tok/s

Both run the same number of models, but the MacBook Pro M4 Pro (48GB) has more memory (48.0 GB), so it can hold larger models at higher quality. The NVIDIA RTX 4090 (24GB) has more memory bandwidth (1008 GB/s), so it generates tokens faster at the same model and quant.

What the MacBook Pro M4 Pro (48GB) runs →What the NVIDIA RTX 4090 (24GB) runs →Open the calculator →