Local LLM Memory Calculator

Estimate memory requirements for running LLMs locally. Compare models, quantizations, and find what fits your hardware.

1. Select Model

Weight Quantization

KV Cache Quantization

Context Length (K tokens)

Batch Size

Available Memory (GB)

On Apple Silicon, this is your unified memory. On discrete GPU, this is your GPU VRAM.