GPU VRAM requirements calculator
The first question with any local image model is blunt: will it even run on my card? VRAM is the hard limit — run out and generation crashes outright. The answer depends on the model, the resolution, the batch size, and whether you stack a ControlNet on top. This calculator estimates the peak VRAM a configuration needs and checks it against your GPU so you know before you download an 8GB checkpoint.
How it works
Each model has a baseline weight footprint at its native resolution. The calculator scales the activation memory by pixel area — doubling resolution quadruples activations — and by batch size, since each extra image adds its own latent. A ControlNet loads a second model and adds a fixed chunk on top. The total is compared to your card’s VRAM, and the verdict tells you whether you have headroom, are running tight, or fall short and by how much.
Tips when you’re short on VRAM
- Drop resolution first. It’s the biggest lever — 768px instead of 1024px cuts activations by nearly half on the same model.
- Use fp8 / quantized weights. Flux fp8 nearly halves the requirement versus fp16 with minimal quality loss.
- Enable offloading.
--medvramand--lowvramshuttle weights to system RAM; you trade speed for the ability to run at all. - Shrink the batch. Generating one image at a time frees the most memory — queue a batch sequentially instead of in parallel when VRAM is tight.