Embedding Index Size Calculator

Calculate vector index size in memory and on disk for flat, HNSW, and IVF.

Ad placeholder (leaderboard)

Know how much RAM your vectors need before you provision it

Vector indexes are memory-hungry, and the bill scales with three numbers: how many vectors you store, how many dimensions each has, and how you store them. On top of that, the index structure itself adds overhead — a lot for HNSW, very little for flat or IVF. This calculator combines all of that into a single size estimate so you can pick an instance class or storage tier with confidence instead of finding out after an out-of-memory crash.

How it works

The raw vector storage is vector count times dimensions times bytes per element, where float32 is 4 bytes, float16 is 2, and int8 is 1. On top of that the tool adds index-specific overhead. A flat index adds essentially nothing — it just scans the raw vectors. HNSW adds a graph: roughly M × 2 links per node, each a 4-byte node ID, which is why HNSW indexes are markedly larger and why the M parameter matters. IVF adds only small centroid storage for its clusters. The totals are summed and shown in GiB or MiB along with the breakdown.

Tips and notes

These are estimates of the dominant costs; real databases add metadata, payloads, deletion tombstones, and allocator overhead, so add headroom of 20–50% on top. The biggest lever is precision: moving from float32 to int8 quarters the raw storage, and product quantization can compress further at some recall cost. HNSW gives the best recall and speed but the highest memory — lower M to save RAM, raise it for recall. IVF and flat trade query speed for a smaller footprint. Use the calculator to compare configurations side by side before committing to a deployment size.

Ad placeholder (rectangle)