Question 1

What is a vector database?

Accepted Answer

A vector database is a system built to store embeddings (lists of numbers that encode the meaning of text, images, or other data) and to quickly find the ones most similar to a query vector. Instead of matching exact values like a conventional relational database, it ranks records by distance in vector space, which is what powers semantic search and recommendations.
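As a rough illustration of ranking by distance in vector space, the sketch below scores a few toy embeddings against a query using cosine similarity. The three-dimensional vectors and the names `documents` and `most_similar` are made up for this example; real embeddings come from a model and usually have hundreds of dimensions.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical toy embeddings, invented for illustration only.
documents = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "car": [0.1, 0.9, 0.3],
}

def most_similar(query, k=2):
    """Return the k document names whose embeddings are closest to the query."""
    ranked = sorted(
        documents,
        key=lambda name: cosine_similarity(query, documents[name]),
        reverse=True,
    )
    return ranked[:k]

query = [0.85, 0.15, 0.05]
top = most_similar(query)
print(top)
```

Nothing here matches on an exact value: every record gets a score, and the answer is simply the best-scoring few.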

Question 2

How is it different from a SQL database?

Accepted Answer

A SQL database is optimised for exact matches, ranges, and joins on structured fields. A vector database is optimised for similarity: given one vector, return its k nearest neighbours out of millions. The two solve different problems, and many apps use both side by side.
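One way to picture the two working side by side is sketched below: SQLite (from the Python standard library) handles the exact filter, and a plain Python function ranks the surviving rows by distance. The `products` table, the embeddings, and the helper names are all hypothetical, and a real deployment would use an actual vector index in place of the brute-force ranking.

```python
import heapq
import math
import sqlite3

# Structured side: exact matches and ranges on ordinary columns.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, category TEXT, price REAL)")
conn.executemany(
    "INSERT INTO products VALUES (?, ?, ?)",
    [(1, "shoes", 40.0), (2, "shoes", 120.0), (3, "hats", 15.0), (4, "shoes", 60.0)],
)

# Similarity side: hypothetical 2-dimensional embeddings keyed by product id.
embeddings = {1: [0.1, 0.9], 2: [0.2, 0.8], 3: [0.9, 0.1], 4: [0.7, 0.3]}

def exact_filter(category, max_price):
    """SQL answers a yes/no question for each row."""
    rows = conn.execute(
        "SELECT id FROM products WHERE category = ? AND price <= ? ORDER BY id",
        (category, max_price),
    )
    return [row[0] for row in rows]

def nearest(query, candidate_ids, k):
    """Similarity search answers 'which rows are closest?' by Euclidean distance."""
    return heapq.nsmallest(
        k, candidate_ids, key=lambda pid: math.dist(query, embeddings[pid])
    )

candidates = exact_filter("shoes", 100.0)         # ids 1 and 4 pass the filter
result = nearest([0.15, 0.85], candidates, k=1)   # id 1 is the closest of those
print(candidates, result)
```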

Question 3

What is approximate nearest-neighbour search?

Accepted Answer

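Finding the exact closest vectors among millions is slow, because the query has to be compared with every stored vector. Vector databases therefore use approximate nearest-neighbour (ANN) algorithms. These trade a small amount of accuracy for a large gain in speed: an index such as HNSW (a layered graph) or IVF (vectors grouped into clusters) narrows the search to a small set of candidates, returning almost all of the true nearest results in milliseconds.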
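The trade-off can be sketched with a heavily simplified IVF-style index: vectors are grouped into buckets by their nearest centroid, and a query scans only the few buckets closest to it. This is an illustration under stated assumptions, not how any particular product works. The centroids are picked by random sampling instead of k-means training, the data is random 2-dimensional points, and `nprobe` is simply the name chosen here for the number of buckets scanned.

```python
import math
import random

random.seed(0)
vectors = [[random.random(), random.random()] for _ in range(1000)]

# Simplification: sample centroids from the data; real IVF indexes train them.
centroids = random.sample(vectors, 16)

def nearest_centroid(v):
    return min(range(len(centroids)), key=lambda i: math.dist(v, centroids[i]))

# Build the buckets: each vector id is filed under its nearest centroid.
buckets = {i: [] for i in range(len(centroids))}
for idx, v in enumerate(vectors):
    buckets[nearest_centroid(v)].append(idx)

def exact_search(query, k):
    """Brute force: compare the query with every stored vector."""
    return sorted(range(len(vectors)), key=lambda i: math.dist(query, vectors[i]))[:k]

def ann_search(query, k, nprobe=4):
    """Approximate: scan only the nprobe buckets whose centroids are closest."""
    probe = sorted(
        range(len(centroids)), key=lambda i: math.dist(query, centroids[i])
    )[:nprobe]
    candidates = [idx for c in probe for idx in buckets[c]]
    return sorted(candidates, key=lambda i: math.dist(query, vectors[i]))[:k]

query = [0.5, 0.5]
exact = exact_search(query, 10)
approx = ann_search(query, 10)
recall = len(set(exact) & set(approx)) / len(exact)
print(f"recall of approximate search: {recall:.2f}")
```

Raising `nprobe` scans more vectors and moves the result toward the exact answer; probing every bucket scans the whole dataset, so it returns the same neighbours as the brute-force search.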

Question 4

Do I always need a dedicated vector database?

Accepted Answer

No. For small datasets — up to roughly a hundred thousand vectors — an in-memory library or a Postgres extension like pgvector is often enough. Dedicated vector databases earn their keep at larger scale, with high query volume, frequent updates, or rich metadata filtering.
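A back-of-the-envelope calculation shows why the small case is easy. The sketch below estimates the raw storage for the vectors alone, assuming 768 dimensions and 4-byte floats; both figures are assumptions chosen for illustration, and real indexes add overhead on top.

```python
def raw_vector_bytes(num_vectors, dimensions, bytes_per_value=4):
    """Raw size of the vectors alone, ignoring index and metadata overhead."""
    return num_vectors * dimensions * bytes_per_value

small = raw_vector_bytes(100_000, 768)        # 307,200,000 bytes, about 0.3 GB
large = raw_vector_bytes(100_000_000, 768)    # 307,200,000,000 bytes, about 307 GB

print(f"100 thousand vectors: {small / 1e9:.1f} GB")
print(f"100 million vectors:  {large / 1e9:.1f} GB")
```

Under these assumptions the smaller collection fits comfortably in the memory of a single machine, while the larger one does not.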

What Is a Vector Database? How AI Finds Similar Content at Scale

A database built for similarity

Why exact search does not scale

HNSW and IVF indexes

Vector database versus SQL and full-text search

When you actually need one