Question 1

What is a vector database?

Accepted Answer

A vector database is a system designed to store embedding vectors and find the ones most similar to a query vector quickly. Instead of matching exact values, it searches by geometric closeness, which is how it powers semantic search and retrieval for AI applications.

Question 2

How does a vector database search so fast?

Accepted Answer

It uses approximate nearest-neighbour (ANN) algorithms such as HNSW or IVF that build an index over the vectors. These trade a tiny amount of accuracy for enormous speed, returning close matches across millions of vectors in milliseconds.

Question 3

What are popular vector databases?

Accepted Answer

Common options include managed services like Pinecone and Weaviate, the open-source engine Qdrant, and pgvector, an extension that adds vector search to PostgreSQL. The right choice depends on scale, hosting preference, and existing infrastructure.

Question 4

Why do RAG pipelines need a vector database?

Accepted Answer

Retrieval-augmented generation works by embedding your documents, storing them in a vector database, and at query time retrieving the most semantically relevant chunks to feed into the LLM's context. The vector database is the retrieval engine that makes this possible at scale.

Vector Database (AI Glossary)

Definition

How it works

ANN algorithms: HNSW and IVF

Major platforms

Why it matters