Question 1

What is word2vec in simple terms?

Accepted Answer

Word2vec is a technique from Google in 2013 that turns words into dense numeric vectors so that words used in similar contexts end up close together in vector space. Instead of treating words as isolated symbols, it learns their meaning from the company they keep. The result is a map where geometric distance reflects semantic similarity.

Question 2

What is the difference between skip-gram and CBOW?

Accepted Answer

Both are training objectives for word2vec. Skip-gram takes a single word and tries to predict the surrounding context words, which works well for rare words. CBOW (Continuous Bag of Words) does the reverse, predicting a target word from its surrounding context, and trains faster on frequent words. They produce similar embeddings with different trade-offs.

Question 3

What does king minus man plus woman equals queen mean?

Accepted Answer

It is the famous demonstration that word2vec vectors capture relationships as directions in space. The vector arithmetic king − man + woman lands near the vector for queen, because the model learned that the male-to-female relationship is roughly the same direction across many word pairs. It showed embeddings encode analogies, not just similarity.

Question 4

Is word2vec still used today?

Accepted Answer

It is rarely used directly in cutting-edge systems, which now favour contextual embeddings from transformers like BERT. But word2vec established the core idea of learned dense embeddings that powers modern search, recommendation, and retrieval. It is still taught as the foundational example and is sometimes used where lightweight, static embeddings are enough.

What Is Word2Vec? The Embedding That Started the AI Revolution

The core idea

Skip-gram and CBOW

Vector arithmetic and analogies

Why it mattered

Word2vec’s legacy