Question 1

What is overfitting?

Accepted Answer

Overfitting is when a model fits its training data too closely, noise and quirks included, so it performs very well on that data but poorly on new, unseen data. The model has memorised rather than generalised, and its real-world accuracy suffers as a result.

Question 2

What is the difference between overfitting and underfitting?

Accepted Answer

Underfitting is the opposite problem: the model is too simple or undertrained to capture the real patterns, so it performs poorly on both training and new data. An overfitted model, by contrast, does well on training data but badly on new data. The goal is the balance in between, where the model has learned the genuine signal without the noise.
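
As a rough illustration of the two failure modes, here is a minimal sketch using a k-nearest-neighbour regressor written in plain Python. Everything in it is an assumption made for the example: the noisy sine data, the fixed seed, and the helper names `noisy_sine`, `knn_predict` and `mse`. With k = 1 the model simply memorises the training set; with k equal to the training set size it predicts the training mean everywhere, which is about as underfitted as a model can get.

```python
import math
import random

rng = random.Random(0)  # fixed seed so the sketch is repeatable


def noisy_sine(x):
    # Assumed ground truth for this example: sin(x) plus Gaussian noise.
    return math.sin(x) + rng.gauss(0.0, 0.3)


# Training points on a regular grid; test points sit halfway between them.
train = [(0.2 * i, noisy_sine(0.2 * i)) for i in range(30)]
test = [(0.2 * i + 0.1, noisy_sine(0.2 * i + 0.1)) for i in range(30)]


def knn_predict(x, k):
    # Average the targets of the k training points closest to x.
    nearest = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    return sum(y for _, y in nearest) / k


def mse(data, k):
    # Mean squared error of the k-NN predictor on a dataset.
    return sum((knn_predict(x, k) - y) ** 2 for x, y in data) / len(data)


# k=1 memorises, k=len(train) predicts the global mean, k=5 sits in between.
results = {k: (mse(train, k), mse(test, k)) for k in (1, 5, len(train))}
for k, (train_err, test_err) in results.items():
    print(f"k={k:2d}  train MSE={train_err:.3f}  test MSE={test_err:.3f}")
```

With k = 1 the training error is exactly zero, because each training point is its own nearest neighbour, while the test error is not. With k = 30 the error is high on both sets. An intermediate k would typically fall between the two, though the exact numbers depend on the seed and noise level chosen here.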

Question 3

How do I detect overfitting?

Accepted Answer

Track training loss and validation loss together during training. While both fall, the model is learning. When training loss keeps dropping but validation loss flattens and then rises, the widening gap between them signals overfitting: the model is improving on data it has seen at the expense of data it has not.
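
The check described above can be sketched as a small function over two per-epoch loss histories. The function name `overfitting_onset`, the `patience` parameter and the loss values are all hypothetical, made up purely for illustration; they are not measurements from a real training run.

```python
def overfitting_onset(train_loss, val_loss, patience=2):
    """Return the epoch with the best validation loss if the curves then
    diverge (validation stops improving for `patience` epochs while
    training loss keeps falling); return None if no divergence is seen."""
    if len(train_loss) != len(val_loss):
        raise ValueError("loss histories must have the same length")
    best_epoch = 0
    for epoch in range(1, len(val_loss)):
        if val_loss[epoch] < val_loss[best_epoch]:
            best_epoch = epoch  # validation is still improving
        elif (epoch - best_epoch >= patience
              and train_loss[epoch] < train_loss[best_epoch]):
            return best_epoch  # train keeps dropping, validation does not
    return None


# Illustrative, made-up loss curves (epoch 0 first).
train_hist = [1.00, 0.70, 0.50, 0.40, 0.30, 0.20]
val_hist = [1.10, 0.80, 0.65, 0.70, 0.80, 0.90]

onset = overfitting_onset(train_hist, val_hist)
print("validation loss was best at epoch:", onset)
```

The `patience` threshold is a design choice: validation loss is usually noisy, so reacting to a single bad epoch would raise false alarms. How large it should be depends on the task and is not something this sketch can decide.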

Question 4

How do I prevent overfitting?

Accepted Answer

Common techniques include collecting more, or more varied, training data; regularisation such as weight decay (penalising large weights) and dropout (randomly disabling units during training); early stopping (halting when validation loss starts to rise); and reducing model complexity. Data augmentation also helps, and cross-validation gives a more reliable estimate of how well the model generalises.
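
Two of the techniques above, weight decay and early stopping, can be sketched together in a small gradient-descent loop. This is a toy example under stated assumptions rather than a recommended implementation: the degree-5 polynomial model, the synthetic data, and the names `make_data`, `fit`, `weight_decay` and `patience` are all invented for illustration, and the hyperparameter values are arbitrary.

```python
import math
import random

rng = random.Random(0)  # fixed seed so the sketch is repeatable


def make_data(n):
    # Assumed toy task: y = sin(3x) + noise, with x drawn from [-1, 1].
    data = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        data.append((x, math.sin(3.0 * x) + rng.gauss(0.0, 0.3)))
    return data


train_data, val_data = make_data(12), make_data(12)


def features(x, degree):
    return [x ** p for p in range(degree + 1)]


def predict(w, x):
    return sum(wi * fi for wi, fi in zip(w, features(x, len(w) - 1)))


def mse(w, data):
    return sum((predict(w, x) - y) ** 2 for x, y in data) / len(data)


def fit(weight_decay=0.01, lr=0.05, max_epochs=500, patience=20, degree=5):
    w = [0.0] * (degree + 1)
    best_w, best_val, best_epoch = list(w), mse(w, val_data), 0
    for epoch in range(1, max_epochs + 1):
        # Gradient of the training MSE with respect to each weight.
        grad = [0.0] * len(w)
        for x, y in train_data:
            err = predict(w, x) - y
            for j, fj in enumerate(features(x, degree)):
                grad[j] += 2.0 * err * fj / len(train_data)
        # Weight decay: add the gradient of weight_decay * sum(w_j ** 2).
        # For simplicity the bias term is penalised as well.
        w = [wi - lr * (g + 2.0 * weight_decay * wi) for wi, g in zip(w, grad)]
        # Early stopping: remember the best validation loss and stop once
        # it has not improved for `patience` consecutive epochs.
        val = mse(w, val_data)
        if val < best_val:
            best_w, best_val, best_epoch = list(w), val, epoch
        elif epoch - best_epoch >= patience:
            break
    return best_w, best_val, best_epoch


best_w, best_val, best_epoch = fit()
print(f"best epoch: {best_epoch}, validation MSE: {best_val:.3f}")
```

The function returns the weights from the best validation epoch, not the last one, so the epochs spent waiting out `patience` do not degrade the final model. Calling `fit(weight_decay=0.0)` gives an unregularised run to compare against.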

Overfitting (AI Glossary)

What overfitting is

The opposite failure: underfitting

How to detect it

How to prevent it

Why it matters for LLMs and fine-tuning