Question 1

What is fine-tuning in simple terms?

Accepted Answer

Fine-tuning takes a model that was already pre-trained on huge amounts of general text and continues training it on a smaller, focused dataset of examples. This nudges the model's weights so it gets better at a specific task, tone, or format than the general-purpose base model.

Question 2

What is supervised fine-tuning (SFT)?

Accepted Answer

Supervised fine-tuning trains the model on labelled input-output pairs — for example, a customer message paired with the ideal reply. The model learns to reproduce the desired outputs, which is the most common and straightforward form of fine-tuning for shaping behaviour.

Question 3

How much data do I need to fine-tune?

Accepted Answer

It depends on the goal, but useful fine-tuning often starts in the hundreds to low thousands of high-quality, consistent examples rather than millions. Data quality and consistency matter far more than raw volume — a few hundred clean, on-target examples usually beat tens of thousands of noisy ones.

Question 4

What is LoRA and why is it popular?

Accepted Answer

LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning method that trains only a small set of added weights instead of the whole model. It cuts the memory and compute needed dramatically, makes fine-tuning affordable on modest hardware, and produces small, swappable adapter files rather than a full new model copy.

What Is Fine-Tuning an AI Model?

Fine-tuning, defined

Supervised fine-tuning and the data it needs

The cost: compute and parameter-efficient methods

When fine-tuning is the right tool