Question 1

What is fine-tuning?

Accepted Answer

Fine-tuning means continuing to train a pre-trained model on a smaller, task-specific dataset so that its weights adapt to your domain, style, or output format. The model retains most of its general knowledge while specialising in your task.
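
As a toy illustration of "continuing to train", the sketch below starts from an existing weight and runs a few more gradient-descent steps on a small new dataset. The one-parameter model `y = w * x`, the function name `fine_tune`, the learning rate, and the data points are all assumptions made for illustration; real fine-tuning applies the same idea to millions or billions of weights.

```python
def fine_tune(w, data, lr=0.01, epochs=50):
    """Continue gradient descent on the model y = w * x, starting from an existing weight w."""
    for _ in range(epochs):
        for x, y in data:
            # Gradient of the squared error (w * x - y) ** 2 with respect to w
            grad = 2 * (w * x - y) * x
            w -= lr * grad
    return w


# Hypothetical "pre-trained" weight, learned earlier on general data
pretrained_w = 2.0

# Small task-specific dataset that follows y = 2.5 * x (made-up values)
task_data = [(1.0, 2.5), (2.0, 5.0), (3.0, 7.5)]

tuned_w = fine_tune(pretrained_w, task_data)
print(f"pre-trained w = {pretrained_w}, fine-tuned w = {tuned_w:.3f}")
```

Training starts from the pre-trained value rather than from a random one, which is why a small dataset and a few passes are enough to move the weight to where the new task needs it.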

Question 2

What is the difference between fine-tuning and prompting?

Accepted Answer

Prompting changes only the input you send; the model's weights stay fixed. Fine-tuning updates the weights themselves, so the desired behaviour is built into the model and no longer depends on long, example-laden prompts. Once the model is trained, shorter prompts mean fewer input tokens per call, which can lower per-call cost.
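
The cost trade-off can be sketched as a break-even calculation: a one-off training cost set against the tokens saved on every call. The function name `break_even_calls` and all figures below are hypothetical, and the sketch assumes the fine-tuned model is billed at the same per-token price as the base model, which may not hold for a given provider.

```python
import math


def break_even_calls(training_cost, prompt_tokens_saved, price_per_token):
    """Number of calls after which the token savings cover the one-off training cost."""
    saving_per_call = prompt_tokens_saved * price_per_token
    if saving_per_call <= 0:
        raise ValueError("no break-even point unless each call saves money")
    return math.ceil(training_cost / saving_per_call)


# Hypothetical figures: 50.00 to train, 1,500 fewer prompt tokens per call,
# 0.000002 per input token (same currency throughout)
calls = break_even_calls(50.0, 1500, 0.000002)
print(f"fine-tuning pays for itself after about {calls} calls")
```

If the fine-tuned model is priced higher per token, the saving per call shrinks and the break-even point moves out accordingly.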

Question 3

What are LoRA and QLoRA?

Accepted Answer

LoRA (Low-Rank Adaptation) freezes the original weights and trains a small set of added low-rank matrices instead of the whole model, typically cutting the number of trainable parameters by more than 99%. QLoRA combines LoRA with 4-bit quantization of the frozen base model, so large models can be fine-tuned on a single consumer GPU.
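
To make the parameter saving concrete, here is a minimal pure-Python sketch of a single LoRA-adapted layer. It assumes the standard formulation in which a frozen weight matrix W (d_out x d_in) is augmented with trainable matrices B (d_out x r) and A (r x d_in), scaled by alpha / r. The layer size 4096 x 4096, the rank 8, and the helper names are illustrative choices, not taken from any particular model or library.

```python
def lora_param_counts(d_in, d_out, rank):
    """Return (full, lora): trainable parameters for full fine-tuning vs. a LoRA adapter."""
    full = d_in * d_out            # every entry of W is trained
    lora = rank * (d_in + d_out)   # only A (rank x d_in) and B (d_out x rank) are trained
    return full, lora


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def lora_forward(W, A, B, x, alpha, rank):
    """Compute W x + (alpha / rank) * B (A x); W stays frozen, A and B are trained."""
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]


# Parameter saving for one illustrative 4096 x 4096 projection at rank 8
full, lora = lora_param_counts(4096, 4096, 8)
reduction = 1 - lora / full
print(f"full: {full:,}  LoRA: {lora:,}  reduction: {reduction:.2%}")

# Tiny forward pass: B starts at zero, so the adapter leaves the output unchanged
W = [[1, 0], [0, 1]]   # frozen 2 x 2 weight
A = [[1, 1]]           # rank-1 down-projection (1 x 2)
B = [[0], [0]]         # rank-1 up-projection (2 x 1), zero-initialised
y = lora_forward(W, A, B, [3, 4], alpha=2, rank=1)
```

The count above covers one matrix only; across a whole model the saving depends on which layers receive adapters and on the chosen rank. Initialising B to zero means training starts from exactly the pre-trained behaviour.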

Question 4

When is fine-tuning worth it?

Accepted Answer

Fine-tune when you need consistent format or tone at scale, when prompts would be too long or costly, or when retrieval can't supply the needed behaviour. For knowledge that changes often, retrieval-augmented generation is usually a better fit.

Fine-Tuning (AI Glossary)
