Question 1

What is a diffusion model?

Accepted Answer

A diffusion model is a generative model that learns to create data by reversing a noise-adding process. It is trained to predict and remove noise, so at generation time it can start from pure random noise and gradually denoise it into a coherent image, audio clip, or video.

Question 2

What are the forward and reverse processes?

Accepted Answer

The forward process is a fixed Markov chain that slowly adds small amounts of Gaussian noise to real data over many steps until it becomes pure noise. The reverse process is what the model learns: a step-by-step denoising that starts from noise and recovers a sample from the data distribution.

Question 3

How are diffusion models different from GANs?

Accepted Answer

GANs generate samples in a single forward pass through a generator trained against a discriminator, which can be unstable. Diffusion models generate iteratively over many denoising steps and are trained with a stable regression-style objective, which usually gives higher sample quality and diversity at the cost of slower sampling.

Question 4

What are diffusion models used for?

Accepted Answer

They power leading image generators such as Stable Diffusion and DALL-E, as well as audio and video synthesis tools. Their iterative denoising approach produces high-fidelity, diverse outputs and can be conditioned on text prompts, sketches, or other inputs.

Diffusion Model (AI Glossary)

Definition

The forward (noise-adding) process

The reverse (denoising) process

Why the step-by-step approach works

Why it matters