Question 1

What is an autoencoder in simple terms?

Accepted Answer

An autoencoder is a neural network trained to copy its input to its output, but forced to pass the data through a narrow bottleneck in the middle. Because the bottleneck has fewer dimensions than the input, the network must learn an efficient compressed representation that keeps only the most important information. It learns this entirely from the data, without any labels.

Question 2

What is the bottleneck or latent space?

Accepted Answer

The bottleneck, also called the latent space or code, is the small middle layer of an autoencoder where the compressed representation lives. Each input is squeezed into this low-dimensional vector by the encoder, and the decoder reconstructs the original from it. The latent space captures the data's underlying structure in far fewer numbers than the raw input.

Question 3

How is a variational autoencoder (VAE) different?

Accepted Answer

A standard autoencoder maps each input to a single point in latent space, which is good for compression but not for generating new samples. A VAE instead maps each input to a probability distribution and adds a regularising term so the latent space is smooth and continuous. You can then sample from that space to generate entirely new, realistic outputs.

Question 4

Where are autoencoders used in modern AI?

Accepted Answer

Autoencoders are used for dimensionality reduction, anomaly detection, denoising, and as the compressor inside generative systems. Stable Diffusion and other latent diffusion models use a VAE to shrink images into a compact latent space, run the diffusion process there cheaply, and then decode back to full-resolution images.

What Is an Autoencoder? Compression and Generation in Neural Networks

The core idea

Encoder, bottleneck, decoder

What the latent space captures

Variational autoencoders and generation

Why this matters for modern image AI