Question 1

What is a convolutional neural network in simple terms?

Accepted Answer

A convolutional neural network is a deep learning model designed for grid-like data such as images. It uses small filters that slide across the input to detect local patterns like edges and textures, then stacks layers to combine those patterns into shapes and objects. This structure makes it extremely effective at recognising what is in an image.

Question 2

What does a convolutional layer actually do?

Accepted Answer

A convolutional layer applies a set of small learnable filters across the image, computing a response at each position to produce a feature map highlighting where a pattern appears. Because the same filter is reused everywhere, the layer detects a pattern regardless of its location, which is efficient and gives the network translation invariance.

Question 3

What is pooling and why is it used?

Accepted Answer

Pooling downsamples a feature map, for example by keeping the maximum value in each small region. It shrinks the spatial size so deeper layers see a wider context with less computation, and it makes the representation more robust to small shifts and distortions in the input. Max pooling and average pooling are the most common forms.

Question 4

What is a receptive field?

Accepted Answer

A receptive field is the region of the original input that influences a particular feature in a deeper layer. Early layers have small receptive fields and see local detail; as you stack convolution and pooling, the receptive field grows so later neurons respond to large, complex patterns spanning much of the image. This expanding view is how a CNN builds from edges to whole objects.

What Is a CNN? Convolutional Neural Networks Explained

What a CNN is

Convolutional layers and filters

Pooling, receptive fields, and the feature hierarchy

Landmark architectures

Where CNNs stand today