Question 1

What is T5 in simple terms?

Accepted Answer

T5, the Text-to-Text Transfer Transformer, is a Google model from 2019 that reframes every language task as converting input text into output text. Translation, classification, summarisation, and question answering all use the exact same model, format, and training procedure, distinguished only by a short instruction prefix. This unified view simplified how a single model handles many tasks.

Question 2

What does text-to-text mean?

Accepted Answer

It means both the input and the output are always strings of text, no matter the task. To classify sentiment, T5 reads a prefixed sentence and outputs the word positive or negative; to translate, it reads English text and outputs French text. Casting everything as text-in, text-out lets one architecture and one loss function cover every task.

Question 3

How is T5 different from BERT?

Accepted Answer

BERT is an encoder-only model that produces representations for understanding tasks and cannot generate free-form text on its own. T5 is a full encoder-decoder transformer, so it both understands the input and generates a text output. That makes T5 naturally suited to generative tasks like summarisation and translation as well as classification.

Question 4

What is the C4 dataset?

Accepted Answer

C4, the Colossal Clean Crawled Corpus, is the large web-text dataset Google created to pre-train T5. It is a heavily filtered version of Common Crawl with boilerplate, duplicate, and low-quality text removed. Pre-training on this clean, massive corpus gave T5 broad language ability before fine-tuning on specific tasks.

What Is T5? Text-to-Text Transfer Transformer Explained

The core idea

The text-to-text framing

Encoder-decoder architecture

Span-corruption pre-training and C4

Why T5 mattered