Question 1

How much did it cost to train GPT-4?

Accepted Answer

OpenAI has not published an official figure. Credible outside estimates put the compute cost of GPT-4's final training run at roughly 50 to 100 million US dollars, and possibly more. That figure covers the final run only. It excludes research, failed experiments, data, salaries, and ongoing inference, which together can dwarf the single-run cost.

Question 2

What drives the cost of training a language model?

Accepted Answer

Cost is dominated by compute: the number of floating-point operations (FLOPs) needed, which scales with model size (parameters) times the amount of training data (tokens). That compute is rented or owned as GPU-hours, so total cost is roughly the required FLOPs divided by the throughput each GPU actually sustains, multiplied by the price per GPU-hour, plus energy and engineering overhead. Larger models, more data, and higher GPU prices push the cost up; better hardware efficiency pulls it down.
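The arithmetic can be sketched in a few lines of Python. This is a rough estimate under stated assumptions, not a pricing tool: it uses the common rule of thumb that training takes about 6 FLOPs per parameter per token, and the function names, the 40% utilization figure, and the 2 dollars per GPU-hour price are illustrative placeholders rather than numbers from any vendor.

```python
def training_flops(params: float, tokens: float) -> float:
    """Approximate total training compute.

    Assumes the widely used rule of thumb of ~6 FLOPs per parameter per
    training token (forward plus backward pass). Real runs deviate from this.
    """
    return 6.0 * params * tokens


def training_cost_usd(
    params: float,
    tokens: float,
    peak_flops_per_gpu: float,
    utilization: float,
    price_per_gpu_hour: float,
) -> float:
    """Estimate the compute cost of one training run in US dollars.

    utilization is the fraction of peak throughput actually sustained
    (an assumption; it varies widely with hardware and software stack).
    Energy and engineering overhead are not included.
    """
    effective_flops_per_second = peak_flops_per_gpu * utilization
    gpu_seconds = training_flops(params, tokens) / effective_flops_per_second
    gpu_hours = gpu_seconds / 3600.0
    return gpu_hours * price_per_gpu_hour


if __name__ == "__main__":
    # Hypothetical run: 7B parameters, 140B tokens, a GPU with 312 TFLOP/s
    # peak throughput, 40% sustained utilization, 2 USD per GPU-hour.
    cost = training_cost_usd(
        params=7e9,
        tokens=1.4e11,
        peak_flops_per_gpu=3.12e14,
        utilization=0.4,
        price_per_gpu_hour=2.0,
    )
    print(f"Estimated compute cost: ${cost:,.0f}")
```

Note that utilization sits in the denominator: halving it doubles the GPU-hours, which is why hardware efficiency matters as much as the hourly price.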

Question 3

What is the Chinchilla rule for training cost?

Accepted Answer

DeepMind's 2022 Chinchilla research found that the large models of the time were bigger than their compute budgets justified and were trained on too little data. For a fixed compute budget, the compute-optimal recipe scales parameters and training tokens together, at roughly 20 tokens per parameter. Following it gets the best performance per dollar of compute, rather than just building the biggest possible model.
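To make the 20-tokens-per-parameter heuristic concrete, here is a minimal sketch that splits a fixed compute budget into a parameter count and a token count. It assumes the approximation that training compute is about 6 FLOPs per parameter per token, which is a separate rule of thumb and not part of the answer above. The function and constant names are hypothetical.

```python
import math

# Both constants are rules of thumb, not exact laws.
TOKENS_PER_PARAM = 20.0        # Chinchilla-style heuristic from the text
FLOPS_PER_PARAM_TOKEN = 6.0    # assumed training cost per parameter per token


def compute_optimal_split(compute_budget_flops: float) -> tuple[float, float]:
    """Return (params, tokens) for a given training compute budget.

    With C = 6 * N * D and D = 20 * N, we get C = 120 * N**2,
    so N = sqrt(C / 120) and D = 20 * N.
    """
    params = math.sqrt(
        compute_budget_flops / (FLOPS_PER_PARAM_TOKEN * TOKENS_PER_PARAM)
    )
    tokens = TOKENS_PER_PARAM * params
    return params, tokens


if __name__ == "__main__":
    # A budget of about 5.88e23 FLOPs lands close to Chinchilla's own
    # configuration of roughly 70B parameters and 1.4T tokens.
    n, d = compute_optimal_split(5.88e23)
    print(f"params ~ {n:.3g}, tokens ~ {d:.3g}")
```

Because both parameters and tokens grow with the square root of the budget under these assumptions, quadrupling the compute roughly doubles each of them.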

Question 4

Can you train an LLM cheaply?

Accepted Answer

Training a frontier model from scratch costs tens of millions of dollars, but smaller open-source models can be trained for anywhere from thousands of dollars to the low millions, and fine-tuning an existing open model can cost only hundreds of dollars. Most builders never train from scratch. They fine-tune or simply call an API, which avoids the huge upfront compute bill entirely.

How Much Does It Cost to Train an LLM?

What you are actually paying for

The FLOPs formula

The Chinchilla compute-optimal insight

Real-world cost estimates

Why most builders never pay it