Question 1

What is LLaMA in simple terms?

Accepted Answer

LLaMA (Large Language Model Meta AI) is a family of language models released by Meta starting in 2023, with the model weights made available to researchers and later more openly. It showed that relatively small models trained on far more data could rival much larger ones. By putting capable weights in the open, it triggered an explosion of community fine-tunes and tooling.

Question 2

How is LLaMA's architecture different from the original transformer?

Accepted Answer

LLaMA is a decoder-only transformer with a few refinements: it uses RMSNorm instead of standard layer normalisation, the SwiGLU activation in its feed-forward layers, and rotary positional embeddings (RoPE) instead of absolute position encodings. These changes improve training stability and efficiency while keeping the core attention mechanism intact.

Question 3

Why was LLaMA important for open-source AI?

Accepted Answer

Before LLaMA, the most capable models were locked behind paid APIs. LLaMA gave the research and developer community direct access to strong model weights, enabling fine-tuning, quantisation, and on-device deployment. This sparked a wave of derivative models, training techniques, and local-inference tools that pushed open-weight AI forward dramatically.

Question 4

Is LLaMA fully open source?

Accepted Answer

Not in the strict sense. The weights are openly available but released under a custom community licence with usage conditions rather than a standard open-source software licence. In practice it is open enough to fine-tune, run locally, and build products on, which is why it became the foundation for so much of the open-weight ecosystem.

What Is LLaMA? Meta's Open-Source Language Model Explained

The core idea

A refined decoder-only transformer

Compute-optimal training

The ecosystem it unleashed

Licensing and legacy