Question 1

What is a system prompt and why does it matter?

Accepted Answer

A system prompt is the instruction set given to the model before any user message. It establishes the assistant's persona, capabilities, constraints, and safety rules for the whole conversation. Because it frames every response, a well-designed system prompt is the highest-leverage place to control behaviour consistently.

Question 2

How long should a system prompt be?

Accepted Answer

As long as it needs to be and no longer. Cover persona, scope, output format, and safety clearly, but every extra token costs money on every turn and can dilute the model's focus. Favour crisp, ordered instructions over rambling prose, and cut anything the model already does reliably without being told.

Question 3

How do I stop the model from doing things outside its job?

Accepted Answer

Scope capabilities explicitly — state what the assistant does and, importantly, what it does not. Add a fallback for out-of-scope requests, such as politely declining and redirecting. Without explicit scoping, models tend to attempt anything asked, which leads to off-brand or unsafe behaviour.

Question 4

Where do safety instructions go?

Accepted Answer

In the system prompt, stated clearly and ideally near the top so they frame everything that follows. Define disallowed content, the refusal style, and how to handle attempts to override the rules. Treat the system prompt as a security boundary and test it against adversarial inputs, since it is a primary defence against misuse.

Question 5

How do I test that a system prompt is robust?

Accepted Answer

Build a test set of normal, edge-case, and adversarial inputs — including prompt-injection and jailbreak attempts — and check that the assistant holds its persona, scope, and safety rules across all of them. Re-run this suite whenever you change the prompt or the model so robustness does not silently regress.

System Prompt Design: A Complete Guide

What a system prompt does

The anatomy of a strong system prompt

Handling edge cases and adversarial input

Testing system prompt robustness