Question 1

What is prompt injection and why is it the top AI risk?

Accepted Answer

Prompt injection is when untrusted text — a user message, a web page, an email, a retrieved document — contains instructions that the model follows as if they came from you. It is the AI equivalent of SQL injection, and it tops the OWASP LLM risk list because models cannot reliably tell trusted instructions from untrusted data in the same text stream. The only durable defence is to never grant the model authority it could be tricked into misusing.

Question 2

How do I actually defend against prompt injection?

Accepted Answer

Treat all external content as untrusted data, keep your real instructions in the system prompt, and clearly delimit user content so the model knows where data ends. Most importantly, constrain what the model can do — validate and authorise every tool call and database action on the server against the real user's permissions, never on the model's say-so. Defence in depth wins; no single prompt trick fully stops injection.

Question 3

How do I stop the model from leaking PII or secrets?

Accepted Answer

Minimise what you send — redact or tokenise personal data before it reaches the model, and never put secrets, keys, or other users' data in the context. On output, scan for and mask PII before displaying or logging, and be careful that error messages and logs do not capture raw prompts containing personal data. The model can only leak what you let into its context, so the strongest control is upstream.

Question 4

Where should API keys live, and what if they are exposed?

Accepted Answer

Server-side only, in a secret manager or environment variables, never in client code, mobile bundles, or Git. The browser should call your backend, which holds the key; if a key ever appears in client-shipped code it is compromised the moment it deploys. Rotate keys regularly, scope them to least privilege, and have a rotation runbook ready so a leak is a quick rotation rather than a crisis.

Question 5

Why does rate limiting matter for security, not just cost?

Accepted Answer

An unthrottled AI endpoint enables denial-of-wallet attacks, where an attacker drives up your bill, and brute-force jailbreak attempts that hammer the model until something slips through. Per-user and per-IP limits, plus anomaly alerts on sudden spend or request spikes, blunt both. Rate limiting is a security control as much as a cost control, because abuse and cost are the same attack from the model's point of view.

How to Secure AI Applications Against Common Attacks

The AI attack surface is different

Prompt injection and excessive agency

Data leakage and output handling

Keys, rate limits, and abuse