Catch model bias before it reaches production
Bias rarely announces itself. A model can post excellent aggregate accuracy while quietly performing far worse for a subgroup that is under-represented in the training data — and you will not see it unless you deliberately look. This checklist gives you a systematic review to run before deployment, covering the five places bias actually hides: the data, the labels, the metrics, the feedback loops, and the documentation.
It is built to produce evidence, not just a green tick. The exported record becomes part of your model card or fairness report, demonstrating that you examined the right questions.
How a structured bias review works
A good review walks the model’s whole lifecycle rather than fixating on one fairness number:
- Training-data representativeness — does the data reflect the population the model will serve, including the tails? Under-representation is the single most common root cause.
- Protected attributes and proxies — you need group labels available to measure parity, even if you exclude them from the model. Watch for proxies that smuggle the attribute back in.
- Metric choice — demographic parity, equalised odds and calibration can all be “fair” and mutually exclusive. Pick the one that matches the harm you are trying to prevent, and justify it.
- Feedback loops — will the model’s own outputs shape future training data? If so, small biases compound. Plan for monitoring drift after launch.
- Documentation — record the protected groups considered, the metrics chosen, the thresholds, and the residual risk you accepted.
Notes and tips
- Always slice your metrics by subgroup; aggregate accuracy hides the failures that matter for fairness.
- Removing a protected attribute is usually the wrong fix — keep it for measurement and address bias at the data, objective or threshold level.
- For generative models, bias shows up as representational harm (stereotyped or skewed outputs) rather than classification error; test prompts across groups.
- This checklist documents process, not proof. Treat residual bias as a risk you consciously accept and monitor, and pair it with the AI Risk Classifier for the regulatory angle.