Bias test case generator
If your AI screens applicants, drafts recommendations, or scores requests, you want to know whether it treats people differently based on attributes that should not matter. The bias test case generator produces paired prompts — identical except for one varied demographic attribute — so you can run them through your system and look for output that changes when only that attribute changes.
How it works
You describe the task and pick which protected attributes to vary: gender, race or ethnicity, age, or nationality. For each attribute the tool generates a matched set of prompts where the attribute cycles through representative values while everything else stays fixed. You run each set against your AI and compare results. A systematic difference that tracks only the varied attribute is a signal of disparate treatment to investigate. Generation is local; nothing is uploaded.
Tips and notes
- Compare within a set. The signal is the difference across the matched prompts, not any single output.
- Aggregate over many cases. One pair is anecdotal; run many and look for consistent patterns.
- Test intersections. Bias can hide where two attributes combine even when each looks clean alone.
- A clean screen is not proof. Pair this with statistical analysis, representative data, and domain expertise.