Security log PII scrubber
Before you paste a stack trace, an auth log, or a firewall event into ChatGPT, Claude, or any cloud security tool, you have a problem: those logs are full of personal and sensitive data. IP addresses, usernames, session tokens, device identifiers — all of it is regulated, and most AI providers retain inputs for some period. This scrubber strips that data out locally so you can get AI-assisted analysis without leaking anything.
How it works
You paste raw log lines and choose which categories to redact: email addresses,
IPv4 and IPv6 addresses, MAC and device IDs, bearer and session tokens, UUIDs,
and user= style username fields. Each match is replaced with a typed
placeholder such as [IPV4_1] or [TOKEN_3]. With consistent tokenization on,
the same real value always maps to the same placeholder — so the AI can still
see that one actor appears across twelve events, without ever seeing the actual
identifier. Timestamps, status codes, and the overall line structure are left
untouched so the event sequence stays analysable.
Tips and notes
- Keep consistent tokens on for incident analysis. Correlation across events is usually the whole point — you want to track an actor without exposing them.
- Review free-text fields. Regex catches structured identifiers reliably, but a username buried in a free-text error message may slip through. Skim the output before sharing.
- Pair it with a vendor DPA review. Even scrubbed data benefits from a provider that contractually agrees not to train on your inputs. Redaction plus a clean data processing agreement is the safe combination.