Open Graph and Twitter Card meta tags control how your pages appear when shared, but they also publish structured data into the page source where any scraper can read it. It is easy to leak a contact email, a phone number, a staff name, or an internal asset path into these tags without noticing. This auditor parses the tags you paste and flags the patterns most likely to over-expose content.
How it works
The tool parses your pasted HTML with the browser’s own parser and collects every <meta> element whose property or name begins with og:, article:, profile:, book:, or twitter:. It then runs each tag’s value through a set of privacy heuristics:
- Direct contact data —
og:email,og:phone_number, and any value matching an email or phone pattern. - Location data —
og:latitude,og:longitude,place:location:*, and similar geolocation properties. - Personal names —
article:author,profile:first_name,profile:last_name,og:profile, and author-style values. - Infrastructure leakage —
og:image,og:url, andog:videoURLs whose path looks like an internal upload path, contains a numeric user ID, exposes a cloud bucket host, or uses a non-public CDN pattern.
Each match is reported with the offending tag and a one-line reason. The check is heuristic by design: it surfaces candidates for a human to confirm rather than asserting a definite breach.
Example
Given:
<meta property="og:email" content="[email protected]" />
<meta property="og:image" content="https://internal-cdn.example.s3.amazonaws.com/users/4821/avatar.png" />
The auditor flags the first tag for publishing a direct contact address into the page source, and the second for an image URL that exposes a raw S3 bucket host and a numeric user ID in the path, which a scraper could enumerate.
Tips and notes
- Move contact details into a server-side form or an obfuscated, rate-limited endpoint rather than a public meta tag.
- Serve social preview images from a neutral public path or a dedicated image proxy so the URL reveals nothing about your storage.
- Re-run the audit on staging just before launch — preview tags are often added late and rarely reviewed.
- All parsing is local; the tool never fetches or uploads anything.