Homoglyph Detector

Highlight Unicode chars that look like ASCII but aren't

Ad placeholder (leaderboard)

Homoglyphs are characters from scripts such as Cyrillic or Greek that render almost identically to ordinary ASCII letters. Attackers exploit them to forge domains and usernames that look genuine but point somewhere else. This tool scans your text, highlights every confusable character, and tells you the real code point and which ASCII letter it imitates.

How it works

The detector walks through your text one character at a time. For each character it checks whether the code point is outside the ASCII range and appears in a curated confusables table:

for each char:
  if codePoint > 0x7F and char is in CONFUSABLES table:
    flag it -> report code point + the ASCII letter it imitates

The table is built from the most frequently abused entries in the Unicode confusables data: Cyrillic letters like а, е, о, р, с; Greek letters like ο and ν; full-width Latin forms; and a handful of mathematical look-alikes. Flagged characters are highlighted in the rendered output and listed with their position.

Notes and example

Paste a string like the Cyrillic-spoofed раypal.com and the leading р and а light up as confusables for ASCII p and a. This is exactly the trick behind IDN homograph phishing. Because Unicode defines thousands of confusables and this tool focuses on the common ones, treat a clean result as reassuring but not absolute, and always verify suspicious links by other means.

Ad placeholder (rectangle)