Unicode character inspector
When text behaves unexpectedly — a string length is off by one, a regex misses an accented letter, or an invisible character breaks a copy-paste — the cause is usually hidden at the code-point level. This inspector splits any text into its individual Unicode code points and shows the U+ hex value, decimal value and category of each, making stray and invisible characters immediately visible.
How it works
The tool iterates your text by code point (using JavaScript’s
surrogate-pair-aware string iteration), so characters above U+FFFF are handled
correctly. For each character it reads codePointAt(0), formats it as
U+XXXX hexadecimal, and classifies it with Unicode property escapes:
\p{L} letter, \p{N} number, \p{P} punctuation, \p{S} symbol, \p{M} mark,
\p{Z} separator, with explicit labels for space, tab, line feed, carriage
return and control characters. The summary contrasts the code-point count with
the raw UTF-16 unit count so you can see where a string’s length will surprise
you.
Example
Inspecting Café — 日本語 😀 gives 9 code points but more UTF-16 units, because
the emoji 😀 is a single code point (U+1F600) stored as two UTF-16 units. The
é shows as U+00E9 (a Letter), the em dash — as U+2014 (Punctuation), each
CJK character as a Letter in the U+65E5 range, and the trailing space as a labelled
␠ (space).
| Character | Code point | Category |
|---|---|---|
| C | U+0043 | Letter |
| é | U+00E9 | Letter |
| — | U+2014 | Punctuation |
| 😀 | U+1F600 | Symbol |
Everything runs in your browser — your text never leaves your device.