Unicode Character Inspector

See the code point, hex, and category of every character.

Ad placeholder (leaderboard)
Enjoying the tools? Go Pro for £4.99 (one-time) and remove all ads — forever, on this device. Remove ads — £4.99

Unicode character inspector

When text behaves unexpectedly — a string length is off by one, a regex misses an accented letter, or an invisible character breaks a copy-paste — the cause is usually hidden at the code-point level. This inspector splits any text into its individual Unicode code points and shows the U+ hex value, decimal value and category of each, making stray and invisible characters immediately visible.

How it works

The tool iterates your text by code point (using JavaScript’s surrogate-pair-aware string iteration), so characters above U+FFFF are handled correctly. For each character it reads codePointAt(0), formats it as U+XXXX hexadecimal, and classifies it with Unicode property escapes: \p{L} letter, \p{N} number, \p{P} punctuation, \p{S} symbol, \p{M} mark, \p{Z} separator, with explicit labels for space, tab, line feed, carriage return and control characters. The summary contrasts the code-point count with the raw UTF-16 unit count so you can see where a string’s length will surprise you.

Example

Inspecting Café — 日本語 😀 gives 9 code points but more UTF-16 units, because the emoji 😀 is a single code point (U+1F600) stored as two UTF-16 units. The é shows as U+00E9 (a Letter), the em dash as U+2014 (Punctuation), each CJK character as a Letter in the U+65E5 range, and the trailing space as a labelled ␠ (space).

CharacterCode pointCategory
CU+0043Letter
éU+00E9Letter
U+2014Punctuation
😀U+1F600Symbol

Everything runs in your browser — your text never leaves your device.

Ad placeholder (rectangle)