Windows-1252 is the encoding behind countless mojibake bugs. This tool dumps the exact CP1252 byte for every character so you can see precisely how text like smart quotes and the euro sign is stored — and why it breaks when misread.
How it works
Windows-1252 is a single-byte encoding. For most of its range it matches ISO-8859-1 exactly, so the mapping is trivial:
0x00–0x7F : same as ASCII
0xA0–0xFF : same as ISO-8859-1
0x80–0x9F : CP1252-specific printable characters
The crucial part is that 0x80–0x9F range. Where Latin-1 reserves those bytes
for control codes, CP1252 reuses them for €, –, —, “, ”, ™ and other
common symbols. The tool maps each character through this table; characters with
no CP1252 byte are shown with a dash.
Tips and notes
- The euro sign
€is byte0x80in CP1252 but does not exist in Latin-1 — a classic source of broken imports. - Curly quotes (
“ ” ‘ ’) and the em dash (—) live in the special range, so text from Microsoft Word often needs CP1252-aware handling. - A dash in the dump tells you the character is outside CP1252 entirely and would need UTF-8 to represent.