Common Unicode blocks reference
Unicode organises its code points into named blocks — contiguous ranges reserved for a related family of characters, such as Basic Latin, Cyrillic, Mathematical Operators or Emoticons. This searchable reference lists the blocks developers and writers meet most often, each with its hexadecimal code-point range and a short note on what it contains, so you can identify where a character lives or which range to use when validating or filtering text.
How it works
Every Unicode character has a single code point, written with a U+ prefix
followed by its value in hexadecimal — for example U+0041 is the letter A. Each
block is defined by a fixed start and end code point. To find a character’s block,
locate the range that contains its code point. The table below is filtered live as
you type: it matches on the block name, the range text (so searching U+2200
finds Mathematical Operators), or a content keyword such as arrows or emoji.
Example
The grinning-face emoji 😀 has the code point U+1F600. Scanning the ranges,
it falls between U+1F600 and U+1F64F, so it belongs to the Emoticons
block. The euro sign €, code point U+20AC, sits inside U+20A0–U+20CF, the
Currency Symbols block.
| Block | Range | Contents |
|---|---|---|
| Basic Latin (ASCII) | U+0000–U+007F | English letters, digits, punctuation |
| Cyrillic | U+0400–U+04FF | Russian and Slavic scripts |
| Mathematical Operators | U+2200–U+22FF | ∑, ∫, ≤, √ and more |
| CJK Unified Ideographs | U+4E00–U+9FFF | Chinese / Japanese / Korean Han |
| Emoticons | U+1F600–U+1F64F | 😀 😂 😍 face emoji |
This reference covers the most-used blocks rather than all several hundred in the standard, and runs entirely client-side — no text or data is sent anywhere.