Unicode Blocks Reference

Common Unicode blocks with their code-point ranges and contents.

Ad placeholder (leaderboard)
Enjoying the tools? Go Pro for £4.99 (one-time) and remove all ads — forever, on this device. Remove ads — £4.99

Common Unicode blocks reference

Unicode organises its code points into named blocks — contiguous ranges reserved for a related family of characters, such as Basic Latin, Cyrillic, Mathematical Operators or Emoticons. This searchable reference lists the blocks developers and writers meet most often, each with its hexadecimal code-point range and a short note on what it contains, so you can identify where a character lives or which range to use when validating or filtering text.

How it works

Every Unicode character has a single code point, written with a U+ prefix followed by its value in hexadecimal — for example U+0041 is the letter A. Each block is defined by a fixed start and end code point. To find a character’s block, locate the range that contains its code point. The table below is filtered live as you type: it matches on the block name, the range text (so searching U+2200 finds Mathematical Operators), or a content keyword such as arrows or emoji.

Example

The grinning-face emoji 😀 has the code point U+1F600. Scanning the ranges, it falls between U+1F600 and U+1F64F, so it belongs to the Emoticons block. The euro sign €, code point U+20AC, sits inside U+20A0U+20CF, the Currency Symbols block.

BlockRangeContents
Basic Latin (ASCII)U+0000–U+007FEnglish letters, digits, punctuation
CyrillicU+0400–U+04FFRussian and Slavic scripts
Mathematical OperatorsU+2200–U+22FF∑, ∫, ≤, √ and more
CJK Unified IdeographsU+4E00–U+9FFFChinese / Japanese / Korean Han
EmoticonsU+1F600–U+1F64F😀 😂 😍 face emoji

This reference covers the most-used blocks rather than all several hundred in the standard, and runs entirely client-side — no text or data is sent anywhere.

Ad placeholder (rectangle)