From 540 to 214: How the Kangxi Radicals Became the Backbone of CJK Lexicography
An Indexing Algorithm That Has Survived 310 Years
In 100 CE, Xu Shen faced an indexing problem: how to organize 9,353 logographic characters for retrieval with no alphabetical order. His Shuowen Jiezi decomposed characters into 540 recurring elements (bu 部). It worked, but the index was overfit -- many radicals classified only a handful of entries.
The Reduction Path
| Year | Dictionary | Radicals | Characters | Innovation |
|---|---|---|---|---|
| 100 CE | Shuowen Jiezi (Xu Shen) | 540 | 9,353 | First radical-based index |
| 1615 | Zihui (Mei Yingzuo) | 214 | 33,179 | Radical-and-stroke sorting |
| 1716 | Kangxi Zidian (Zhang Yushu et al.) | 214 | 47,035 | Imperial standardization |
| 1999 | Unicode 3.0 | 214 | 27,484+ | Digital encoding (U+2F00--U+2FD5) |
| 2009 | PRC Standard GF 0011 | 201 | -- | Simplified-character variant |
Mei Yingzuo's Zihui (1615) applied two principles: eliminate radicals with trivially small character sets, and merge positional variants of the same root. Then sort by residual stroke count. The Kangxi Dictionary (1716, 47,035 characters, twelve volumes) adopted Mei's system -- the imperial imprimatur made it canonical.
Frontispiece of an 1827 reprint of the *Kangxi Zidian (康熙字典). The 1716 original ran to twelve volumes and indexed 47,035 characters under Mei Yingzuo's 214-radical scheme. Source: Wikimedia Commons.*
A Wildly Unbalanced Tree
The top ten radicals alone account for ~23% of all 47,035 entries:
| Rank | # | Radical | Meaning | Kangxi Entries |
|---|---|---|---|---|
| 1 | 140 | 艸 (艹) | grass | 1,902 |
| 2 | 85 | 水 (氵) | water | 1,595 |
| 3 | 75 | 木 | tree | 1,369 |
| 4 | 64 | 手 (扌) | hand | 1,203 |
| 5 | 30 | 口 | mouth | 1,146 |
| 6 | 61 | 心 (忄) | heart | 1,115 |
| 7 | 142 | 虫 | insect | 1,067 |
| 8 | 118 | 竹 | bamboo | 953 |
| 9 | 149 | 言 | speech | 861 |
| 10 | 120 | 糸 | silk | 823 |
Bottom: radical 138 (艮, stopping) holds 5 entries. A 380:1 ratio. Yet worst-case lookup is still O(log n) once the radical is identified, and identification is constant-time for a trained reader.
![]()
The full set of 214 Kangxi radicals rendered in old-style fonts that imitate the original Kangxi Zidian shapes, ordered by stroke count (1 stroke at left, 17 at right). Source: Wikimedia Commons.
Competing Indexing Systems
| System | Year | Keys | Prerequisite | Best For |
|---|---|---|---|---|
| Kangxi radicals | 1716 | 214 + strokes | Memorize 214 + variants | Semantic browsing, print dictionaries |
| Four-corner (Wang Yunwu) | 1926 | 4--5 digit code | 10 stroke-shape rules | Telegraphers, numeric indexing |
| SKIP (Jack Halpern) | 1990 | 4 patterns + strokes | None (geometric division) | Foreign learners |
| Spahn-Hadamitzky | 1996 | 79 radicals | Memorize 79 | Learner dictionaries |
| Cangjie input | 1976 | 24 forms, 5 keys | Decomposition rules | Fast digital input |
SKIP classifies kanji into four geometric patterns -- left-right, up-down, enclosure, solid -- then counts strokes. No radical memorization required. Trade-off: no semantic content. The Kangxi system endures because it functions as both index and ontology.
The Unicode Compromise
The 214 radicals occupy U+2F00--U+2FD5, separate from CJK Unified Ideographs where the same glyphs appear as characters. U+2F00 (⼀) = "Kangxi Radical One" (metadata); U+4E00 (一) = the character "one" (content). Same glyph, different semantics. A supplementary block (U+2E80--U+2EFF) encodes positional variants. The architecture treats radicals as a 300-year-old indexing layer preserved alongside the data it indexes.
Browse our complete radical index to explore all 214.
References
- Xu Shen, Shuowen Jiezi (100 CE)
- Mei Yingzuo, Zihui (1615)
- Kharlamova, E. (2021). "Unification of the Chinese Radicals." SSRN.
- Halpern, J. (1990). SKIP. kanji.org.
- Unicode 17.0.0, Chapter 18: East Asian Scripts.