From 540 to 214: How the Kangxi Radicals Became the Backbone of CJK Lexicography

hbaristr Bacaan 3 minit

An Indexing Algorithm That Has Survived 310 Years

In 100 CE, Xu Shen faced an indexing problem: how to organize 9,353 logographic characters for retrieval with no alphabetical order. His Shuowen Jiezi decomposed characters into 540 recurring elements (bu 部). It worked, but the index was overfit -- many radicals classified only a handful of entries.

The Reduction Path

Year Dictionary Radicals Characters Innovation
100 CE Shuowen Jiezi (Xu Shen) 540 9,353 First radical-based index
1615 Zihui (Mei Yingzuo) 214 33,179 Radical-and-stroke sorting
1716 Kangxi Zidian (Zhang Yushu et al.) 214 47,035 Imperial standardization
1999 Unicode 3.0 214 27,484+ Digital encoding (U+2F00--U+2FD5)
2009 PRC Standard GF 0011 201 -- Simplified-character variant

Mei Yingzuo's Zihui (1615) applied two principles: eliminate radicals with trivially small character sets, and merge positional variants of the same root. Then sort by residual stroke count. The Kangxi Dictionary (1716, 47,035 characters, twelve volumes) adopted Mei's system -- the imperial imprimatur made it canonical.

Frontispiece of an 1827 reprint of the Kangxi Dictionary (康熙字典)
Frontispiece of an 1827 reprint of the *Kangxi Zidian (康熙字典). The 1716 original ran to twelve volumes and indexed 47,035 characters under Mei Yingzuo's 214-radical scheme. Source: Wikimedia Commons.*

A Wildly Unbalanced Tree

The top ten radicals alone account for ~23% of all 47,035 entries:

Rank # Radical Meaning Kangxi Entries
1 140 艸 (艹) grass 1,902
2 85 水 (氵) water 1,595
3 75 tree 1,369
4 64 手 (扌) hand 1,203
5 30 mouth 1,146
6 61 心 (忄) heart 1,115
7 142 insect 1,067
8 118 bamboo 953
9 149 speech 861
10 120 silk 823

Bottom: radical 138 (艮, stopping) holds 5 entries. A 380:1 ratio. Yet worst-case lookup is still O(log n) once the radical is identified, and identification is constant-time for a trained reader.

Chart showing all 214 Kangxi radicals in old-style fonts, ordered by stroke count
The full set of 214 Kangxi radicals rendered in old-style fonts that imitate the original Kangxi Zidian shapes, ordered by stroke count (1 stroke at left, 17 at right). Source: Wikimedia Commons.

Competing Indexing Systems

System Year Keys Prerequisite Best For
Kangxi radicals 1716 214 + strokes Memorize 214 + variants Semantic browsing, print dictionaries
Four-corner (Wang Yunwu) 1926 4--5 digit code 10 stroke-shape rules Telegraphers, numeric indexing
SKIP (Jack Halpern) 1990 4 patterns + strokes None (geometric division) Foreign learners
Spahn-Hadamitzky 1996 79 radicals Memorize 79 Learner dictionaries
Cangjie input 1976 24 forms, 5 keys Decomposition rules Fast digital input

SKIP classifies kanji into four geometric patterns -- left-right, up-down, enclosure, solid -- then counts strokes. No radical memorization required. Trade-off: no semantic content. The Kangxi system endures because it functions as both index and ontology.

The Unicode Compromise

The 214 radicals occupy U+2F00--U+2FD5, separate from CJK Unified Ideographs where the same glyphs appear as characters. U+2F00 (⼀) = "Kangxi Radical One" (metadata); U+4E00 (一) = the character "one" (content). Same glyph, different semantics. A supplementary block (U+2E80--U+2EFF) encodes positional variants. The architecture treats radicals as a 300-year-old indexing layer preserved alongside the data it indexes.

Browse our complete radical index to explore all 214.

References

  • Xu Shen, Shuowen Jiezi (100 CE)
  • Mei Yingzuo, Zihui (1615)
  • Kharlamova, E. (2021). "Unification of the Chinese Radicals." SSRN.
  • Halpern, J. (1990). SKIP. kanji.org.
  • Unicode 17.0.0, Chapter 18: East Asian Scripts.

Send feedback

Optional — only if you'd like a reply.