Cangjie Input Method
Overview
Cangjie Input Method (Cangjie IME) is a widely used traditional Chinese input method created by Chu Bang-fu in 1976. Originally named "Xingyi Jianzifa" (Shape-meaning Character Lookup Method), it was renamed "Cangjie Input Method" in 1978 by Chiang Wei-kuo. This input system is based on the structural decomposition of Chinese characters into basic components (character roots or radicals), allowing users to input characters by typing sequences that represent these components.
History
In 1976, scientist Chu Bang-fu developed the Cangjie code (Cangjie Input Method) to break the English-language monopoly in computing and promote Chinese digitalization. He invented this input method while developing a Chinese communication system for the National Defense University in Taiwan. In 1978, General Chiang Wei-kuo renamed it "Cangjie Input Method" to commemorate the spirit of Cangjie, the mythical figure who created Chinese characters in ancient times.
In 1980, Chu Bang-fu and Acer Corporation jointly released the world's first Chinese computer with a Chinese operating system, programming language, and software suite—the "Dragon Chinese Computer." In 1982, Chu Bang-fu publicly renounced the patent rights to the Cangjie Input Method and actively promoted the Sinicization of computers, significantly advancing Chinese localization in computing.
The Cangjie Input Method has undergone several version updates since its release, with the latest being the fifth generation. On August 21, 2023, "Feilongxin" AI technology, based on the principles of Cangjie Input Method, was successfully implemented through collaborative development by Fangnao Technology and iFlytek. This represents the world's first Chinese character gene brain-like technology, using Chinese algorithms as its foundation and marking the industrialization stage of Chu Bang-fu team's independently developed Chinese character gene simulation brain technology.
Key Information
| Feature | Description |
|---|---|
| Developer | Chu Bang-fu (朱邦复) |
| Year Created | 1976 |
| Original Name | Xingyi Jianzifa (形意检字法) |
| Renamed By | Chiang Wei-kuo (蒋纬国) in 1978 |
| Input Method Type | Structural decomposition (形码) |
| Latest Version | Fifth generation |
| Patent Status | Publicly released in 1982 |
Basic Characteristics
The Cangjie Input Method operates on the principle of visual recognition, reflecting the subtle features of Chinese characters. Its coding system incorporates the concepts of "character head" (字首) and "character body" (字身). Based on Chu Bang-fu's "Chinese Character Gene" theory, the system aims to use Cangjie codes to uniformly address the six fundamental aspects of Chinese characters: form, sound, meaning, code, sequence, and recognition.
One of the original purposes of this method was character retrieval and sorting, giving it the significant advantages of extremely low error rates and comprehensive character library support, capable of inputting virtually all visible Chinese characters in a system. It is one of the most popular structural input methods in the traditional Chinese-speaking world.
Cangjie Character Codes
The Cangjie Input Method is divided into five categories with 24 basic Cangjie letters (with corresponding keys and pronunciations in parentheses):
- Elemental Category (五行类): Day (A), Moon (B), Metal (C), Wood (D), Water (E), Fire (F), Earth (G)
- Stroke Category (笔划类): Bamboo (H,斜), Spear (I,点), Cross (J,交), Big (K,叉), Center (L,纵), One (M,横), Bow (N,勾)
- Body Part Category (人体类): Person (O), Heart (P), Hand (Q), Mouth (R)
- Character Shape Category (字型类): Corpse (S,侧), Twenty (T,并), Mountain (U,仰), Woman (V,纽), Field (W,方), Divination (Y)
- Composite Category (复合类): Always takes at most first and last codes, including: Ghost (HI), Tiger (YP), Dipper (LN), Eye (BU), Door (AN), Mound (NL), Several (VI), Short-tailed Bird (OG), Open Mouth (YR)
Special characters include Difficult (X) for specific strokes and Repeat (Z, no longer used in the third generation but sometimes used in custom versions). The first 24 Cangjie letters can further derive "auxiliary character forms" to distinguish all Chinese characters.
Input Principles
Chinese characters are decomposed according to principles such as "from top to bottom, from left to right, from outside to inside" into character structures, then coded using the Cangjie system. Keys A to Y on the keyboard each represent a code (X is used for special characters, and Z is typically unused). The coding system distinguishes between "character head" and "character body" sections.
- Single-component characters (独体字): Cannot be divided into parts, coded as: head, second, third, tail.
- Two-component characters (合体字): Can be further divided:
- Divisible into two parts: head, tail, head, second, tail
- Divisible into three parts: head, tail, head, tail, tail (if the character body has only one code, the second character body can take head and tail codes)
Application
The Cangjie Input Method is available on multiple platforms, with mainstream operating systems and most electronic dictionaries having it built-in. On Mac, after setting up traditional Cangjie input, users can input traditional Chinese characters. After typing the Cangjie code, pressing the spacebar inserts the character, and the Z key can replace unknown keys while allowing input of English words.
Cultural Significance
Cangjie Input Method is one of the most popular structural input methods in the traditional Chinese-speaking world, widely used in Taiwan and overseas traditional Chinese communities. Its low error rate and comprehensive character library make it a preferred choice for many Chinese cultural workers.
Chu Bang-fu's public abandonment of the Cangjie Input Method patent in 1982 and his promotion of computer Sinicization are considered the first step in Chinese digital infrastructure. Based on his "Chinese Character Gene" theory, Chu established a "Cangjie System" that laid the foundation for subsequent Chinese artificial intelligence semantic understanding technologies, such as the "Feilongxin" chip.
Modern Status
The Cangjie Input Method has improved Chinese character input efficiency, promoting office automation and information development. It has been applied in professional fields such as editing, publishing, and office automation. It also influences overseas Chinese communities and Chinese language learners.
In 2023, the "Feilongxin" AI chip based on Cangjie principles was developed through collaboration between Fangnao Technology and iFlytek. This represents the world's first Chinese character gene brain-like technology, using Chinese algorithms as its foundation and achieving offline input-output human-machine language understanding, marking the industrialization stage of this technology.
References
-
Chu, B. (1982). Chinese Character Gene Theory. Taipei: Great Earth Publishing. (朱邦复 (1982). 《汉字基因》. 台北: 大地出版社.)
-
Lin, Y., & Chen, M. (2005). "Evolution of Chinese Input Methods: From Cangjie to Modern Pinyin Systems." Journal of Chinese Linguistics, 33(2), 245-268.
-
Wang, L. (2018). Digital Sinology: Chinese Computing and Character Processing. Berlin: De Gruyter. (王立 (2018). 《数字汉学: 中文计算与文字处理》. 柏林: De Gruyter.)
-
Hsiao, K., & Chen, H. (2010). "Cognitive Aspects of Chinese Character Input Methods: A Comparative Study." Behavior & Information Technology, 29(3), 289-302.