Chinese input methods for computers

From Wikipedia, the free encyclopedia

Chinese characters
Precursors
Traditional Chinese
Variant characters
Simplified Chinese
Simplified Chinese (2nd-round)
Traditional/Simplified (debate)
Kanji
Hanja
Hán tự
East Asian calligraphy
Input methods

In textual form, the Chinese language uses ideograms [1], as individual characters or in combinations, they form ideas and words to express meaning. Early Chinese computers used keyboards with thousands of keys.[citation needed] The number of keys on a conventional international computer keyboard is insufficient for all the Chinese characters.

Contents

[edit] Overview

A variety of keyboard input method editors (IME) have been designed to allow the input of Chinese characters using standard keyboards.

Keyboard input methods can be classified in three main types:

  • by encoding
  • by pronunciation
  • by structure of the characters.

The following are just some samples of Chinese input methods. Many of those input methods have variations. Full Pinyin and Double Pinyin are variations of the pinyin input method. In addition, the methods which require the user to select a character from a menu generally have sophisticated methods for guessing which characters the user intends based on context.

Each standard has its strengths and weaknesses and usage varies between people. For example, for someone who is already familiar with pinyin, the pinyin method can be learned instantly, and most Mandarin speakers master pronunciation through pinyin. However, the maximum typing rate is limited, and learning the system can be difficult for those not familiar with pinyin. Wubi takes much effort to learn, but expert typists can enter text much faster with it than with the phonetic methods. Because of these factors, there is very little likelihood of a "standard" method evolving.

An experimental Chinese keyboard with many keys.
An experimental Chinese keyboard with many keys.

Other means of entering Chinese characters include handwriting recognition, OCR and voice recognition. The first and last of these usually must be “trained” before use; that is, the user must first use the system in a special “learning mode” so that the system can get accustomed to the user’s handwriting or speech pattern, and failure to do so will increase the recognition error rate. These, especially the latter two, are less frequently used than keyboard-based input methods and, as with even the most advanced English language systems, suffer from relatively high error rates, especially when used without proper “training”, though the error rates are an acceptable trade-off to many users.

Particular mention should be given to handwriting recognition systems: Although mostly associated with stylus- and tablet-based systems, it is in fact one of the built-in input methods in Windows XP systems. Handwriting recognition systems for Chinese are generally far more reliable.

Finally, some systems allow Chinese characters to be entered by typing their equivalent English words.

[edit] Pronunciation

[edit] Character structure

[edit] Combination of pronunciation and character structure

  • Tze-loi method (子來; 子来)
  • Renzhi code method (認知碼; 认知码)

[edit] See also

[edit] External links

Information
Tutorials
Tools

[edit] Notes

  1. ^ "ideogram" in The New Oxford Dictionary of English, Oxford University Press 1998.