CEDICT

From Wikipedia, the free encyclopedia

The CEDICT project was started by Paul Denisowski in 1997 with the aim to provide a complete Chinese to English dictionary with pronunciation in pinyin for the Chinese characters.

Contents

[edit] Content

CEDICT is merely a text file; other programs are needed to search and display it. This project is considered a standard Chinese-English reference on the Internet and is used by several other Chinese-English projects. The Unihan Database uses CEDICT data for most of its information about character compounds, but this is auxiliary and is explicitly not a part of the main Unicode database[1]. CEDICT is not used for Unihan's definitions and pronunciations of individual characters.

The basic format of a CEDICT entry is:

Traditional Simplified [pin1 yin1] /English equivalent 1/equivalent 2/
中國 中国 [Zhong1 guo2] /China/Middle Kingdom/

CEDICT is now primarily encoded in UTF-8, but compatibility versions are available in GB2312 and Big5. The compatibility versions omit either the Traditional or the Simplified characters respectively.

Features:

  • Traditional and Simplified Chinese
  • Pinyin (several pronunciations)
  • English (several)

As of November 2007, it has 55.618 Chinese entries[2].

[edit] History

Year Event
1991 EDICT Japanese dictionary project was started by Jim Breen.
1997 CEDICT project started by Paul Denisowski, on the model of EDICT.
1999 CEDICT ownership transferred to Erik Peterson of http://www.mandarintools.com/cedict.html.
2007 MDBG started a new project called CC-CEDICT which continues the CEDICT project with a new license: Creative Commons Attribution-Share Alike 3.0 License, allowing more projects to use it. Additionally a work flow has been set up to streamline the process of submitting, reviewing and processing new entries.

[edit] Sub-projects

CEDICT has shown the way to some other projects, such HanDeDict (95.000 Chinese entries), the Chinese-German free dictionary. A CFDICT Chinese-French dictionary and a Hungrarian-Chinese dictionary project are under discussion. Some older CEDICT data is also found in the Adsotrans dictionary.

[edit] External links

[edit] See Also

Adsotrans

Languages