Adsotrans

From Wikipedia, the free encyclopedia

Adso is an open-source Chinese-to-English dictionary and natural language processing engine for Chinese text. The Adso project started in 2001. Its gist translation and dictionary interfaces are available online at the Adsotrans website[1], and its software and database can be freely downloaded from the same site[2].

Content

With over 185,000 entries, Adso is the largest open-source Chinese-English dictionary compilation on the Internet. It differs from other projects in providing part-of-speech and ontological data for word entries and in reviewing user contributions. Project data is generated collaboratively by users and drawn from related sources, including CEDICT and the Linguistic Data Consortium.

The Adso software engine provides text segmentation, hanzi-to-pinyin conversion, gist translation, annotation, gist extraction and semantic analysis services. It is heavily used as an aid for Chinese-English translation. Adso also supports a specially defined XML language for customizing software output. This has made it useful as a preprocessor for statistical machine translation software such as GIZA++ and for inverted-index search engines such as Lucene.
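To illustrate what dictionary-driven segmentation, hanzi-to-pinyin conversion and gist translation involve, the following is a minimal Python sketch of forward maximum matching, a generic textbook technique. It is not Adso's actual algorithm, which this article does not describe. The names TOY_LEXICON, segment, to_pinyin and gist are hypothetical, and the lexicon entries are made up for illustration rather than taken from the Adso database.

```python
# Toy lexicon (hypothetical, NOT from the Adso database):
# word -> (pinyin with tone numbers, English gloss)
TOY_LEXICON = {
    "我": ("wo3", "I"),
    "喜欢": ("xi3huan5", "like"),
    "学": ("xue2", "learn"),
    "学习": ("xue2xi2", "study"),
    "中": ("zhong1", "middle"),
    "中文": ("zhong1wen2", "Chinese"),
}


def segment(text, lexicon):
    """Split text into words by forward maximum matching.

    At each position the longest lexicon entry that matches is taken;
    a character with no matching entry becomes a one-character token.
    """
    max_len = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in lexicon:
                tokens.append(candidate)
                i += size
                break
    return tokens


def to_pinyin(tokens, lexicon):
    """Replace each known token with its pinyin; keep unknown tokens as-is."""
    return " ".join(lexicon[t][0] if t in lexicon else t for t in tokens)


def gist(tokens, lexicon):
    """Word-by-word gloss; unknown tokens are passed through unchanged."""
    return " ".join(lexicon[t][1] if t in lexicon else t for t in tokens)


tokens = segment("我喜欢学习中文", TOY_LEXICON)
print(" ".join(tokens))                # space-delimited words
print(to_pinyin(tokens, TOY_LEXICON))
print(gist(tokens, TOY_LEXICON))
```

Joining the tokens with spaces yields whitespace-delimited text, which is the kind of input that word-alignment tools and whitespace-based search indexers generally expect. That is the sense in which a segmenter can act as a preprocessor for them.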


References

External Links