Chinese characters description languages

From Wikipedia, the free encyclopedia

The Chinese characters description languages are several proposed languages to describe the most accurately and completelly Chineses (or CJKV) characters and their attached informations such the list of components, the list of strokes (basic and complex), their order, and the localization of each of them on a background empty square. This way is currently lead by the CDL of the Wenlin Institute, the SMCL, and the Hanglyph languages. They answer to bipmap's discriptions limits, and to the need of indexation of the ten thousands rare variants. They all aims to work for Kaishu style and Song style.

Contents

[edit] The CDL from Wenlin

Chinese character Description Language is a font technology, based on XML, co-created by Tom Bishop and Richard Cook for the Wenlin Institute, designed for describing any CJK character, but suitable for describing any glyph.

This XML based "language" actually set the stroke order and which CJK stroke is use in each character they already described.

The background is look like a square of 128px hight and large, in this background :

  1. each kind of stroke is draw in SVG (more than 50 strokes);
  2. a basic component is composed by calling several strokes. In this component, each stroke is describe by its bottom-left and top-right corner. Transformations are possible (reduction, enlargement, etc.). There are more than 1.000 basic components.
  3. a character is composed by calling several components. In this character, each component is describe by its bottom-left and top-right corner. Transformations are possible (reduction, enlargement, etc.)

Accordingly, a set of 50 strokes allow to construct a set of 1.000 components which allow to construct tens of thousands characters' descriptions. A change in the shape of one of the 50 basic stroke will be instantly visible on all other character including it.

T. Bishop and R. Cook explain this by the words :

"The stroke count of one character is generally related to the stroke counts of other characters. Most characters are built from components, and as long as the stroke counts of those components are defined, there is rarely any difficulty in adding them together to obtain the combined stroke count. Therefore, if a standard defines the strokes of a few thousand characters, it implicitly defines the strokes of many thousands of additional characters."[1]

[edit] SCML

Another XML based Chinese character Description Language.

[edit] HanGlyph

Another Chinese character Description Language.

[edit] Links

CDL language from Wenlin Institute
SCML
HanGlyph

[edit] Notes

  1. ^ Specification for CDL [Bishop & Cook], see note n⁰12, p.8. (PDF, 2003-10-31)