Combining grapheme joiner

From Wikipedia, the free encyclopedia

The combining grapheme joiner (CGJ, U+034F) is a Unicode character that has no visible glyph and is "default ignorable" by applications. Its purpose is to separate characters that should not be considered digraphs. For example, in a Hungarian language context, adjoining characters c and s would normally be considered equivalent to the cs digraph. If they are separated by the CGJ, they will be considered as two separate graphemes.

In the case of several consecutive combining diacritics, an intervening CGJ indicates that they should not be stacked but placed horizontally.

Compare to this the "zero-width non-joiner" (as it were a space mark of width zero) at U+200C in the General Punctuation range.