ISO/IEC 8859-9

ISO 8859-9, also known as Latin-5 or "Turkish", is an 8-bit character encoding, part of the ISO 8859 standard. It was originally designed to cover Turkish, for which it is of more use than the ISO 8859-3 encoding. It is identical to ISO 8859-1 except that six characters used mostly to write Icelandic are replaced by Turkish letters:

Position 0xD0 0xDD 0xDE 0xF0 0xFD 0xFE
8859-9 Ğ İ Ş ğ ı ş
8859-1 Ð Ý Þ ð ý þ
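
The six replacements can be checked mechanically. The following is a minimal sketch, assuming that Python's standard `iso8859_9` and `iso8859_1` codecs follow the two standards; the function name `differing_positions` is made up for this illustration.

```python
import unicodedata


def differing_positions():
    """Return {code value: (8859-9 character, 8859-1 character)} where the two differ."""
    diffs = {}
    for value in range(256):
        raw = bytes([value])
        latin5 = raw.decode("iso8859_9")
        latin1 = raw.decode("iso8859_1")
        if latin5 != latin1:
            diffs[value] = (latin5, latin1)
    return diffs


if __name__ == "__main__":
    for value, (latin5, latin1) in sorted(differing_positions().items()):
        # Print Unicode names so the output stays readable on any terminal.
        print(f"0x{value:02X}: {unicodedata.name(latin5)} "
              f"replaces {unicodedata.name(latin1)}")
```

Under that assumption, the loop should report exactly the six positions listed in the table above.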

ISO_8859-9:1989, more commonly known by its preferred MIME name of ISO-8859-9, is the IANA charset consisting of this standard used together with the control codes from ISO/IEC 6429 for the C0 (0x00–0x1F) and C1 (0x80–0x9F) parts. Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not to be interpreted. This charset also has the aliases iso-ir-148, ISO_8859-9, latin5, l5 and csISOLatin5.
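
As an illustration of how these names are used in practice, the sketch below asks Python's `codecs` module which codec each name resolves to. Whether a given runtime registers every alias is an assumption, so the hypothetical helper `resolve` reports unknown names as `None` instead of failing.

```python
import codecs

# Names taken from the paragraph above; support for each one depends on the runtime.
CHARSET_NAMES = [
    "ISO-8859-9", "ISO_8859-9:1989", "iso-ir-148",
    "ISO_8859-9", "latin5", "l5", "csISOLatin5",
]


def resolve(name):
    """Return the codec name that `name` resolves to, or None if it is unknown."""
    try:
        return codecs.lookup(name).name
    except LookupError:
        return None


if __name__ == "__main__":
    for name in CHARSET_NAMES:
        print(f"{name!r} -> {resolve(name)!r}")
```

Names that resolve to the same codec can be used interchangeably when decoding, for example `data.decode("latin5")`.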

ISO/IEC 8859-9
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x unused
1x unused
2x SP ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~  
8x unused
9x unused
Ax NBSP ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ SHY ® ¯
Bx ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Dx Ğ Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü İ Ş ß
Ex à á â ã ä å æ ç è é ê ë ì í î ï
Fx ğ ñ ò ó ô õ ö ÷ ø ù ú û ü ı ş ÿ

In the table above, 0x20 is the regular SPACE character and 0xA0 is the NO-BREAK SPACE. Position 0xAD is a SOFT HYPHEN, which compliant web browsers normally display only where a line is broken at that position.
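
The three special positions can be identified by decoding them and looking up their Unicode names. This is a small sketch that assumes Python's `iso8859_9` codec; the helper name `describe` is hypothetical.

```python
import unicodedata


def describe(byte_value):
    """Decode one ISO 8859-9 code value and return its code point and Unicode name."""
    char = bytes([byte_value]).decode("iso8859_9")
    return f"U+{ord(char):04X} {unicodedata.name(char)}"


if __name__ == "__main__":
    for value in (0x20, 0xA0, 0xAD):
        print(f"0x{value:02X}: {describe(value)}")
```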

Code values 0x00–0x1F, 0x7F, and 0x80–0x9F are not assigned to characters by ISO/IEC 8859-9.
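
The assigned and unassigned ranges can be expressed as a simple check. The sketch below is one possible way to flag code values that the standard itself leaves unassigned; the names `is_graphic` and `unassigned_positions` are made up for this example.

```python
def is_graphic(byte_value):
    """True if ISO/IEC 8859-9 itself assigns a character to this code value."""
    if not 0 <= byte_value <= 0xFF:
        raise ValueError("not an 8-bit code value")
    return 0x20 <= byte_value <= 0x7E or 0xA0 <= byte_value <= 0xFF


def unassigned_positions(data):
    """Return the offsets in `data` that hold code values the standard leaves unassigned."""
    return [offset for offset, value in enumerate(data) if not is_graphic(value)]


if __name__ == "__main__":
    sample = b"T\xfcrk\xe7e\n"  # seven bytes ending in a line feed (0x0A)
    print(unassigned_positions(sample))
```

Here the line feed is flagged because it falls in the 0x00–0x1F range. Under the ISO-8859-9 charset described earlier, the same value is a C0 control code from ISO/IEC 6429.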

External links