ISO/IEC 8859-9
From Wikipedia, the free encyclopedia
ISO 8859-9, also known as Latin-5 or "Turkish", is an 8-bit character encoding, part of the ISO 8859 standard. It was designed originally to cover the Turkish language, designed as being of more use than the ISO 8859-3 encoding. It is identical to ISO 8859-1 except for these six replacements of characters mostly used to write the Icelandic language:
Position | 0xD0 | 0xDD | 0xDE | 0xF0 | 0xFD | 0xFE |
---|---|---|---|---|---|---|
8859-9 | Ğ | İ | Ş | ğ | ı | ş |
8859-1 | Ð | Ý | Þ | ð | ý | þ |
ISO_8859-9:1989, more commonly known by its preferred MIME name of ISO-8859-9, is the IANA charset consisting of this standard used together with the control codes from ISO/IEC 6429 for the C0 (0x00–0x1F) and C1 (0x80–0x9F) parts. Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not to be interpreted. This charset also has the aliases iso-ir-148, ISO_8859-9, latin5, l5 and csISOLatin5.
[edit] Codepage layout
The following character set table may require cleanup to meet Wikipedia's quality standards. Please improve this table if you can. |
ISO/IEC 8859-9 | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
x0 | x1 | x2 | x3 | x4 | x5 | x6 | x7 | x8 | x9 | xA | xB | xC | xD | xE | xF | |
0x | unused | |||||||||||||||
1x | ||||||||||||||||
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | |
8x | unused | |||||||||||||||
9x | ||||||||||||||||
Ax | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
Bx | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
Cx | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
Dx | Ğ | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | İ | Ş | ß |
Ex | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
Fx | ğ | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ı | ş | ÿ |
In the table above, 20 is the regular SPACE character, and A0 is the NO-BREAK SPACE. AD is a SOFT HYPHEN, which should not appear at all in compliant web browsers.
Code values 0x00–0x1F, 0x7F, and 0x80–0x9F are not assigned to characters by ISO/IEC 8859-9.
[edit] External links
- ISO/IEC 8859-9:1999
- Standard ECMA-128: 8-Bit Single-Byte Coded Graphic Character Sets - Latin Alphabet No. 5 2nd edition (December 1999)
- ISO-IR 148 Right-Hand Part of Latin Alphabet No.5 (October 1, 1988)