ISO/IEC 8859-15

From Wikipedia, the free encyclopedia

ISO 8859-15 is part 15 of ISO 8859, a standard character encoding defined by International Organization for Standardization. It is also known as Latin-9, and unofficially as Latin-0 but not as Latin-15. It is similar to ISO 8859-1 but replaces some less common symbols with the euro sign and some other characters that were missing. It encodes characters as 8 bits and can be used to represent the alphabet and other important characters for storing English, French, German, Spanish and Portuguese (among other western European languages) texts on computers.

ISO-8859-15 (note the extra hyphen) is the IANA charset name for this standard used together with the control codes from ISO/IEC 6429 for the C0 (0x00–0x1F) and C1 (0x80–0x9F) parts. Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not to be interpreted. This charset has aliases ISO_8859-15 and Latin-9.

All the printable characters from both ISO 8859-1 and ISO 8859-15 are also found in Windows-1252.

Contents

[edit] Changes from ISO-8859-1

Position 0xA4 0xA6 0xA8 0xB4 0xB8 0xBC 0xBD 0xBE
8859-1 ¤ ¦ ¨ ´ ¸ ¼ ½ ¾
8859-15 Š š Ž ž Œ œ Ÿ

€ became necessary when the euro was introduced. The rest were excluded from ISO 8859-1, because it was motivated by information exchange and not typography. Š, š, Ž, and ž are used in some loanwords and transliteration of Russian names in Finnish and Estonian typography. Œ and œ are French ligatures, and Ÿ is needed in French all-caps text.


[edit] Coverage

ISO 8859-15 encodes what it refers to as "Latin alphabet no. 1". This character encoding is used throughout The Americas, Western Europe, Oceania, and much of Africa. It is also commonly used in most standard romanizations of East-Asian languages.

Each character is encoded as a single eight-bit code value. These code values can be used in almost any data interchange system to communicate in the following European languages:

Modern languages with complete coverage of their alphabet


Coverage of punctuation signs and apostrophes

For some languages listed above the correct typographical quotation marks are missing, for only « », " ", and ' ' are included.

Also, this encoding does not provide the correct character for the apostrophe, and oriented single high quotation marks, although some texts use the spacing grave accent and spacing acute accent which are both part of ISO 8859-1, instead of the 6-shaped/9-shaped quotations marks or apostrophes (and this works reliably with some font styles where all these characters are displayed as slanted wedge glyphs).

See also: Alphabets derived from the Latin

[edit] Complete table

Complete ISO-8859-15 map with differences from ISO-8859-1 highlighted.

ISO-8859-15
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2x SP  ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9  :  ; < = >  ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL
8x PAD HOP BPH NBH IND NEL SSA ESA HTS HTJ VTS PLD PLU RI SS2 SS3
9x DCS PU1 PU2 STS CCH MW SPA EPA SOS SGCI SCI CSI ST OSC PM APC
Ax NBSP ¡ ¢ £ ¥ Š § š © ª «  ¬ SHY ® ¯
Bx ° ± ² ³ Ž µ · ž ¹ º  » Œ œ Ÿ ¿
Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Dx Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
Ex à á â ã ä å æ ç è é ê ë ì í î ï
Fx ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
KEY
NUL C0 control code
PAD C1 control code
 ! Character that is the same in ISO 8859-1 and ISO 8859-15
Character in ISO 8859-15 that is different from the one at the same position in ISO-8859-1

[edit] See also

[edit] External links

  • ISO/IEC 8859-15:1999
  • ISO/IEC 8859-15:1998 - 8-bit single-byte coded graphic character sets, Part 15: Latin alphabet No. 9 (draft dated August 1, 1997; superseded by ISO/IEC 8859-15:1999, published March 15, 1999)
  • ISO-IR 203 European supplementary Latin set (September 16, 1998)