ISO/IEC 8859-4

From Wikipedia, the free encyclopedia

ISO 8859-4, also known as Latin-4 or "North European", is an 8-bit character encoding, part of the ISO 8859 standard. It was designed originally to cover Estonian, Latvian, Lithuanian, Greenlandic, and Sami. It has been largely superseded by ISO 8859-10 and Unicode.

ISO_8859-4:1988, more commonly known by its preferred MIME name of ISO-8859-4 is the IANA charset consisting of this standard used together with the control codes from ISO/IEC 6429 for the C0 (0x00-0x1F) and C1 (0x80-0x9F) parts. Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not to be interpreted. This character set also has the aliases iso-ir-110, ISO_8859-4, latin4, l4 and csISOLatin4.

[edit] Codepage layout

ISO/IEC 8859-4
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x unused
1x
2x SP ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~  
8x unused
9x
Ax NBSP Ą ĸ Ŗ ¤ Ĩ Ļ § ¨ Š Ē Ģ Ŧ SHY Ž ¯
Bx ° ą ˛ ŗ ´ ĩ ļ ˇ ¸ š ē ģ ŧ Ŋ ž ŋ
Cx Ā Á Â Ã Ä Å Æ Į Č É Ę Ë Ė Í Î Ī
Dx Ð Ņ Ō Ķ Ô Õ Ö × Ø Ų Ú Û Ü Ũ Ū ß
Ex ā á â ã ä å æ į č é ę ë ė í î ī
Fx đ ņ ō ķ ô õ ö ÷ ø ų ú û ü ũ ū ˙

In the table above, 20 is the regular SPACE character, and A0 is the NO-BREAK SPACE. AD is a SOFT HYPHEN, which should not appear at all in compliant web browsers.

Code values 00-1F, 7F, and 80-9F are not assigned to characters by ISO/IEC 8859-4.

[edit] External links

  • ISO/IEC 8859-4:1998
  • ISO/IEC 8859-4:1998 - 8-bit single-byte coded graphic character sets, Part 4: Latin alphabet No. 4 (draft dated February 12, 1998, published July 1, 1998)
  • Standard ECMA-94: 8-Bit Single Byte Coded Graphic Character Sets - Latin Alphabets No. 1 to No. 4 2nd edition (June 1986)
  • ISO-IR 110 Right-Hand Part of Latin Alphabet No.4 (February 1, 1986)