Windows-1252
From Wikipedia, the free encyclopedia
Windows-1252 is a character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows in English and some other Western languages. The encoding is a superset of ISO 8859-1, but differs from the IANA's ISO-8859-1 by using displayable characters rather than control characters in the 0x80 to 0x9F range. It is known to Windows by the code page number 1252, and by the IANA-approved name "windows-1252". This code page also contains all the printable characters that are in ISO 8859-15 (though some are mapped to different code points).
Many web browsers treat the MIME charset ISO-8859-1 as Windows-1252 (the extra control codes in ISO-8859-1 are forbidden in HTML anyway), and so codes from it are often seen in web pages that declare their encoding as ISO-8859-1. This is also true of e-mail programs. However, there can be difficulties from the use of such characters, particularly when the recipient is using a non-Windows system such as Linux or MacOS, which may have assigned no meaning or a different proprietary set of characters to this range.
A popular misconception is that the term "ANSI code page", which is used in the Microsoft Windows documentation, is synonymous with this code page. In fact, there exists no ANSI standard describing this code page; the closest existing ANSI standard is ANSI ISO 8859-1. Instead, the Windows documentation uses the term "ANSI code page" to refer to the system's current 8-bit GUI code page (as opposed to the OEM code page used for console apps and some other functions), because this once was fixed to ANSI ISO 8859-1 in a very early version of Windows. However, today the system's ANSI code page will be 1252 only in locale versions for Western European languages (English, Spanish, German, French, etc.).
The following table shows Windows-1252, with changes from ISO-8859-1 highlighted with a different background colour, as well as a decimal conversion in italics.
Windows-1252 (CP1252) | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
x0 | x1 | x2 | x3 | x4 | x5 | x6 | x7 | x8 | x9 | xA | xB | xC | xD | xE | xF | |
0x | NUL 0 |
SOH 1 |
STX 2 |
ETX 3 |
EOT 4 |
ENQ 5 |
ACK 6 |
BEL 7 |
BS 8 |
TAB 9 |
LF 10 |
VT 11 |
FF 12 |
CR 13 |
SO 14 |
SI 15 |
1x | DLE 16 |
DC1 17 |
DC2 18 |
DC3 19 |
DC4 20 |
NAK 21 |
SYN 22 |
ETB 23 |
CAN 24 |
EM 25 |
SUB 26 |
ESC 27 |
FS 28 |
GS 29 |
RS 30 |
US 31 |
2x | SP 32 |
! 33 |
" 34 |
# 35 |
$ 36 |
% 37 |
& 38 |
' 39 |
( 40 |
) 41 |
* 42 |
+ 43 |
, 44 |
- 45 |
. 46 |
/ 47 |
3x | 0 48 |
1 49 |
2 50 |
3 51 |
4 52 |
5 53 |
6 54 |
7 55 |
8 56 |
9 57 |
: 58 |
; 59 |
< 60 |
= 61 |
> 62 |
? 63 |
4x | @ 64 |
A 65 |
B 66 |
C 67 |
D 68 |
E 69 |
F 70 |
G 71 |
H 72 |
I 73 |
J 74 |
K 75 |
L 76 |
M 77 |
N 78 |
O 79 |
5x | P 80 |
Q 81 |
R 82 |
S 83 |
T 84 |
U 85 |
V 86 |
W 87 |
X 88 |
Y 89 |
Z 90 |
[ 91 |
\ 92 |
] 93 |
^ 94 |
_ 95 |
6x | ` 96 |
a 97 |
b 98 |
c 99 |
d 100 |
e 101 |
f 102 |
g 103 |
h 104 |
i 105 |
j 106 |
k 107 |
l 108 |
m 109 |
n 110 |
o 111 |
7x | p 112 |
q 113 |
r 114 |
s 115 |
t 116 |
u 117 |
v 118 |
w 119 |
x 120 |
y 121 |
z 122 |
{ 123 |
| 124 |
} 125 |
~ 126 |
DEL 127 |
8x | € 128 |
129 |
‚ 130 |
ƒ 131 |
„ 132 |
… 133 |
† 134 |
‡ 135 |
ˆ 136 |
‰ 137 |
Š 138 |
‹ 139 |
Œ 140 |
141 |
Ž 142 |
143 |
9x | 144 |
‘ 145 |
’ 146 |
“ 147 |
” 148 |
• 149 |
– 150 |
— 151 |
˜ 152 |
™ 153 |
š 154 |
› 155 |
œ 156 |
157 |
ž 158 |
Ÿ 159 |
Ax | NBSP 160 |
¡ 161 |
¢ 162 |
£ 163 |
¤ 164 |
¥ 165 |
¦ 166 |
§ 167 |
¨ 168 |
© 169 |
ª 170 |
« 171 |
¬ 172 |
SHY 173 |
® 174 |
¯ 175 |
Bx | ° 176 |
± 177 |
² 178 |
³ 179 |
´ 180 |
µ 181 |
¶ 182 |
· 183 |
¸ 184 |
¹ 185 |
º 186 |
» 187 |
¼ 188 |
½ 189 |
¾ 190 |
¿ 191 |
Cx | À 192 |
Á 193 |
 194 |
à 195 |
Ä 196 |
Å 197 |
Æ 198 |
Ç 199 |
È 200 |
É 201 |
Ê 202 |
Ë 203 |
Ì 204 |
Í 205 |
Î 206 |
Ï 207 |
Dx | Ð 208 |
Ñ 209 |
Ò 210 |
Ó 211 |
Ô 212 |
Õ 213 |
Ö 214 |
× 215 |
Ø 216 |
Ù 217 |
Ú 218 |
Û 219 |
Ü 220 |
Ý 221 |
Þ 222 |
ß 223 |
Ex | à 224 |
á 225 |
â 226 |
ã 227 |
ä 228 |
å 229 |
æ 230 |
ç 231 |
è 232 |
é 233 |
ê 234 |
ë 235 |
ì 236 |
í 237 |
î 238 |
ï 239 |
Fx | ð 240 |
ñ 241 |
ò 242 |
ó 243 |
ô 244 |
õ 245 |
ö 246 |
÷ 247 |
ø 248 |
ù 249 |
ú 250 |
û 251 |
ü 252 |
ý 253 |
þ 254 |
ÿ 255 |
According to the information on Microsoft's and the Unicode Consortium's websites, positions 81, 8D, 8F, 90, and 9D are unused. However the Windows API call for converting from code pages to Unicode maps these to the corresponding C1 control codes. The euro character at position 80 was not present in earlier versions of this code page, nor were the S and Z with caron (háček).
In Windows, the characters from Windows-1252 can be inserted by holding down the Alt key and entering a zero followed by the character's three-digit decimal code on the numpad. (By omitting the zero one can also enter characters from the older code page 437 in this way.)