Windows-1252

From Wikipedia, the free encyclopedia

Windows-1252 is a character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows in English and some other Western languages. The encoding is a superset of ISO 8859-1, but differs from the IANA's ISO-8859-1 by using displayable characters rather than control characters in the 0x80 to 0x9F range. It is known to Windows by the code page number 1252, and by the IANA-approved name "windows-1252". This code page also contains all the printable characters that are in ISO 8859-15 (though some are mapped to different code points).

Many web browsers treat the MIME charset ISO-8859-1 as Windows-1252 (the extra control codes in ISO-8859-1 are forbidden in HTML anyway), and so codes from it are often seen in web pages that declare their encoding as ISO-8859-1. This is also true of e-mail programs. However, there can be difficulties from the use of such characters, particularly when the recipient is using a non-Windows system such as Linux or MacOS, which may have assigned no meaning or a different proprietary set of characters to this range.

A popular misconception is that the term "ANSI code page", which is used in the Microsoft Windows documentation, is synonymous with this code page. In fact, there exists no ANSI standard describing this code page; the closest existing ANSI standard is ANSI ISO 8859-1. Instead, the Windows documentation uses the term "ANSI code page" to refer to the system's current 8-bit GUI code page (as opposed to the OEM code page used for console apps and some other functions), because this once was fixed to ANSI ISO 8859-1 in a very early version of Windows. However, today the system's ANSI code page will be 1252 only in locale versions for Western European languages (English, Spanish, German, French, etc.).

The following table shows Windows-1252, with changes from ISO-8859-1 highlighted with a different background colour, as well as a decimal conversion in italics.

Windows-1252 (CP1252)
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x NUL
0
SOH
1
STX
2
ETX
3
EOT
4
ENQ
5
ACK
6
BEL
7
BS
8
TAB
9
LF
10
VT
11
FF
12
CR
13
SO
14
SI
15
1x DLE
16
DC1
17
DC2
18
DC3
19
DC4
20
NAK
21
SYN
22
ETB
23
CAN
24
EM
25
SUB
26
ESC
27
FS
28
GS
29
RS
30
US
31
2x SP
32
 !
33
"
34
#
35
$
36
%
37
&
38
'
39
(
40
)
41
*
42
+
43
,
44
-
45
.
46
/
47
3x 0
48
1
49
2
50
3
51
4
52
5
53
6
54
7
55
8
56
9
57
 :
58
 ;
59
<
60
=
61
>
62
 ?
63
4x @
64
A
65
B
66
C
67
D
68
E
69
F
70
G
71
H
72
I
73
J
74
K
75
L
76
M
77
N
78
O
79
5x P
80
Q
81
R
82
S
83
T
84
U
85
V
86
W
87
X
88
Y
89
Z
90
[
91
\
92
]
93
^
94
_
95
6x `
96
a
97
b
98
c
99
d
100
e
101
f
102
g
103
h
104
i
105
j
106
k
107
l
108
m
109
n
110
o
111
7x p
112
q
113
r
114
s
115
t
116
u
117
v
118
w
119
x
120
y
121
z
122
{
123
|
124
}
125
~
126
DEL
127
8x
128
 
129

130
ƒ
131

132

133

134

135
ˆ
136

137
Š
138

139
Œ
140
 
141
Ž
142
 
143
9x  
144

145

146

147

148

149

150

151
˜
152

153
š
154

155
œ
156
 
157
ž
158
Ÿ
159
Ax NBSP
160
¡
161
¢
162
£
163
¤
164
¥
165
¦
166
§
167
¨
168
©
169
ª
170
«
171
¬
172
SHY
173
®
174
¯
175
Bx °
176
±
177
²
178
³
179
´
180
µ
181

182
·
183
¸
184
¹
185
º
186
»
187
¼
188
½
189
¾
190
¿
191
Cx À
192
Á
193
Â
194
Ã
195
Ä
196
Å
197
Æ
198
Ç
199
È
200
É
201
Ê
202
Ë
203
Ì
204
Í
205
Î
206
Ï
207
Dx Ð
208
Ñ
209
Ò
210
Ó
211
Ô
212
Õ
213
Ö
214
×
215
Ø
216
Ù
217
Ú
218
Û
219
Ü
220
Ý
221
Þ
222
ß
223
Ex à
224
á
225
â
226
ã
227
ä
228
å
229
æ
230
ç
231
è
232
é
233
ê
234
ë
235
ì
236
í
237
î
238
ï
239
Fx ð
240
ñ
241
ò
242
ó
243
ô
244
õ
245
ö
246
÷
247
ø
248
ù
249
ú
250
û
251
ü
252
ý
253
þ
254
ÿ
255

According to the information on Microsoft's and the Unicode Consortium's websites, positions 81, 8D, 8F, 90, and 9D are unused. However the Windows API call for converting from code pages to Unicode maps these to the corresponding C1 control codes. The euro character at position 80 was not present in earlier versions of this code page, nor were the S and Z with caron (háček).

In Windows, the characters from Windows-1252 can be inserted by holding down the Alt key and entering a zero followed by the character's three-digit decimal code on the numpad. (By omitting the zero one can also enter characters from the older code page 437 in this way.)

[edit] See also

[edit] External links

In other languages