# Codepage-1252 Codepage-1252 in [[MS-DOS]] or Windows-1252 in [[Windows]] is a superset of [[#ISO 8859-1]] "Latin-1". The major difference is that Latin-1 reserves 128-159 for control characters, while CP-1252 uses most of that space for additional printable characters. CP-1252 is an 8-bit extension of 7-bit [[ASCII]]. | Name | BInary | Oct | Dec | Hex | Glyph | HTML Name | | :-------------------------------------------- | -------- | --- | --- | --- | ----- | ---------- | | Euro sign | 10000000 | 200 | 128 | 80 | € | `&euro;` | | *Unused* | 10000001 | 201 | 129 | 81 | | | | Single low-9 quotation mark | 10000010 | 202 | 130 | 82 | ‚ | `&sbquo;` | | Latin small letter f with hook | 10000011 | 203 | 131 | 83 | ƒ | `&fnof;` | | Double low-9 quotation mark | 10000100 | 204 | 132 | 84 | „ | `&bdquo;` | | Horizontal ellipsis | 10000101 | 205 | 133 | 85 | … | `&hellip;` | | Dagger | 10000110 | 206 | 134 | 86 | † | `&dagger;` | | Double dagger | 10000111 | 207 | 135 | 87 | ‡ | `&Dagger;` | | Modifier letter circumflex accent | 10001000 | 210 | 136 | 88 | ˆ | `&circ;` | | Per mille sign | 10001001 | 211 | 137 | 89 | ‰ | `&permil;` | | Latin capital letter S with caron | 10001010 | 212 | 138 | 8A | Š | `&Scaron;` | | Single left-pointing angle quotation | 10001011 | 213 | 139 | 8B | ‹ | `&lsaquo;` | | Latin capital ligature OE | 10001100 | 214 | 140 | 8C | Œ | `&OElig;` | | *Unused* | 10001101 | 215 | 141 | 8D | | | | Latin capital letter Z with caron | 10001110 | 216 | 142 | 8E | Ž | `&Zcaron;` | | *Unused* | 10001111 | 217 | 143 | 8F | | | | *Unused* | 10010000 | 220 | 144 | 90 | | | | Left single quotation mark | 10010001 | 221 | 145 | 91 | ‘ | `&lsquo;` | | Right single quotation mark | 10010010 | 222 | 146 | 92 | ’ | `&rsquo;` | | Left double quotation mark | 10010011 | 223 | 147 | 93 | “ | `&ldquo;` | | Right double quotation mark | 10010100 | 224 | 148 | 94 | ” | `&rdquo;` | | Bullet | 10010101 | 225 | 149 | 95 | • | `&bull;` | | En dash | 10010110 | 226 | 150 | 96 | – | `&ndash;` | | Em dash | 10010111 | 227 | 151 | 97 | — | `&mdash;` | | Small tilde | 10011000 | 230 | 152 | 98 | ˜ | `&tilde;` | | Trade mark sign | 10011001 | 231 | 153 | 99 | ™ | `&trade;` | | Latin small letter S with caron | 10011010 | 232 | 154 | 9A | š | `&scaron;` | | Single right-pointing<br>angle quotation mark | 10011011 | 233 | 155 | 9B | › | `&rsaquo;` | | Latin small ligature oe | 10011100 | 234 | 156 | 9C | œ | `&oelig;` | | *Unused* | 10011101 | 235 | 157 | 9D | | | | Latin small letter z with caron | 10011110 | 236 | 158 | 9E | ž | `&zcaron;` | | Latin capital letter Y with diaeresis | 10011111 | 237 | 159 | 9F | Ÿ | `&Yuml;` | | Non-breaking space | 10100000 | 240 | 160 | A0 | NBSP | `&nbsp;` | | Inverted exclamation mark | 10100001 | 241 | 161 | A1 | ¡ | `&iexcl;` | | Cent sign | 10100010 | 242 | 162 | A2 | ¢ | `&cent;` | | Pound sign | 10100011 | 243 | 163 | A3 | £ | `&pound;` | | Currency sign | 10100100 | 244 | 164 | A4 | ¤ | `&curren;` | | Yen sign | 10100101 | 245 | 165 | A5 | ¥ | `&yen;` | | Pipe, broken vertical bar | 10100110 | 246 | 166 | A6 | ¦ | `&brvbar;` | | Section sign | 10100111 | 247 | 167 | A7 | § | `&sect;` | | Spacing diaeresis - umlaut | 10101000 | 250 | 168 | A8 | ¨ | `&uml;` | | Copyright sign | 10101001 | 251 | 169 | A9 | © | `&copy;` | | Feminine ordinal indicator | 10101010 | 252 | 170 | AA | ª | `&ordf;` | | Left double angle quotes | 10101011 | 253 | 171 | AB | « | `&laquo;` | | Negation | 10101100 | 254 | 172 | AC | ¬ | `&not;` | | Soft hyphen | 10101101 | 255 | 173 | AD | ­SHY | `&shy;` | | Registered trade mark sign | 10101110 | 256 | 174 | AE | ® | `&reg;` | | Spacing macron - overline | 10101111 | 257 | 175 | AF | ¯ | `&macr;` | | Degree sign | 10110000 | 260 | 176 | B0 | ° | `&deg;` | | Plus-or-minus sign | 10110001 | 261 | 177 | B1 | ± | `&plusmn;` | | Superscript two - squared | 10110010 | 262 | 178 | B2 | ² | `&sup2;` | | Superscript three - cubed | 10110011 | 263 | 179 | B3 | ³ | `&sup3;` | | Acute accent - spacing acute | 10110100 | 264 | 180 | B4 | ´ | `&acute;` | | Micro sign | 10110101 | 265 | 181 | B5 | µ | `&micro;` | | Pilcrow sign - paragraph sign | 10110110 | 266 | 182 | B6 | ¶ | `&para;` | | Middle dot - Georgian comma | 10110111 | 267 | 183 | B7 | · | `&middot;` | | Spacing cedilla | 10111000 | 270 | 184 | B8 | ¸ | `&cedil;` | | Superscript one | 10111001 | 271 | 185 | B9 | ¹ | `&sup1;` | | Masculine ordinal indicator | 10111010 | 272 | 186 | BA | º | `&ordm;` | | Right double angle quotes | 10111011 | 273 | 187 | BB | » | `&raquo;` | | Fraction one quarter | 10111100 | 274 | 188 | BC | ¼ | `&frac14;` | | Fraction one half | 10111101 | 275 | 189 | BD | ½ | `&frac12;` | | Fraction three quarters | 10111110 | 276 | 190 | BE | ¾ | `&frac34;` | | Inverted question mark | 10111111 | 277 | 191 | BF | ¿ | `&iquest;` | | Latin capital letter A with grave | 11000000 | 300 | 192 | C0 | À | `&Agrave;` | | Latin capital letter A with acute | 11000001 | 301 | 193 | C1 | Á | `&Aacute;` | | Latin capital letter A with circumflex | 11000010 | 302 | 194 | C2 |  | `&Acirc;` | | Latin capital letter A with tilde | 11000011 | 303 | 195 | C3 | à | `&Atilde;` | | Latin capital letter A with diaeresis | 11000100 | 304 | 196 | C4 | Ä | `&Auml;` | | Latin capital letter A with ring above | 11000101 | 305 | 197 | C5 | Å | `&Aring;` | | Latin capital letter AE | 11000110 | 306 | 198 | C6 | Æ | `&AElig;` | | Latin capital letter C with cedilla | 11000111 | 307 | 199 | C7 | Ç | `&Ccedil;` | | Latin capital letter E with grave | 11001000 | 310 | 200 | C8 | È | `&Egrave;` | | Latin capital letter E with acute | 11001001 | 311 | 201 | C9 | É | `&Eacute;` | | Latin capital letter E with circumflex | 11001010 | 312 | 202 | CA | Ê | `&Ecirc;` | | Latin capital letter E with diaeresis | 11001011 | 313 | 203 | CB | Ë | `&Euml;` | | Latin capital letter I with grave | 11001100 | 314 | 204 | CC | Ì | `&Igrave;` | | Latin capital letter I with acute | 11001101 | 315 | 205 | CD | Í | `&Iacute;` | | Latin capital letter I with circumflex | 11001110 | 316 | 206 | CE | Î | `&Icirc;` | | Latin capital letter I with diaeresis | 11001111 | 317 | 207 | CF | Ï | `&Iuml;` | | Latin capital letter ETH | 11010000 | 320 | 208 | D0 | Ð | `&ETH;` | | Latin capital letter N with tilde | 11010001 | 321 | 209 | D1 | Ñ | `&Ntilde;` | | Latin capital letter O with grave | 11010010 | 322 | 210 | D2 | Ò | `&Ograve;` | | Latin capital letter O with acute | 11010011 | 323 | 211 | D3 | Ó | `&Oacute;` | | Latin capital letter O with circumflex | 11010100 | 324 | 212 | D4 | Ô | `&Ocirc;` | | Latin capital letter O with tilde | 11010101 | 325 | 213 | D5 | Õ | `&Otilde;` | | Latin capital letter O with diaeresis | 11010110 | 326 | 214 | D6 | Ö | `&Ouml;` | | Multiplication sign | 11010111 | 327 | 215 | D7 | × | `&times;` | | Latin capital letter O with slash | 11011000 | 330 | 216 | D8 | Ø | `&Oslash;` | | Latin capital letter U with grave | 11011001 | 331 | 217 | D9 | Ù | `&Ugrave;` | | Latin capital letter U with acute | 11011010 | 332 | 218 | DA | Ú | `&Uacute;` | | Latin capital letter U with circumflex | 11011011 | 333 | 219 | DB | Û | `&Ucirc;` | | Latin capital letter U with diaeresis | 11011100 | 334 | 220 | DC | Ü | `&Uuml;` | | Latin capital letter Y with acute | 11011101 | 335 | 221 | DD | Ý | `&Yacute;` | | Latin capital letter THORN | 11011110 | 336 | 222 | DE | Þ | `&THORN;` | | Latin small letter sharp s - ess-zed | 11011111 | 337 | 223 | DF | ß | `&szlig;` | | Latin small letter a with grave | 11100000 | 340 | 224 | E0 | à | `&agrave;` | | Latin small letter a with acute | 11100001 | 341 | 225 | E1 | á | `&aacute;` | | Latin small letter a with circumflex | 11100010 | 342 | 226 | E2 | â | `&acirc;` | | Latin small letter a with tilde | 11100011 | 343 | 227 | E3 | ã | `&atilde;` | | Latin small letter a with diaeresis | 11100100 | 344 | 228 | E4 | ä | `&auml;` | | Latin small letter a with ring above | 11100101 | 345 | 229 | E5 | å | `&aring;` | | Latin small letter ae | 11100110 | 346 | 230 | E6 | æ | `&aelig;` | | Latin small letter c with cedilla | 11100111 | 347 | 231 | E7 | ç | `&ccedil;` | | Latin small letter e with grave | 11101000 | 350 | 232 | E8 | è | `&egrave;` | | Latin small letter e with acute | 11101001 | 351 | 233 | E9 | é | `&eacute;` | | Latin small letter e with circumflex | 11101010 | 352 | 234 | EA | ê | `&ecirc;` | | Latin small letter e with diaeresis | 11101011 | 353 | 235 | EB | ë | `&euml;` | | Latin small letter i with grave | 11101100 | 354 | 236 | EC | ì | `&igrave;` | | Latin small letter i with acute | 11101101 | 355 | 237 | ED | í | `&iacute;` | | Latin small letter i with circumflex | 11101110 | 356 | 238 | EE | î | `&icirc;` | | Latin small letter i with diaeresis | 11101111 | 357 | 239 | EF | ï | `&iuml;` | | Latin small letter eth | 11110000 | 360 | 240 | F0 | ð | `&eth;` | | Latin small letter n with tilde | 11110001 | 361 | 241 | F1 | ñ | `&ntilde;` | | Latin small letter o with grave | 11110010 | 362 | 242 | F2 | ò | `&ograve;` | | Latin small letter o with acute | 11110011 | 363 | 243 | F3 | ó | `&oacute;` | | Latin small letter o with circumflex | 11110100 | 364 | 244 | F4 | ô | `&ocirc;` | | Latin small letter o with tilde | 11110101 | 365 | 245 | F5 | õ | `&otilde;` | | Latin small letter o with diaeresis | 11110110 | 366 | 246 | F6 | ö | `&ouml;` | | Division sign | 11110111 | 367 | 247 | F7 | ÷ | `&divide;` | | Latin small letter o with slash | 11111000 | 370 | 248 | F8 | ø | `&oslash;` | | Latin small letter u with grave | 11111001 | 371 | 249 | F9 | ù | `&ugrave;` | | Latin small letter u with acute | 11111010 | 372 | 250 | FA | ú | `&uacute;` | | Latin small letter u with circumflex | 11111011 | 373 | 251 | FB | û | `&ucirc;` | | Latin small letter u with diaeresis | 11111100 | 374 | 252 | FC | ü | `&uuml;` | | Latin small letter y with acute | 11111101 | 375 | 253 | FD | ý | `&yacute;` | | Latin small letter thorn | 11111110 | 376 | 254 | FE | þ | `&thorn;` | | Latin small letter y with diaeresis | 11111111 | 377 | 255 | FF | ÿ | `&yuml;` | ## HTML Entities In addition to the HTML names, all above printable characters may be inserted into HTML documents using the HTML entity decimal number in the format `&#123;` or hex number `&xABCD;`. ### Euro Sign When ISO/IEC 8859-1 and CP-1252 were standardized the Euro did not exist. Microsoft added the Euro sign to CP-1252 with [[Windows 98]]. ISO/IEC 8859-15 added the Euro sign as well, but at `0xA4` instead of `0x80`. While the Euro sign also works using the CP-1252 numbers, this is not specified in any standard that I can find and is more properly referenced by its [[Unicode Standard]] character code `&#8364;` or `&x20AC;`. # ISO 8859-15 An extension to [[#ISO 8859-1]] which is *incompatible* with Codepage-1252. ## Incompatibilities with Codepage-1252 # ISO 8859-1 ISO/IEC 8859-1 is a standard which is identical to Codepage-1252 except that it reserved bytes 128-159 (`0x80`-`0x9F`) for other standards, particularly [[ANSI|ISO/IEC 6429]]. ISO/IEC 8859-1 was adapted from [[#ECMA-94]], using the exact same arrangement of glyphs. # ECMA-94 ECMA-94 AKA "Latin-1" was standardized in [[198?]] and a second edition was published in [[1986]]. It is largely a superset of [[#DEC MCS]] but it introduced a handful of [[#Incompatibilities with DEC MCS]] that made it not backwards compatible. This was, generally speaking, a bad move. ## Additions to DEC MCS ECMA-94 added 16 additional characters to the set beyond what [[#DEC MCS]] supported. | Name | BInary | Oct | Dec | Hex | Glyph | | | :-------------------------------- | -------- | --- | --- | --- | ----- | :--------- | | Non-breaking space | 10100000 | 240 | 160 | A0 | NBSP | | | Pipe, broken vertical bar | 10100110 | 246 | 166 | A6 | ¦ | `&brvbar;` | | Spacing diaeresis - umlaut | 10101000 | 250 | 168 | A8 | ¨ | `&uml;` | | Negation | 10101100 | 254 | 172 | AC | ¬ | `&not;` | | Soft hyphen | 10101101 | 255 | 173 | AD | ­SHY | `&shy;` | | Registered trade mark sign | 10101110 | 256 | 174 | AE | ® | `&reg;` | | Spacing macron - overline | 10101111 | 257 | 175 | AF | ¯ | `&macr;` | | Acute accent - spacing acute | 10110100 | 264 | 180 | B4 | ´ | `&acute;` | | Spacing cedilla | 10111000 | 270 | 184 | B8 | ¸ | `&cedil;` | | Fraction three quarters | 10111110 | 276 | 190 | BE | ¾ | `&frac34;` | | Latin capital letter ETH | 11010000 | 320 | 208 | D0 | Ð | `&ETH;` | | Multiplication sign | 11010111 | 327 | 215 | D7 | × | `&times;` | | Latin capital letter Y with acute | 11011101 | 335 | 221 | DD | Ý | `&Yacute;` | | Latin capital letter THORN | 11011110 | 336 | 222 | DE | Þ | `&THORN;` | | Latin small letter eth | 11110000 | 360 | 240 | F0 | ð | `&eth;` | | Division sign | 11110111 | 367 | 247 | F7 | ÷ | `&divide;` | | Latin small letter y with acute | 11111101 | 375 | 253 | FD | ý | `&yacute;` | | Latin small letter thorn | 11111110 | 376 | 254 | FE | þ | `&thorn;` | ## Incompatibilities with DEC MCS ECMA-94 moved a couple of [[#DEC MCS]]'s symbols around and made the terrible decision to drop three characters entirely. This final mistake meant that they had to re-add it later in [[Codepage-1252|ISO 8859-15]] in an awkward position, making it incompatible with Codepage-1252. | Name | DEC MCS | ECMA-94 | Glyph | | --------------------- | ------- | --------- | ----- | | Currency Symbol | `0xA8` | `0xA4` | ¤ | | Uppercase OE Ligature | `0xD7` | *REMOVED* | Œ | | Lowercase OE Ligature | `0xF7` | *REMOVED* | œ | | Uppercase Y Diaeresis | `0xDD` | *REMOVED* | Ÿ | | Lowercase Y Diaeresis | `0xFD` | `0xFF` | ÿ | Technically, ECMA-92 didn't add any other incompatibilities than those in the above table. However, [[ANSI|ECMA-48]] and [[ECMA-6]] were already incompatible with it, as they diverged slightly from [[#DEC MCS]] control codes in the `0x80`-`0x9F` range. # DEC MCS DEC developed a Multinational Character Set for their [[VT220]] terminal hardware. # References - https://en.wikipedia.org/wiki/Windows-1252 - https://en.wikipedia.org/wiki/ISO/IEC_8859 - https://en.wikipedia.org/wiki/ISO/IEC_8859-1 - https://en.wikipedia.org/wiki/ISO/IEC_8859-15