From MobileRead
Jump to: navigation, search

Windows-1252 is a superset of ISO-8859-1 by adding characters in the range of 128 to 159.


[edit] Introduction

The original ASCII code was designed to work in 7 bits which offers 128 separate characters. The first 32 (0 - 31) were reserved for control codes but most of the rest are printable. The character 127 was defined to implement the backspace/delete functionality but on some devices (like the eb1150) it will be shown as a small box if coded. Since most items in a computer are stored in bytes with its 8 bits there are another 128 characters that could be used. The ISO-8859-1 standard uses the ones from 160 to 255 but that still leaves the range of 128 to 159. These are non-standard since it depends on which code page you get your text from but they are still useful and the text editor in the eBook Publisher and some other applications recognize them as in the following table. (Conforms to Windows-1252 code page.)

In English Windows, the characters from Windows-1252 can be inserted by holding down the Alt key and entering a zero followed by the character's three-digit decimal code on the numpad.

Note that the display of curly characters will vary greatly depending on the font chosen. In this wiki they are unlikely to look ok. On dedicated eBook Readers these will normally display properly.

[edit] 1252 code page special characters

Note that the word codes shown are for reference. They will not normally generate these values but will likely generate the equivalent ISO or UTF-8 values depending on the reader (see special characters). The full Windows-1252 includes ASCII values and ISO-8859-1 values. The ones shown here are unique to this coding.

Number Code Word Code Character Description Unicode (decimal)
€ € Euro U+20AC (8364)
  undefined none
‚ ‚ single low 9 quote U+201A (8218)
ƒ &fnof ƒ function, florin U+0192 (402)
„ &dbquo; double low 9 quote U+201E (8222)
… … ellipse U+2026 (8230)
† † dagger U+2020 (8224)
‡ ‡ Double Dagger U+2021 (8225)
ˆ ˆ ˆ circumflex U+02C6 (710)
‰ ‰ per mille U+2030 (8240)
Š Š Š large S caron U+0160 (352)
‹ ‹ left angle quote U+2039 (8249)
ΠΠΠOE ligature U+0152 (338)
Ž Ž Ž Large Z caron* U+017D (381)
‘ ‘ left single curly quote U+2018 (8216)
’ ’ right single curly quote U+2019 (8217)
“ “ left double curly quote U+201C (8220)
” ” right double curly quote U+201D (8221)
• • bullet U+2022 (8226)
– – en dash U+2013 (8211)
— — em dash U+2014 (8212)
˜ ˜ ˜ tilde U+02DC (732)
™ ™ trademark U+2122 (8482)
š š š small s caron U+0161 (353)
› › right angle quote U+203A (8250)
œ œ œ oe ligature U+0153 (339)
ž ž ž small z caron* U+017E (382)
Ÿ Ÿ Ÿ Y umlat U+0178 (376)

Note * Zcaron and zcaron not in HTML entities

[edit] Coverage

All alphabets in the ISO-8859-1 set are covered plus
  • Estonian (adding Š, š, Ž, ž for loan words)
  • French (adding Œ, œ and the very rare Ÿ; they are generally replaced by 'OE' and 'oe' without the ligature, and 'Y' without the umlaut)
  • Finnish (adding Š, š, Ž, ž for loan words)
  • adding the Euro symbol
  • provides specialized eBook printing characters such as curly quotes and emdash.

[edit] Entering characters

In the English version of Windows, the characters from Windows-1252 and ISO-8859-1 can be inserted by holding down the Alt key and entering a zero followed by the character's three-digit decimal code on the numpad.

Personal tools

MobileRead Networks