ISO-8859-1
From MobileRead
ISO-8859-1 is also known as Latin-1. The first 128 characters in the code match the ASCII code. These codes are also in the Windows-1252 character set.
Contents |
[edit] Character set
In English Windows OS, the characters from ISO-8859-1 can be inserted by holding down the Alt key and entering a zero followed by the character's three-digit decimal code on the numpad.
Special characters should all be translated into their appropriate ISO LATIN code equivalent – either the numeric code, or the entity reference code. For instance, any ampersands need to be converted to “&” throughout a book. These codes are also supported under UTF-8 using U+00A0 to U+00FF which are the hexadecimal values for these code. This table includes a few ASCII characters that can be problematic to enter directly due to their specialized use in HTML.
| Number Code | Word Code | Description | Character |
|---|---|---|---|
| " | " | quotation mark | " |
| & | & | ampersand | & |
| ' | ' | apostrophe* | ' |
| < | < | less-than sign | < |
| > | > | greater-than sign | > |
| ~ | ∼ | large Tilde | ~ |
| € to Ÿ are not defined in this character set, see Windows-1252 codes; | |||
|   | | non-breaking space* | |
| ¡ | ¡ | inverted exclamation | ¡ |
| ¢ | ¢ | cent sign | ¢ |
| £ | £ | pound sterling | £ |
| ¤ | ¤ | Currency | ¤ |
| ¥ | ¥ | yen sign | ¥ |
| ¦ | ¦ | broken vertical bar | ¦ |
| § | § | section sign | § |
| ¨ | ¨ | umlaut (dieresis) | ¨ |
| © | © | copyright | © |
| ª | ª | feminine ordinal | ª |
| « | « | left angle quote | « |
| ¬ | ¬ | not sign | ¬ |
| ­ | ­ | soft hyphen* | |
| ® | ® | registered trademark | ® |
| ¯ | ¯ | macron accent | ¯ |
| ° | ° | degree sign | ° |
| ± | ± | plus or minus | ± |
| ² | ² | Superscript two | ² |
| ³ | ³ | Superscript three | ³ |
| ´ | ´ | acute accent | ´ |
| µ | µ | micro sign | µ |
| ¶ | ¶ | paragraph sign | ¶ |
| · | · | middle dot | · |
| ¸ | ¸ | cedilla | ¸ |
| ¹ | ¹ | Superscript one | ¹ |
| º | º | masculine ordinal | º |
| » | » | right angle quote | » |
| ¼ | ¼ | one fourth | ¼ |
| ½ | ½ | one half | ½ |
| ¾ | ¾ | three fourths | ¾ |
| ¿ | ¿ | inverted question mark | ¿ |
| À | À | capital A, grave accent | À |
| Á | Á | capital A, acute accent | Á |
| Â | Â | capital A, circumflex accent | Â |
| Ã | Ã | capital A, tilde | Ã |
| Ä | Ä | capital A, dieresis or umlaut mark | Ä |
| Å | Å | capital A, ring | Å |
| Æ | Æ | capital AE diphthong (ligature) | Æ |
| Ç | Ç | capital C, cedilla | Ç |
| È | È | capital E, grave accent | È |
| É | É | capital E, acute accent | É |
| Ê | Ê | capital E, circumflex accent | Ê |
| Ë | Ë | capital E, dieresis or umlaut mark | Ë |
| Ì | Ì | capital I, grave accent | Ì |
| Í | Í | capital I, acute accent | Ì |
| Î | Î | capital I, circumflex accent | Î |
| Ï | Ï | capital I, dieresis or umlaut mark | Ï |
| Ð | Ð | capital ETH | Ð |
| Ñ | Ñ | capital N, tilde | Ñ |
| Ò | Ò | capital O, grave accent | Ò |
| Ó | Ó | capital O, acute accent | Ó |
| Ô | Ô | capital O, circumflex accent | Ô |
| Õ | Õ | &capital O, tilde | Õ |
| Ö | Ö | &capital O, dieresis or umlaut mark | Ö |
| × | × | multiply sign | × |
| Ø | Ø | capital O, slash | Ø |
| Ù | Ù | capital U, grave accent | Ù |
| Ú | Ú | capital U, acute accent | Ú |
| Û | Û | capital U, circumflex accent | Û |
| Ü | Ü | capital U, dieresis or umlaut mark | Ü |
| Ý | Ý | capital Y, acute accent | Ý |
| Þ | Þ | capital THORN | Þ |
| ß | ß | small sharp s, German (sz ligature) | ß |
| à | à | small a, grave accent | à |
| á | á | small a, acute accent | á |
| â | â | small a, circumflex accent | â |
| ã | ã | small a, tilde | ã |
| ä | ä | small a, dieresis or umlaut mark | ä |
| å | å | small a, ring | å |
| æ | æ | small ae diphthong (ligature) | æ |
| ç | ç | small c, cedilla | ç |
| è | è | small e, grave accent | è |
| é | é | small e, acute accent | é |
| ê | ê | small e, circumflex accent | ê |
| ë | ë | small e, dieresis or umlaut mark | ë |
| ì | ì | small i, grave accent | ì |
| í | í | small i, acute accent | í |
| î | î | small i, circumflex accent | î |
| ï | ï | small i, dieresis or umlaut mark | ï |
| ð | ð | small eth | ð |
| ñ | ñ | small n, tilde | ñ |
| ò | ò | small o, grave accent | ò |
| ó | ó | small o, acute accent | ó |
| ô | ô | small o, circumflex accent | ô |
| õ | õ | small o, tilde | õ |
| ö | ö | small o, dieresis or umlaut mark | ö |
| ÷ | ÷ | division sign | ÷ |
| ø | ø | small o, slash | ø |
| ù | ù | small u, grave accent | ù |
| ú | ú | small u, acute accent | ú |
| û | û | small u, circumflex accent | û |
| ü | ü | small u, dieresis or umlaut mark | ü |
| ý | ý | small y, acute accent | ý |
| þ | þ | small thorn | þ |
| ÿ | ÿ | small y, dieresis or umlaut mark | ÿ |
* While most of the symbols in the table are for communication there are a few that are specifically typographic.
- The non-breaking space is used as a forcing space. A line cannot terminate on a forcing space so it serves to force two words to behave as one.
- The soft hyphen marks the location of a hyphen point. It can be used to hyphenate a word if it appears near the end of a line. Otherwise the character should be ignored and not printed.
- the apostrophe is defined as an XML character with the mnemonic apos but this is not recognized by most browsers. Use the numeric 39 instead.
[edit] Unsupported codes
Not all devices or implementations of this standard support all of the characters defined in the standard. Here are some of the exceptions:
- Gemstar devices: no support for 164, 178, 179, 185, 188, 189, 190, 208, 222, 240, 254.
- ETI devices: no support for 164, 188, 189, 190, 208, 222, 240, 254
- MobiPocket and Amazon Kindle devices: 126 mapped to – instead of Tilde.
- PML supports all codes.
[edit] Coverage
- Modern languages with complete coverage of their alphabet
|
|
|
[edit] ISO-8859-15
The ISO-8859-15 standard (also known as "Latin alphabet no. 9" or simply Latin-9) is designed to address some of the shortcomings in the original ISO-8859-1 standard. The idea is to fully support a few more western languages and add the Euro symbol by replacing some little used symbols in 8859-1. These symbols were previously included in the Windows-1252 character set.
[edit] Changes from ISO-8859-1
| Position | 164 | 166 | 168 | 180 | 184 | 188 | 189 | 190 |
|---|---|---|---|---|---|---|---|---|
| 8859-1 | ¤ | ¦ | ¨ | ´ | ¸ | ¼ | ½ | ¾ |
| 8859-15 | € | Š | š | Ž | ž | Œ | œ | Ÿ |
€ became necessary when the Euro was introduced. The rest were excluded from ISO 8859-1 because it was motivated by information exchange and not typography. Š, š, Ž, and ž are used in some loanwords and transliteration of Russian names in Finnish and Estonian typography. Œ and œ are French ligatures, and Ÿ is needed in French all-caps text, as it is present in a few proper names such as the city of l'Haÿ-les-Roses.
[edit] Extra Languages covered
- Estonian
- Dutch (minus the IJ, ij ligatures)
- Finnish
- French
- Malay
- Tagalog

