ISO-8859-1
From MobileRead
ISO-8859-1 is also known as Latin-1. The first 128 characters in the code match the ASCII code. These codes are also in the Windows-1252 character set.
Contents |
Character set
In English Windows OS, the characters from ISO-8859-1 can be inserted by holding down the Alt key and entering a zero followed by the character's three-digit decimal code on the numpad.
Special characters should all be translated into their appropriate ISO LATIN code equivalent – either the numeric code, or the entity reference code. For instance, any ampersands need to be converted to “&” throughout a book. These codes are also supported under UTF-8 using U+00A0 to U+00FF which are the hexadecimal values for these code. This table includes a few ASCII characters that can be problematic to enter directly due to their specialized use in HTML.
| Number Code | Word Code | Description | Character |
|---|---|---|---|
| " | " | quotation mark | " |
| & | & | ampersand | & |
| ' | ' | apostrophe* | ' |
| < | < | less-than sign | < |
| > | > | greater-than sign | > |
| ~ | ∼ | large Tilde | ~ |
| € to Ÿ are not defined in this character set, see Windows-1252 codes; | |||
|   | | non-breaking space* | |
| ¡ | ¡ | inverted exclamation | ¡ |
| ¢ | ¢ | cent sign | ¢ |
| £ | £ | pound sterling | £ |
| ¤ | ¤ | Currency | ¤ |
| ¥ | ¥ | yen sign | ¥ |
| ¦ | ¦ | broken vertical bar | ¦ |
| § | § | section sign | § |
| ¨ | ¨ | umlaut (dieresis) | ¨ |
| © | © | copyright | © |
| ª | ª | feminine ordinal | ª |
| « | « | left angle quote | « |
| ¬ | ¬ | not sign | ¬ |
| ­ | ­ | soft hyphen* | |
| ® | ® | registered trademark | ® |
| ¯ | ¯ | macron accent | ¯ |
| ° | ° | degree sign | ° |
| ± | ± | plus or minus | ± |
| ² | ² | Superscript two | ² |
| ³ | ³ | Superscript three | ³ |
| ´ | ´ | acute accent | ´ |
| µ | µ | micro sign | µ |
| ¶ | ¶ | paragraph sign | ¶ |
| · | · | middle dot | · |
| ¸ | ¸ | cedilla | ¸ |
| ¹ | ¹ | Superscript one | ¹ |
| º | º | masculine ordinal | º |
| » | » | right angle quote | » |
| ¼ | ¼ | one fourth | ¼ |
| ½ | ½ | one half | ½ |
| ¾ | ¾ | three fourths | ¾ |
| ¿ | ¿ | inverted question mark | ¿ |
| À | À | capital A, grave accent | À |
| Á | Á | capital A, acute accent | Á |
| Â | Â | capital A, circumflex accent | Â |
| Ã | Ã | capital A, tilde | Ã |
| Ä | Ä | capital A, dieresis or umlaut mark | Ä |
| Å | Å | capital A, ring | Å |
| Æ | Æ | capital AE diphthong (ligature) | Æ |
| Ç | Ç | capital C, cedilla | Ç |
| È | È | capital E, grave accent | È |
| É | É | capital E, acute accent | É |
| Ê | Ê | capital E, circumflex accent | Ê |
| Ë | Ë | capital E, dieresis or umlaut mark | Ë |
| Ì | Ì | capital I, grave accent | Ì |
| Í | Í | capital I, acute accent | Ì |
| Î | Î | capital I, circumflex accent | Î |
| Ï | Ï | capital I, dieresis or umlaut mark | Ï |
| Ð | Ð | capital ETH | Ð |
| Ñ | Ñ | capital N, tilde | Ñ |
| Ò | Ò | capital O, grave accent | Ò |
| Ó | Ó | capital O, acute accent | Ó |
| Ô | Ô | capital O, circumflex accent | Ô |
| Õ | Õ | &capital O, tilde | Õ |
| Ö | Ö | &capital O, dieresis or umlaut mark | Ö |
| × | × | multiply sign | × |
| Ø | Ø | capital O, slash | Ø |
| Ù | Ù | capital U, grave accent | Ù |
| Ú | Ú | capital U, acute accent | Ú |
| Û | Û | capital U, circumflex accent | Û |
| Ü | Ü | capital U, dieresis or umlaut mark | Ü |
| Ý | Ý | capital Y, acute accent | Ý |
| Þ | Þ | capital THORN | Þ |
| ß | ß | small sharp s, German (sz ligature) | ß |
| à | à | small a, grave accent | à |
| á | á | small a, acute accent | á |
| â | â | small a, circumflex accent | â |
| ã | ã | small a, tilde | ã |
| ä | ä | small a, dieresis or umlaut mark | ä |
| å | å | small a, ring | å |
| æ | æ | small ae diphthong (ligature) | æ |
| ç | ç | small c, cedilla | ç |
| è | è | small e, grave accent | è |
| é | é | small e, acute accent | é |
| ê | ê | small e, circumflex accent | ê |
| ë | ë | small e, dieresis or umlaut mark | ë |
| ì | ì | small i, grave accent | ì |
| í | í | small i, acute accent | í |
| î | î | small i, circumflex accent | î |
| ï | ï | small i, dieresis or umlaut mark | ï |
| ð | ð | small eth | ð |
| ñ | ñ | small n, tilde | ñ |
| ò | ò | small o, grave accent | ò |
| ó | ó | small o, acute accent | ó |
| ô | ô | small o, circumflex accent | ô |
| õ | õ | small o, tilde | õ |
| ö | ö | small o, dieresis or umlaut mark | ö |
| ÷ | ÷ | division sign | ÷ |
| ø | ø | small o, slash | ø |
| ù | ù | small u, grave accent | ù |
| ú | ú | small u, acute accent | ú |
| û | û | small u, circumflex accent | û |
| ü | ü | small u, dieresis or umlaut mark | ü |
| ý | ý | small y, acute accent | ý |
| þ | þ | small thorn | þ |
| ÿ | ÿ | small y, dieresis or umlaut mark | ÿ |
* While most of the symbols in the table are for communication there are a few that are specifically typographic.
- The non-breaking space is used as a forcing space. A line cannot terminate on a forcing space so it serves to force two words to behave as one.
- The soft hyphen marks the location of a hyphen point. It can be used to hyphenate a word if it appears near the end of a line. Otherwise the character should be ignored and not printed.
- the apostrophe is defined as an XML character with the mnemonic apos but this is not recognized by most browsers. Use the numeric 39 instead.
Unsupported codes
Not all devices or implementations of this standard support all of the characters defined in the standard. Here are some of the exceptions:
- Gemstar devices: no support for 164, 178, 179, 185, 188, 189, 190, 208, 222, 240, 254.
- ETI devices: no support for 164, 188, 189, 190, 208, 222, 240, 254
- MobiPocket and Amazon Kindle devices: 126 mapped to – instead of Tilde.
- PML supports all codes.
Coverage
- Modern languages with complete coverage of their alphabet
|
|
|
ISO-8859-15
The ISO-8859-15 standard (also known as "Latin alphabet no. 9" or simply Latin-9) is designed to address some of the shortcomings in the original ISO-8859-1 standard. The idea is to fully support a few more western languages and add the Euro symbol by replacing some little used symbols in 8859-1. These symbols were previously included in the Windows-1252 character set.
Changes from ISO-8859-1
| Position | 164 | 166 | 168 | 180 | 184 | 188 | 189 | 190 |
|---|---|---|---|---|---|---|---|---|
| 8859-1 | ¤ | ¦ | ¨ | ´ | ¸ | ¼ | ½ | ¾ |
| 8859-15 | € | Š | š | Ž | ž | Œ | œ | Ÿ |
€ became necessary when the Euro was introduced. The rest were excluded from ISO 8859-1 because it was motivated by information exchange and not typography. Š, š, Ž, and ž are used in some loanwords and transliteration of Russian names in Finnish and Estonian typography. Œ and œ are French ligatures, and Ÿ is needed in French all-caps text, as it is present in a few proper names such as the city of l'Haÿ-les-Roses.
Extra Languages covered
- Estonian
- Dutch (minus the IJ, ij ligatures)
- Finnish
- French
- Malay
- Tagalog

