ISO 8859-5
ISO 8859-5 is an ISO standard Character set for Cyrillic but has never really caught on in practice. It is not nearly as popular as Windows-1251 which covers the same alphabet but also adds some extra characters.
Contents |
[edit] History
The Eastern Europe/Western Asia developed writing in the 9th century. Originally the alphabet, called Glagolitic, was developed by Christian Monks who were trying to introduce Christianity to the Macedonia region. They translated the Bible using this alphabet. It was derived from the Greek alphabet with new symbols added for unique sounds of the Macedonia languages. Several years later the Bulgarians developed their own version which became the Cyrillic alphabet. It used many of the earlier symbols and was adopted by most countries in the region including Russia. Both alphabets were in use for several centuries but Cyrillic was eventually adopted in the 19th century and continues today.
It is currently used exclusively or as one of several alphabets for more than 50 languages, notably Belarusian, Bulgarian, Kazakh, Kyrgyz, Macedonian, Montenegrin (spoken in Montenegro; also called Serbian), Russian, Serbian, Tajik (a dialect of Persian), Turkmen, Ukrainian, and Uzbek. It consists of 32 characters.
[edit] Alphabet
Cyrillic
- upper case: А Б В Г Д Е Ж З И Й К Л М Н О П Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
- lower case: а б в г д е ж з и й к л м н о п р с т у ф х ц ч ш щ ъ ы ь э ю я
Eastern Europe
- others: № Ё ё Ђ ђ Ѓ ѓ Є є Ѕ ѕ І і Ї ї Ј ј Љ љ Њ њ Ћ ћ Ќ ќ § Ў ў Џ џ
Uppercase Cyrillic was based on Greek characters with extra characters added where the Greek had no equivalent sound. In addition the Greek characters not needed were removed. Thus you see a similar order for the letters with new letters inserted as needed.
[edit] 8859-5 Code page layout
The following table shows ISO 9959-5. Each character is shown with its decimal code and its Unicode equivalent.
The full ISO 8859-5 includes ASCII values. The ones shown here are unique to this coding. This code is related to the Windows-1251 coding system but does not include all of the characters in that system nor are the code combinations the same.
UTF-8 | Character | Code | Description[1] |
---|---|---|---|
U+00A0 | 160 | nbsp; | |
U+0401 | Ё | 161 | IOcy; |
U+0402 | Ђ | 162 | DJcy; |
U+0403 | Ѓ | 163 | GJcy; |
U+0404 | Є | 164 | Jukcy; |
U+0405 | Ѕ | 165 | DScy; |
U+0406 | І | 166 | Iukcy; |
U+0407 | Ї | 167 | YIcy; |
U+0408 | Ј | 168 | Jsercy; |
U+0409 | Љ | 169 | LJcy; |
U+040A | Њ | 170 | NJcy; |
U+040B | Ћ | 171 | TSHcy; |
U+040C | Ќ | 172 | KJcy; |
U+00AD | 173 | shy; (soft hyphen) | |
U+040E | Ў | 174 | Ubrcy; |
U+040F | Џ | 175 | DZcy; |
U+0410 | А | 176 | Acy; |
U+0411 | Б | 177 | Bcy; |
U+0412 | В | 178 | Vcy; |
U+0413 | Г | 179 | Gcy; |
U+0414 | Д | 180 | Dcy; |
U+0415 | Е | 181 | IEcy; |
U+0416 | Ж | 182 | ZHcy; |
U+0417 | З | 183 | Zcy; |
U+0418 | И | 184 | Icy; |
U+0419 | Й | 185 | Jcy; |
U+041A | К | 186 | Kcy; |
U+041B | Л | 187 | Lcy; |
U+041C | М | 188 | Mcy; |
U+041D | Н | 189 | Ncy; |
U+041E | О | 190 | Ocy; |
U+041F | П | 191 | Pcy; |
U+0420 | Р | 192 | Rcy; |
U+0421 | С | 193 | Scy; |
U+0422 | Т | 194 | Tcy; |
U+0423 | У | 195 | Ucy; |
U+0424 | Ф | 196 | Fcy; |
U+0425 | Х | 197 | KHcy; |
U+0426 | Ц | 198 | TScy; |
U+0427 | Ч | 199 | CHcy; |
U+0428 | Ш | 200 | SHcy; |
U+0429 | Щ | 201 | SHCHcy; |
U+042A | Ъ | 202 | HARDcy; |
U+042B | Ы | 203 | Ycy; |
U+042C | Ь | 204 | SOFTcy; |
U+042D | Э | 205 | Ecy; |
U+042E | Ю | 206 | YUcy; |
U+042F | Я | 207 | YAcy; |
U+0430 | а | 208 | acy; |
U+0431 | б | 209 | bcy; |
U+0432 | в | 210 | vcy; |
U+0433 | г | 211 | gcy; |
U+0434 | д | 212 | dcy; |
U+0435 | е | 213 | iecy; |
U+0436 | ж | 214 | zhcy; |
U+0437 | з | 215 | zcy; |
U+0438 | и | 216 | icy; |
U+0439 | й | 217 | jcy; |
U+043A | к | 218 | kcy; |
U+043B | л | 219 | lcy; |
U+04eC | м | 220 | mcy; |
U+043D | н | 221 | ncy; |
U+043E | о | 222 | ocy; |
U+043F | п | 223 | pcy; |
U+0440 | р | 224 | rcy; |
U+0441 | с | 225 | scy; |
U+0442 | т | 226 | tcy; |
U+0443 | у | 227 | ucy; |
U+0444 | ф | 228 | fcy; |
U+0445 | х | 229 | khcy; |
U+0446 | ц | 230 | tscy; |
U+0447 | ч | 231 | chcy; |
U+0448 | ш | 232 | shcy; |
U+0449 | щ | 233 | shchcy; |
U+044A | ъ | 234 | hardcy; |
U+044B | ы | 235 | ycy; |
U+044C | ь | 236 | softcy; |
U+044D | э | 237 | ecy; |
U+044E | ю | 238 | yucy; |
U+044F | я | 239 | yacy; |
U+2116 | № | 240 | numero; [2] |
U+0451 | ё | 241 | iocy; |
U+0452 | ђ | 242 | djcy; |
U+0453 | ѓ | 243 | gjcy; |
U+0454 | є | 244 | jukcy; |
U+0455 | ѕ | 245 | dscy; |
U+0456 | і | 246 | iukcy; |
U+0457 | ї | 247 | yicy; |
U+0458 | ј | 248 | jsercy; |
U+0459 | љ | 249 | ljcy; |
U+045A | њ | 250 | njcy; |
U+045B | ћ | 251 | tshcy; |
U+045C | ќ | 252 | kjcy; |
U+00A7 | § | 253 | sect;[3] |
U+045E | ў | 254 | ubrcy; |
U+045F | џ | 255 | dzcy; |
[edit] Notes
- ↑ Unless otherwise indicated the codes listed in the description are the same as ISO-8859-1 Names ending in cy; are from named character references.
- ↑ Not the same as ISO-8859-1.
- ↑ section Not the same as ISO-8859-1.