ISO 8859-5

From MobileRead
Jump to: navigation, search

ISO 8859-5 is an ISO standard Character set for Cyrillic but has never really caught on in practice. It is not nearly as popular as Windows-1251 which covers the same alphabet but also adds some extra characters.

Contents

[edit] History

The Eastern Europe/Western Asia developed writing in the 9th century. Originally the alphabet, called Glagolitic, was developed by Christian Monks who were trying to introduce Christianity to the Macedonia region. They translated the Bible using this alphabet. It was derived from the Greek alphabet with new symbols added for unique sounds of the Macedonia languages. Several years later the Bulgarians developed their own version which became the Cyrillic alphabet. It used many of the earlier symbols and was adopted by most countries in the region including Russia. Both alphabets were in use for several centuries but Cyrillic was eventually adopted in the 19th century and continues today.

It is currently used exclusively or as one of several alphabets for more than 50 languages, notably Belarusian, Bulgarian, Kazakh, Kyrgyz, Macedonian, Montenegrin (spoken in Montenegro; also called Serbian), Russian, Serbian, Tajik (a dialect of Persian), Turkmen, Ukrainian, and Uzbek. It consists of 32 characters.

[edit] Alphabet

Cyrillic

  • upper case: А Б В Г Д Е Ж З И Й К Л М Н О П Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
  • lower case: а б в г д е ж з и й к л м н о п р с т у ф х ц ч ш щ ъ ы ь э ю я

Eastern Europe

  • others: № Ё ё Ђ ђ Ѓ ѓ Є є Ѕ ѕ І і Ї ї Ј ј Љ љ Њ њ Ћ ћ Ќ ќ § Ў ў Џ џ

Uppercase Cyrillic was based on Greek characters with extra characters added where the Greek had no equivalent sound. In addition the Greek characters not needed were removed. Thus you see a similar order for the letters with new letters inserted as needed.

[edit] 8859-5 Code page layout

The following table shows ISO 9959-5. Each character is shown with its decimal code and its Unicode equivalent.

The full ISO 8859-5 includes ASCII values. The ones shown here are unique to this coding. This code is related to the Windows-1251 coding system but does not include all of the characters in that system nor are the code combinations the same.

UTF-8 Character Code Description[1]
U+00A0 160 nbsp;
U+0401 Ё 161 IOcy;
U+0402 Ђ 162 DJcy;
U+0403 Ѓ 163 GJcy;
U+0404 Є 164 Jukcy;
U+0405 Ѕ 165 DScy;
U+0406 І 166 Iukcy;
U+0407 Ї 167 YIcy;
U+0408 Ј 168 Jsercy;
U+0409 Љ 169 LJcy;
U+040A Њ 170 NJcy;
U+040B Ћ 171 TSHcy;
U+040C Ќ 172 KJcy;
U+00AD 173 shy; (soft hyphen)
U+040E Ў 174 Ubrcy;
U+040F Џ 175 DZcy;
U+0410 А 176 Acy;
U+0411 Б 177 Bcy;
U+0412 В 178 Vcy;
U+0413 Г 179 Gcy;
U+0414 Д 180 Dcy;
U+0415 Е 181 IEcy;
U+0416 Ж 182 ZHcy;
U+0417 З 183 Zcy;
U+0418 И 184 Icy;
U+0419 Й 185 Jcy;
U+041A К 186 Kcy;
U+041B Л 187 Lcy;
U+041C М 188 Mcy;
U+041D Н 189 Ncy;
U+041E О 190 Ocy;
U+041F П 191 Pcy;
U+0420 Р 192 Rcy;
U+0421 С 193 Scy;
U+0422 Т 194 Tcy;
U+0423 У 195 Ucy;
U+0424 Ф 196 Fcy;
U+0425 Х 197 KHcy;
U+0426 Ц 198 TScy;
U+0427 Ч 199 CHcy;
U+0428 Ш 200 SHcy;
U+0429 Щ 201 SHCHcy;
U+042A Ъ 202 HARDcy;
U+042B Ы 203 Ycy;
U+042C Ь 204 SOFTcy;
U+042D Э 205 Ecy;
U+042E Ю 206 YUcy;
U+042F Я 207 YAcy;
U+0430 а 208 acy;
U+0431 б 209 bcy;
U+0432 в 210 vcy;
U+0433 г 211 gcy;
U+0434 д 212 dcy;
U+0435 е 213 iecy;
U+0436 ж 214 zhcy;
U+0437 з 215 zcy;
U+0438 и 216 icy;
U+0439 й 217 jcy;
U+043A к 218 kcy;
U+043B л 219 lcy;
U+04eC м 220 mcy;
U+043D н 221 ncy;
U+043E о 222 ocy;
U+043F п 223 pcy;
U+0440 р 224 rcy;
U+0441 с 225 scy;
U+0442 т 226 tcy;
U+0443 у 227 ucy;
U+0444 ф 228 fcy;
U+0445 х 229 khcy;
U+0446 ц 230 tscy;
U+0447 ч 231 chcy;
U+0448 ш 232 shcy;
U+0449 щ 233 shchcy;
U+044A ъ 234 hardcy;
U+044B ы 235 ycy;
U+044C ь 236 softcy;
U+044D э 237 ecy;
U+044E ю 238 yucy;
U+044F я 239 yacy;
U+2116 240 numero; [2]
U+0451 ё 241 iocy;
U+0452 ђ 242 djcy;
U+0453 ѓ 243 gjcy;
U+0454 є 244 jukcy;
U+0455 ѕ 245 dscy;
U+0456 і 246 iukcy;
U+0457 ї 247 yicy;
U+0458 ј 248 jsercy;
U+0459 љ 249 ljcy;
U+045A њ 250 njcy;
U+045B ћ 251 tshcy;
U+045C ќ 252 kjcy;
U+00A7 § 253 sect;[3]
U+045E ў 254 ubrcy;
U+045F џ 255 dzcy;

[edit] Notes

  1. Unless otherwise indicated the codes listed in the description are the same as ISO-8859-1 Names ending in cy; are from named character references.
  2. Not the same as ISO-8859-1.
  3. section Not the same as ISO-8859-1.

[edit] For more information

Personal tools
Namespaces

Variants
Actions
Navigation
MobileRead Networks
Toolbox