E-book formats
From MobileRead
Contents |
[edit] eBooks Formats
This section attempts to define and identify all (or most) of the eBooks formats. With the great proliferation of formats a new user can easily become confused. The most important ones are always the ones that work on the device or devices you own but if you have a choice the most important ones are the ones that have the most eBook dealers or most eBooks available. Today these formats include AZW Amazon, MOBI MobiPocket, LIT Microsoft, PNPd Palm eReader, and BBeB Sony. Another very popular format is PDF from Adobe, however it does not tend to work as well on portable mobile devices due to the screen size. It is more suited to computers since it often expects full paper size pages. A new emerging eBook standard format is ePUB. Some of the formats in the list below are only available for a few or even one type of device. Some are more standardized. Be sure your device will read the format you choose.
- ABW - File format used by AbiWord. If compressed it is ZABW.
- ACSM - File format used to provide DRM on Adobe eBooks PDF and ePUB.
- AEH - File format used by eBooksWriter.
- AZW - The Amazon proprietary format. This is very close to the MOBI format sometimes with and sometimes without DRM. The DRM is unique to the Amazon Kindle.
- AZW1 - The Amazon proprietary format. It is the TPZ format always with a custom DRM.
- BBeB - The Sony proprietary format. Stands for Broadband EBook, also known as LRF because of the file extension.
- CBR/CBZ - Compressed container for images. The R means it is RAR compressed while Z means it is zipped compressed. The internal format can be any of several image formats and CBR/CBZ readers display these as multiple pages of a book. The name means Comic Book Reader but the use is for any book where the content contains basically pictures.
- CHM - Compressed HTML, often used for Windows help files. It has become very popular for distribution of texts and other support materials over the Web.
- DJVU - format by Lizardtech that is more and more widely used for scientific publications. Its main characteristics is that the compress ratio is about 10x better than in .pdf format at the same quality. Nothing beats it at the moment for b&w text and pictures.
- DNL - A digtital webbook format used by Desktop Author.
- DOC - This format could be a document in Microsoft Word format, which uses a .doc extension, or in PalmDOC
- DOCX - This is the latest Office Open XML format used in Word 2007 and later. Older versions of Word can be upgraded to support this format.
- EBA - proprietary eBook format. Specifically supported by eBook Readers for the Chinese language.
- EBAML same as EBA 2.0 - Used on the Dr. Yi Reader and other China products
- ePUB standard. An open format defined by the Open eBook Forum of the International Digital Publishing Forum (<idpf>). It is based on XHTML and XML. It is an evolving standard. Current specifications are found at the idpf web site. Adobe has DRM for this format.
- -ER.PDB - A Palm database format for the eReader reader. The ER is used to distinguish this format.
- EXE format means that the eBook is actually a PC executable. Basically the eBook includes the program to display it.
- FB2 - FictionBook format, based on XML and viewable by various e-book software solutions for Windows, Linux, PocketPC and Palm OS. Used by the HaaliReader, FBReader and the PalmFiction.
- FUB - Franklin eBook format.
- GPF - Ganaxa Publishing Format, allowing hot spots and embedded rich media content.
- GPX - A protected Ganaxa document.
- HTML - Hyper Text Markup Language is the backbone of the World Wide Web. Many texts are distributed in this format. In addition, some eBook readers support Cascading Style Sheets (CSS) that are basically a master style guide for HTML pages.
- IMP - an eBook format used by eBook Technologies ETI-1 (REB 1200/Softbook Reader) or ETI-2 (EBookwise-1150/Gemstar 1150). Some programs convert to it. It is considered a terminal format but see: IMP software.
- -IS.PDB - A Palm database format for the ISilo reader. The IS is used to distinguish this format.
- KML - HieBook eBook format.
- LIT - Microsoft's native format for its Microsoft Reader
- LRF - also: BBeB book. Sony's proprietary format. Supported by the Sony Librie and Sony Reader
- LRS - also: BBeB Xylog XML. Source format for BBeB books, which is compiled into LRF for reading on the device
- LRX - A protected BBeB document. The Sony Librie and Sony Reader use mutually incompatible formats
- MOBI - MobiPocket format, usable with MobiPocket's own reading software on almost any PDA and Smartphones. Mobipocket's Windows PC software can convert .chm, .doc, .html, .ocf, .pdf, .rtf, and .txt files to this format
- NP - format is the Newspaper download format used by NewspaperDirect for their PressDisplay product.
- OEB - Open eBook format. an ebook format used by EBookwise-1150, MobiPocket, and Microsoft Reader. This standard is an older version of ePUB
- ODT - This is an open standard, document format used in OpenOffice.org, Star Office and many other word processors.
- OSIS - This is an XML Schema definition for Bibles and other Biblical research texts.
- PDB - Palm Database File. Can hold several different e-book formats targeting Palm-enabled devices, commonly used for PalmDOC (AportisDoc) e-books and eReader formats as well and many others.
- PDF - Portable Document Format created by Adobe for their Acrobat products. It is the defacto standard for document interchange. Software support exists for almost every computer platform and handheld device. Some devices have problems with PDF since most content available is scaled for either A4 or letter format, both of which are not easily readable when reduced to fit on small screens. Some Readers can reflow some PDF documents, including the Sony PRS505, to accommodate the small screen. Some e-book readers, including the iRex iLiad, have a pan-and-zoom feature that aids readability, but extracts a price in ergonomics.
- PKG a format used on the Apple Newton.
- PNPd - format used by the eReader program. This is a popular format (Also known as PML).
- PRC - Palm Resource File. Often holds a Mobipocket eBook but occasionally holds an eReader or AportisDoc eBook. Most often it is a Palm executable.
- RB - Rocket eBook format made for the Rocket eBook device and the Gemstar RCA REB 1100.
- RTF - Rich Text Format is a document interchange format available supported by some e-book readers, and also by many Word processing applications including MS Word and OpenOffice. It is the preferred format for many users who create their own content for the Sony Reader.
- SGF - Native format for Sigil, a direct editor for ePUB.
- STK - STAReBOOK's proprietary format
- TCR - eBook for EPOC
- TeBR - Tiny Ebook Reader format also used by Fictionwise.
- TPZ - Topaz file extension used on Amazon Kindle
- TXT - Text is the base type, with no formatting applied other than space, paragraph, end of line, new line, and tab. It is usable in many e-book devices.
- TR - Tome Raider file. Tome Raider is an eBook format that features support for very large books such as reference books, encyclopedias, and dictionaries. Their latest format is called TR3.
- VBK - eBook format from VitalSource that features graphics support.
- XDXF - Dictionary exchange format based on XML.
- XEB - format used by Apbi eBooks in Chinese primarily.
- XHTML - specialized version of HTML designed to conform to XML rules of construction. It is the standard format for epub data.
- XML - general purpose markup language for exchange of data. In the context of eBooks it is generally confined to XHTML and RSS feeds although some other formats have been defined.
- WOLF - Proprietary format used by HanLin eBook in their V2B, V3, and V8 eBook readers. Usually a .wol extension is used. Also used by JCNIP on their Dr. Yi ebook reader.
- zTXT - format used by the WeaselReader on Palm devices. Has a .pdb extension.
- ZNO - proprietary format for Zinio electronic subscription magazines. These magazines include multimedia material like photos and videos. It is rumored that the format is based on DJVU.
[edit] Other File Types
A number of e-book readers and PDAs play music (usually in MP3) and display graphics (usually in JPG).
- LRC - A annotation format originally intended for Lyrics. It can also be used for read along eBooks. It is a text file that can be synced with audio or video files with an appropriate program.
[edit] Music Formats
- AA - Audible.com Audio proprietary format with four different levels of DRM
- AAC - Advanced Audio Codec is more of a container than a format as within an AAC the music can be encoded in multiple ways from iTunes M4P all the way to a lossless compression
- AAX - Enhanced audio from Audible.com. It is embedded with other features, such as images, graphs, maps, or links.
- MP3 - the currently most popular music compression format. It is widely used throughout the Internet and plays on almost every portable music player. This format is also used for some audio books.
- WMA - Windows Media Audio is an audio format developed by Microsoft to compete with MP3.
- OGG - Free, open standard container for Vorbis audio compression codec files, as well as for Free Lossless Audio Codec (FLAC), Speex speech compression codec, and Theora lossy video codec.
[edit] Graphic Formats
- BMP - BitMaP image file is an uncompressed graphics format developed by Microsoft.
- GIF - Graphics Interchange Format was developed in 1987 by CompuServe and is a lossless graphics format designed for the reproduction of line drawings rather than photographs. Widely used on the Internet for logotypes and drawings. The coding scheme is patented as it uses the LZW lossless compression scheme however the patents ran out in 2004. PNG was developed to replace GIF and has no patent issues. Main drawbacks are color support (max. 256 colors) and only 1-bit alpha channel (transparency bit).
- JPG - (or JPEG) stands for Joint Photographic Experts Group and a lossy compressed graphics format designed to support photographs rather than line art. Developed in 1992 and issued as the ISO 10918-1 standard in 1994, the quality depends directly on the amount of compression employed. Widely used on the Internet and by most digital camera manufacturers. A newer format is called JPEG 2000.
- PNG - Portable Network Graphics format is a bitmapped graphic format that employs a lossless compression system. Designed to improve upon and replace GIF files, PNG does not require a patent license. Main drawback is the complexity of its color model.
- SVG - a vector graphics format that is supported by ePUB.
- SWF - Shockwave Flash is currently the dominant format for displaying "animated" vector graphics on the Web.
- TIF - (or TIFF) Tagged Image File Format is a container that can hold images in a wide variety of bitmapped or even vector formats. They can also be compressed or uncompressed. If compressed they can use RLE, JPG, LZW, Zip or potentially other formats. This standard is owned by Adobe. Main drawback is that it is so versatile that saying that TIF is a supported format may mean nothing since there are really many TIFF formats.
[edit] Compressed Formats
These are lossless compression formats that reduce the amount of space required to store a document. Text, unlike music, can be compressed a great deal. Sometimes the compression can be as much as 90%. These formats are considered to be containers in that they can hold multiple files. In some case the ability to hold multiple files is more important than the actual compression.
Don't confuse compression formats with e-book formats. Although listed by some e-book readers as a supported format, these readers have only the ability to extract the compressed file and to get to the file or files inside. The reader must still support the actual underlying e-book format. Also, some e-book formats already include compression.
- RAR - a file compression system providing one of the most compact resultant files current available in wide distribution. The premier tool for RAR is WinRAR but 7ZIP works as well.
- ZIP - the most universal of the compression tools. Slightly less efficient than RAR files, ZIP files have been around longer and enjoy more support.
- LHA - a Japanese developed compressed archive file format. A Microsoft Compressed (LZH) Folder Add-on is included with the Japanese version of Windows to use this format.
- GZIP - A zip format (.gz) that was developed by the GNU team. It is designed to be zipped or unzipped on the fly and only supports one file. Often the file is a tar (.tar) format which is a container (archive) format. When used together the file extension is usually .tgz.
- BZIP2 - compresses files using the Burrows-Wheeler block sorting text compression algorithm, and Huffman coding. Compression is generally considerably better than that achieved by more conventional LZ77/LZ78-based compressors, and approaches the performance of the PPM (prediction by partial mapping) family of statistical compressors. The file extension is generally .bz2
[edit] Supported Format Matrix
the eBookwise and REB1200 are in the LCD Matrix
| Brandname | ||||||||||
| Model | ||||||||||
| Manufacturer | ||||||||||
| Display Type | ||||||||||
| eBooks | ||||||||||
| AZW | ||||||||||
| BBeB | ||||||||||
| CHM | ||||||||||
| DOC | ||||||||||
| ePUB | ||||||||||
| FB2 | ||||||||||
| HTML | ||||||||||
| IMP | ||||||||||
| MOBI | ||||||||||
| PalmDOC | ||||||||||
| PRC | ||||||||||
| RB | ||||||||||
| RTF | ||||||||||
| STK | ||||||||||
| TXT | ||||||||||
| Wolf | ||||||||||
| XEB | ||||||||||
| Music | ||||||||||
| AAC | ||||||||||
| MP3 | ||||||||||
| AA | ||||||||||
| Graphics | ||||||||||
| BMP | ||||||||||
| DJVU | ||||||||||
| GIF | ||||||||||
| JPG | ||||||||||
| PNG | ||||||||||
| TIF | ||||||||||
| Compressed | ||||||||||
| RAR | ||||||||||
| ZIP | ||||||||||
- Note 1 - Requires a manufacturer supplied conversion program and Word.
- Note 2 - Requires a manufacturer supplied conversion program.
- Note 3 - Only supported inside of a document.
- Note 4 - Only supported on PRS-505, PRS-700, PRS-600, PRS-300
- Note 5 - Chinese version of iLiad only
[edit] For more information
For more information on many file formats see http://www.fileformat.info/
http://www.writingonyourpalm.net/column011105.htm has information on Palm or Palm derived products primarily. Also see PDB.

