E-book formats

From MobileRead

Jump to: navigation, search

Contents

[edit] eBooks Formats

This section attempts to define and identify all (or most) of the eBooks formats. With the great proliferation of formats a new user can easily become confused. The most important ones are always the ones that work on the device or devices you own but if you have a choice the most important ones are the ones that have the most eBook dealers or most eBooks available. Today these formats include AZW Amazon, MOBI MobiPocket, LIT Microsoft, PNPd Palm eReader, and BBeB Sony. Another very popular format is PDF from Adobe, however it does not tend to work as well on portable mobile devices due to the screen size. It is more suited to computers since it often expects full paper size pages. Some of the formats in the list below are only available for a few or even one type of device. Some are more standardized. Be sure your device will read the format you choose.

  • AZW -€“ The Amazon proprietary format. Seems to be a MOBI format sometimes with and sometimes without DRM.
  • AZW1 -€“ The Amazon proprietary format. It is the TPZ format always with a custom DRM.
  • BBeB -€“ The Sony proprietary format. Stands for Broadband EBook
  • CBR/CBZ -€“ Compressed container for images. The R means it is RAR compressed while Z means it is zipped compressed. The internal format can be any of several image formats and CBR/CBZ readers display these as multiple pages of a book. The name means Comic Book Reader but the use is for any book where the content contains basically pictures.
  • CHM -€“ Compressed HTML, often used for Windows help files. It has become very popular for distribution of texts and other support materials over the Web.
  • DJVU -€“ format by Lizardtech that is more and more widely used for scientific publications. Its main characteristics is that the compress ratio is about 10x better than in .pdf format at the same quality. Nothing beats it at the moment for b&w text and pictures.
  • DOC - This format could be a document in Microsoft Word format, which uses a .doc extension, or in PalmDOC
  • EBA - proprietary eBook format. If you have information on it, please add it here.
  • EBAML - EBA 2.0 see above - Used on the Dr. Yi Reader
  • ePUB standard. An open format defined by the Open eBook Forum of the International Digital Publishing Forum (<idpf>). It is based on XHTML and XML. It is an evolving standard. Current specifications are found at the idpf web site.
  • FB2 -€“ FictionBook format, based on XML and viewable by various e-book software solutions for Windows, Linux, PocketPC and Palm OS. Used by the HaaliReader, FBReader and the PalmFiction.
  • FUB -€“ Franklin eBook format.
  • GPF -€ Ganaxa Publishing Format, allowing hot spots and embedded rich media content.
  • GPX -€ A protected Ganaxa document.“
  • HTML -€“ Hyper Text Markup Language is the backbone of the World Wide Web. Many texts are distributed in this format. In addition, some eBook readers support Cascading Style Sheets (CSS) that are basically a master style guide for HTML pages.
  • IMP -€“ an eBook format used by RCA REB 1100/REB 1200, eBook Technologies ETI-1/ETI-2, Gemstar 1150, and EBookwise-1150. Some programs convert to it. No known program converts from it.
  • -IS.PDB - A Palm database format for the ISilo reader. The IS is used to distinguish this format.
  • KML -€“ HieBook eBook format.
  • LIT -€“ Microsoft's native format for its Microsoft Reader
  • LRF -€“ also: BBeB book. Sony's proprietary format. Supported by the Sony Librie and Sony Reader
  • LRS -€“ also: BBeB Xylog XML. Source format for BBeB books, which is compiled into LRF for reading on the device
  • LRX -€“ A protected BBeB document. The Sony Librie and Sony Reader use mutually incompatible formats
  • MOBI -€“ MobiPocket format, usable with MobiPocket's own reading software on almost any PDA and Smartphones. Mobipocket's Windows PC software can convert .chm, .doc, .html, .ocf, .pdf, .rtf, and .txt files to this format
  • OEB -€“ Open eBook format. an ebook format used by EBookwise-1150, MobiPocket, and Microsoft Reader. This standard is an older version of ePUB
  • PDB -€“ Palm Database File. Can hold several different e-book formats targeting Palm-enabled devices, commonly used for PalmDOC (AportisDoc) e-books and eReader formats as well and many others.
  • PDF -€“ Portable Document Format created by Adobe for their Acrobat products. It is the defacto standard for document interchange. Software support exists for almost every computer platform and handheld device. Some devices, the Sony Portable Reader among them, have problems with PDF since most content available is scaled for either A4 or letter format, both of which are not easily readable when reduced to fit on small screens. Some e-book readers, including the iRex iLiad, have a pan-and-zoom feature that aids readability, but extracts a price in ergonomics.
  • PKG a format used on the Apple Newton.
  • PNPd -€ format used by the eReader program. This is a popular format.
  • PRC -€“ Palm Resource File. Often holds a Mobipocket eBook but occasionally holds an eReader or AportisDoc eBook. Most often it is a Palm executable.
  • RB -€“ Rocket eBook format made for the Rocket eBook device and the Gemstar RCA REB 1100.
  • RTF -€“ Rich Text Format is a document interchange format available supported by some e-book readers, and also by many Word processing applications including MS Word and OpenOffice. It is the preferred format for many users who create their own content for the Sony Reader.
  • STK -€“ STAReBOOK's proprietary format
  • TCR -€“ eBook for EPOC
  • TeBR -€“ Tiny Ebook Reader format also used by Fictionwise.
  • TPZ - Topaz file extension used on Amazon Kindle
  • TXT -€“ Text is the base type, with no formatting applied other than space, paragraph, end of line, new line, and tab. It is usable in many e-book devices.
  • TR -€“ Tome Raider file. Tome Raider is an eBook format that features support for very large books such as reference books, encyclopedias, and dictionaries. Their latest format is called TR3.
  • VBK -€“ eBook format from VitalSource that features graphics support.
  • XEB - format used by Apbi eBooks in Chinese primarily.
  • XHTML - specialized version of HTML designed to conform to XML rules of construction. It is the standard format for epub data.€“
  • XML - general purpose markup language for exchange of data. In the context of eBooks it is generally confined to XHTML and RSS feeds although some other formats have been defined.
  • WOLF -€“ Proprietary format used by HanLin eBook in their V2B, V3, and V8 eBook readers. Usually a .wol extension is used. Also used by JCNIP on their Dr. Yi ebook reader.
  • zTXT -€“ format used by the WeaselReader on Palm devices. Has a .pdb extension.

[edit] Other File Types

A number of e-book readers and PDAs play music (usually in MP3) and display graphics (usually in JPG).

[edit] Music Formats

  • AA -€“ Audible.com Audio proprietary format with four different levels of DRM
  • AAC -€“ Advanced Audio Codec is more of a container than a format as within an AAC the music can be encoded in multiple ways from iTunes M4P all the way to a lossless compression
  • MP3 -€“ the currently most popular music compression format. It is widely used throughout the Internet and plays on almost every portable music player. This format is also used for some audio books.
  • WMA - Windows Media Audio is an audio format developed by Microsoft to compete with MP3.

[edit] Graphic Formats

  • BMP -€“ BitMaP image file is an uncompressed graphics format developed by Microsoft.
  • GIF -€“ Graphics Interchange Format was developed in 1987 by CompuServe and is a lossless graphics format designed for the reproduction of line drawings rather than photographs. Widely used on the Internet for logotypes and drawings. The coding scheme is patented as it uses the LZW lossless compression scheme however the patents ran out in 2004. PNG was developed to replace GIF and has no patent issues. Main drawbacks are color support (max. 256 colors) and only 1-bit alpha channel (transparency bit).
  • JPG -€“ (or JPEG) stands for Joint Photographic Experts Group and a lossy compressed graphics format designed to support photographs rather than line art. Developed in 1992 and issued as the ISO 10918-1 standard in 1994, the quality depends directly on the amount of compression employed. Widely used on the Internet and by most digital camera manufacturers. A newer format is called JPEG 2000.
  • PNG -€“ Portable Network Graphics format is a bitmapped graphic format that employs a lossless compression system. Designed to improve upon and replace GIF files, PNG does not require a patent license. Main drawback is the complexity of its color model.
  • TIF -€“ (or TIFF) Tagged Image File Format is a container that can hold images in a wide variety of bitmapped or even vector formats. The can also be compressed or uncompressed. If compressed they can use RLE, JPG, LZW, Zip or potentially other formats. This standard is owned by Adobe. Main drawback is that it is so versatile that saying that TIF is a supported format may mean nothing since there are really many TIFF formats.

[edit] Compressed Formats

These are lossless compression formats that reduce the amount of space required to store a document. Text, unlike music, can be compressed a great deal. Sometimes the compression can be as much as 90%.

Don't confuse compression formats with e-book formats. Although listed by some e-book readers as a supported format, these readers have only the ability to extract the compressed file and to get to the file or files inside. The reader must still support the actual underlying e-book format. Also, some e-book formats already include compression.

  • RAR -€“ a file compression system providing one of the most compact resultant files current available in wide distribution. The premier tool for RAR is WinRAR
  • ZIP -€“ the most universal of the compression tools. Support for ZIP files is built into Windows XP, Windows MCP, and Windows Vista. Slightly less efficient than RAR files, ZIP files have been around longer and enjoy more support. The standard has been released to the public by its creator, Phil Katz. The two most prominent ZIP utilities are PKZip and WinZip. Info-Zip is an open source command line version that is available for many different operating systems. There is also an open source option at Yahoo widgets where there is a simple utility called ZIPPER which works effortlessly. You simply drag your file into the window for zipping or unzipping.
  • LHA - a Japanese developed compressed archive file format. A Microsoft Compressed (LZH) Folder Add-on is included with the Japanese version of Windows to use this format.

[edit] Supported Format Matrix

Brandname
HanLin eBook
HanLin eBook
HanLin eBook
Sony Portable Reader
iRex iLiad
Amazon Kindle
STAReBOOK
Bookeen
eBookwise 1150
REB 1200
Model
V8
V3
V2
PRS-500 PRS-505
ER-0100
D00111
STK-101
"Gen 3"
1150
1200
Manufacturer
Jinke
Jinke
Jinke
Sony
iRex Technologies
Hon Hai Precision
eREAD
eREAD
ETI
ETI
Display Type
e-ink
e-ink
e-ink
e-ink
e-ink
e-ink
e-ink
e-ink
Backlit Gray LCD
Backlit Color LCD
eBooks
AZW
Yes
BBeB
Yes
CHM
Yes
Yes
DOC
Yes
Yes
Note 1
Note 1
Note 1
HTML
Yes
Yes
Yes
Yes
Yes
Note 2
Note 2
IMP
Yes
Yes
MOBI
Yes
Yes
Yes
PalmDOC
Yes
PRC
Yes
Yes
Yes
PDF
Yes
Yes
Yes
Yes
Yes
Yes
RB
Note 2
Note 2
RTF
Yes
Yes
Yes
Note 2
Note 2
STK
Yes
TXT
Yes
Yes
Yes
Yes
Yes
Yes
Yes
Note 2
Note 2
Wolf
Yes
Yes
Yes
XEB
Yes
Yes
DJVU
Yes
FB2
Yes
Music
AAC
Yes
MP3
Yes
Yes
Yes
Yes
Yes
Yes
Yes
Graphics
BMP
Yes
Yes
Note 3
Note 3
GIF
Yes
Yes
Yes
Note 3
Note 3
JPG
Yes
Yes
Yes
Yes
Yes
Yes
Note 3
Note 3
PNG
Yes
Yes
Yes
Yes
Note 3
Note 3
TIF
Yes
Compressed
RAR
Yes
Yes
ZIP
Yes

Note 1 - Requires a manufacturer supplied conversion program and Word.

Note 2 - Requires a manufacturer supplied conversion program.

Note 3 - Only supported inside of a document.

Personal tools
MobileRead Networks