
What formats does calibre support conversion to/from?
What are the best source formats to convert?
I converted a PDF file, but the result has various problems?
How do I convert my file containing non-English characters, or smart quotes?
What’s the deal with Table of Contents in MOBI files?
How do I convert a collection of HTML files in a specific order?
The EPUB I produced with calibre is not valid?
How do I use some of the advanced features of the conversion tools?

What formats does calibre support conversion to/from?

calibre supports the conversion of many input formats to many output formats. It can convert every input format in the following list to every output format.

Input Formats: AZW, AZW3, AZW4, CBZ, CBR, CB7, CBC, CHM, DJVU, DOCX, EPUB, FB2, FBZ, HTML, HTMLZ, LIT, LRF, MOBI, ODT, PDF, PRC, PDB, PML, RB, RTF, SNB, TCR, TXT, TXTZ

Output Formats: AZW3, EPUB, DOCX, FB2, HTMLZ, OEB, LIT, LRF, MOBI, PDB, PMLZ, RB, PDF, RTF, SNB, TCR, TXT, TXTZ, ZIP

PRC is a generic format; calibre supports PRC files with TextRead and MOBIBook headers. calibre supports eReader, Plucker (input only), PML and zTxt PDB files. DJVU support is only for converting DJVU files that contain embedded text. These are typically generated by OCR software. MOBI books can be of two types, Mobi6 and KF8. […] azw3 file extensions. DOCX files from Microsoft Word 2007 and newer are supported.

What are the best source formats to convert?

In order of decreasing preference: LIT, MOBI, AZW, EPUB, AZW3, FB2, FBZ, DOCX, HTML, PRC, ODT, RTF, PDB, TXT, PDF

I converted a PDF file, but the result has various problems?

PDF is a terrible format to convert from. For a list of the various issues you will encounter when converting PDF, see: Convert PDF documents.

How do I convert my file containing non-English characters, or smart quotes?

There are two aspects to this problem:

Knowing the encoding of the source file: calibre tries to guess what character encoding your source files use, but often this is impossible, so you need to tell it what encoding to use. This can be done in the GUI via the Input character encoding field in the Look & feel → Text section of the conversion dialog. The command-line tool ebook-convert has an --input-encoding option.

When adding HTML files to calibre, you may need to tell calibre what encoding the files are in. To do this, go to Preferences → Advanced → Plugins → File type and customize the HTML to ZIP plugin, telling it what encoding your HTML files are in. Now when you add HTML files to calibre they will be correctly processed. HTML files from different sources often have different encodings, so you may have to change this setting repeatedly. A common encoding for many files from the web is cp1252 and I would suggest you try that first.
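If you are unsure which encoding to give calibre, a quick check outside calibre can narrow it down. The following is a minimal Python sketch, not part of calibre: it tries UTF-8 first and falls back to cp1252 (the encoding suggested above), then builds an ebook-convert command line that passes the result via --input-encoding. The function names guess_encoding and convert_command, and the file names book.txt and book.epub, are hypothetical and chosen only for illustration.

```python
from typing import Iterable, List, Optional

# Candidate encodings to try, in order. utf-8 goes first because it rejects
# most non-UTF-8 byte sequences, while cp1252 accepts almost any input.
CANDIDATES = ("utf-8", "cp1252")


def guess_encoding(data: bytes, candidates: Iterable[str] = CANDIDATES) -> Optional[str]:
    """Return the first candidate encoding that decodes data without error."""
    for name in candidates:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            continue
        return name
    return None


def convert_command(src: str, dst: str, encoding: str) -> List[str]:
    """Build an ebook-convert argument list that states the input encoding."""
    return ["ebook-convert", src, dst, "--input-encoding", encoding]


if __name__ == "__main__":
    # Smart quotes stored as cp1252 bytes (0x93 and 0x94); not valid UTF-8.
    sample = b"\x93Hello\x94"
    enc = guess_encoding(sample)
    print(enc)
    print(sample.decode(enc))
    # Hypothetical file names; pass the list to subprocess.run() to actually
    # run the conversion on a machine with calibre installed.
    print(convert_command("book.txt", "book.epub", enc))
```

A successful decode only shows that the bytes are valid in that encoding, not that the text is rendered as the author intended, so it is still worth checking that accented characters and smart quotes look right after conversion.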
