EPUB E-Book format (.epub)

EPUB is the open e-book format: a ZIP container (OCF) packaging XHTML and SVG content documents alongside a Package Document (OPF) that lists the manifest and the reading-order spine. neokapi reads the META-INF/container.xml entry to locate the OPF, parses the OPF to resolve the spine, and walks each spine item as a child layer of the document.
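The lookup chain described above (container.xml, then the OPF, then the spine) can be sketched with the Python standard library. This is only an illustration of the EPUB structure, not neokapi's implementation; the function name `spine_paths` is hypothetical.

```python
import posixpath
import zipfile
import xml.etree.ElementTree as ET

# XML namespaces defined by the OCF container and OPF package formats.
CONTAINER_NS = {"c": "urn:oasis:names:tc:opendocument:xmlns:container"}
OPF_NS = {"opf": "http://www.idpf.org/2007/opf"}


def spine_paths(epub: zipfile.ZipFile) -> list[str]:
    """Return the ZIP entry names of the spine items, in reading order.

    Hypothetical helper for illustration only.
    """
    # Step 1: META-INF/container.xml names the Package Document (OPF).
    container = ET.fromstring(epub.read("META-INF/container.xml"))
    rootfile = container.find("c:rootfiles/c:rootfile", CONTAINER_NS)
    opf_path = rootfile.attrib["full-path"]
    opf_dir = posixpath.dirname(opf_path)

    # Step 2: the manifest maps item ids to hrefs relative to the OPF.
    package = ET.fromstring(epub.read(opf_path))
    manifest = {
        item.attrib["id"]: item.attrib["href"]
        for item in package.findall("opf:manifest/opf:item", OPF_NS)
    }

    # Step 3: the spine lists item ids in reading order.
    return [
        posixpath.normpath(posixpath.join(opf_dir, manifest[ref.attrib["idref"]]))
        for ref in package.findall("opf:spine/opf:itemref", OPF_NS)
    ]
```

The sketch assumes a well-formed package with a single rootfile and does not handle percent-encoded hrefs or missing manifest entries.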

neokapi reads and writes EPUB, round-tripping the container so the translated output remains a valid e-book. Content documents are parsed as embedded XHTML: when an HTML sub-format reader is wired, each spine item is delegated to it; otherwise neokapi falls back to a direct XHTML walk that treats common block elements (paragraphs, headings, list items, table cells, figure and table captions, blockquotes, and the document title) as translatable. The OPF, container metadata, and non-content resources are preserved as skeleton. This format has no configurable JSON schema in this reference.
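As a rough illustration of the fallback walk, the sketch below collects text from the block elements listed above using the Python standard library. It is not neokapi's code: the tag set is one reading of that list (including `th` under "table cells" is an assumption), the name `translatable_blocks` is hypothetical, and the rule of skipping a block that contains another block is a simplification chosen here to avoid extracting the same text twice.

```python
import xml.etree.ElementTree as ET

# Assumed mapping of the documented block elements to XHTML tag names.
BLOCK_TAGS = {
    "title", "p", "h1", "h2", "h3", "h4", "h5", "h6",
    "li", "td", "th", "figcaption", "caption", "blockquote",
}


def local_name(tag: str) -> str:
    """Strip the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def translatable_blocks(xhtml) -> list[str]:
    """Return the text of each innermost block element, in document order.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(xhtml)
    blocks = []
    for el in root.iter():
        if local_name(el.tag) not in BLOCK_TAGS:
            continue
        # Skip containers such as <blockquote><p>...</p></blockquote>;
        # the nested block is reported on its own.
        if any(local_name(d.tag) in BLOCK_TAGS for d in el.iter() if d is not el):
            continue
        # Join inline content and collapse runs of whitespace.
        text = " ".join("".join(el.itertext()).split())
        if text:
            blocks.append(text)
    return blocks
```

This version flattens inline markup to plain text and will fail on named HTML entities such as `&nbsp;`, which the standard-library XML parser does not resolve; a real reader would have to keep inline tags as placeholders so they survive translation.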

ID: epub
Source: Built-in
Extensions: .epub
MIME Types: application/epub+zip
Capabilities: Read + Write

This format has no configurable parameters.

Processing notes

  • The OPF is located via META-INF/container.xml; spine items are walked in reading order, each as a child layer.

  • Content documents are processed as embedded XHTML, either through an HTML sub-format reader or a direct token walk.

  • The container structure, OPF, and non-content resources are preserved as skeleton for round-trips.
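To show what preserving the container on a round-trip involves, here is a minimal sketch that copies an EPUB and swaps in replacement content documents. It is an assumption-laden illustration rather than neokapi's writer: `rewrite_epub` and its `replacements` mapping are hypothetical names. The one container rule it relies on comes from the OCF specification: the `mimetype` entry must be the first entry in the archive and must be stored uncompressed.

```python
import zipfile


def rewrite_epub(src, dst, replacements: dict[str, bytes]) -> None:
    """Copy the EPUB at src to dst, replacing the entries named in replacements.

    Hypothetical helper for illustration only. src and dst may be paths
    or binary file objects.
    """
    with zipfile.ZipFile(src) as zin, zipfile.ZipFile(dst, "w") as zout:
        # OCF rule: 'mimetype' comes first and is stored without compression.
        zout.writestr(
            "mimetype", zin.read("mimetype"), compress_type=zipfile.ZIP_STORED
        )
        for info in zin.infolist():
            # Skip the entry already written and explicit directory entries.
            if info.filename == "mimetype" or info.is_dir():
                continue
            # Untouched entries (OPF, CSS, images, ...) are copied byte for byte.
            data = replacements.get(info.filename, zin.read(info.filename))
            zout.writestr(info.filename, data, compress_type=zipfile.ZIP_DEFLATED)
```

A call such as `rewrite_epub("book.epub", "book.fr.epub", {"OEBPS/text/ch1.xhtml": translated_bytes})` would leave every other entry intact; the file names in that call are placeholders.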

Limitations

  • The fallback XHTML walk extracts a fixed set of block elements; richer extraction depends on an HTML sub-format reader being wired into the pipeline.

  • Content documents that appear in the OPF manifest but not in the spine (the reading order) are not walked for translatable text.
