Skip to main content

PDF (PDFium) format (.pdf)

Text + geometry extraction via PDFium.

IDpdf
SourcePlugin
Extensions.pdf
MIME Typesapplication/pdf
CapabilitiesRead

Parameters

ParameterTypeDefaultDescription
geometrybooleanfalseEmit one block per positioned text rectangle, each carrying a bounding box (for the visual layout view), instead of one plain-text block per page.
glyphsbooleanfalseAdditionally attach per-character bounding boxes to each text block (implies geometry). Enables character-precise highlighting; slightly slower.

Configure these parameters interactively and copy the YAML on the Format Reference.

← Back to the Format Reference