PDF (PDFium) format (.pdf)
Text + geometry extraction via PDFium.
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
geometry | boolean | false | Emit one block per positioned text rectangle, each carrying a bounding box (for the visual layout view), instead of one plain-text block per page. |
glyphs | boolean | false | Additionally attach per-character bounding boxes to each text block (implies geometry). Enables character-precise highlighting; slightly slower. |
Configure these parameters interactively and copy the YAML on the Format Reference.
← Back to the Format Reference