Rich Text Format format (.rtf)
RTF (Rich Text Format) is Microsoft's text-based document interchange format, structured as a stream of control words and groups. It is also the carrier for Trados-tagged bilingual RTF. neokapi walks the RTF token stream, extracts text from paragraph runs as translatable Blocks, and skips non-translatable destinations — font, colour, and stylesheet tables, document info, pictures, and fields.
neokapi reads and writes RTF. It decodes the common RTF escapes on read —
\uN Unicode escapes, \'HH hex escapes, and special-character control
words such as \par, \tab, \emdash, and the smart-quote words — and
preserves non-translatable structure as skeleton. By default it does not
extract header and footer text, annotation (comment) text, or bookmark text;
the upstream Okapi RTF filter exposes no parameters, and these neokapi-side
extraction toggles are not surfaced as a configurable JSON schema in this
reference.
This format has no configurable parameters.
Processing notes
Translatable text comes from paragraph runs; control tables, info, pictures, and fields are skipped.
Decodes
\uNUnicode escapes,\'HHhex escapes, and special-character control words (\par,\tab,\emdash, smart quotes).Non-translatable destinations are preserved as skeleton for round-trips.
Limitations
Header/footer, annotation (comment), and bookmark text are not extracted by default.
The neokapi extraction toggles have no upstream Okapi parameter and are not surfaced as configurable schema properties in this reference.
← Back to the Format Reference