Rich Text Format (.rtf)

RTF (Rich Text Format) is Microsoft's text-based document interchange format, structured as a stream of control words and groups. It is also the carrier for Trados-tagged bilingual RTF. neokapi walks the RTF token stream, extracts text from paragraph runs as translatable Blocks, and skips non-translatable destinations — font, colour, and stylesheet tables, document info, pictures, and fields.
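As a rough illustration of the walk described above, the following Python sketch tokenises an RTF string, tracks group depth, and drops every group that opens with a skipped destination. It is not neokapi's implementation. The names `SKIPPED_DESTINATIONS`, `RTF_TOKEN`, and `extract_text` are hypothetical, and the sketch makes several simplifying assumptions: every `\*` ignorable destination is skipped wholesale, header and footer groups are skipped to mirror the default behaviour, escapes are left undecoded, and skipped content is discarded rather than kept as skeleton.

```python
import re

# Destinations whose whole group is treated as non-translatable (assumed list).
SKIPPED_DESTINATIONS = {
    "fonttbl", "colortbl", "stylesheet", "info", "pict", "field",
    "header", "footer",
}

# One token per match: control word, control symbol, brace, or plain text.
RTF_TOKEN = re.compile(
    r"\\([a-z]+)(-?\d+)? ?"   # control word, optional numeric parameter, delimiter space
    r"|\\(.)"                 # control symbol such as \* or \{
    r"|([{}])"                # group open / close
    r"|([^\\{}\r\n]+)"        # plain text
    r"|[\r\n]+"               # raw line breaks carry no meaning in RTF
)


def extract_text(rtf: str) -> str:
    """Return the text of paragraph runs, ignoring skipped destinations."""
    out = []
    depth = 0
    skip_depth = None  # depth of the group currently being skipped, if any
    for match in RTF_TOKEN.finditer(rtf):
        word, _param, symbol, brace, text = match.groups()
        if brace == "{":
            depth += 1
        elif brace == "}":
            if skip_depth is not None and depth == skip_depth:
                skip_depth = None  # the skipped group has just closed
            depth -= 1
        elif skip_depth is not None:
            continue  # inside a skipped destination
        elif word in SKIPPED_DESTINATIONS or symbol == "*":
            skip_depth = depth  # skip the rest of the enclosing group
        elif word == "par":
            out.append("\n")
        elif text:
            out.append(text)
    return "".join(out)
```

Given a small document such as `{\rtf1\ansi{\fonttbl{\f0 Arial;}}\f0 Hello world\par}`, the sketch yields only the paragraph text followed by a newline. The font table never reaches the output.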

neokapi reads and writes RTF. On read it decodes the common RTF escapes (\uN Unicode escapes, \'HH hex escapes, and special-character control words such as \par, \tab, \emdash, and the smart-quote words) and preserves non-translatable structure as skeleton. By default it does not extract header and footer text, annotation (comment) text, or bookmark text. The upstream Okapi RTF filter exposes no parameters, and the neokapi-side extraction toggles are not surfaced as a configurable JSON schema in this reference.

ID: rtf
Source: Built-in
Extensions: .rtf
MIME Types: application/rtf, text/rtf
Capabilities: Read + Write

This format has no configurable parameters.

Processing notes

  • Translatable text comes from paragraph runs; control tables, info, pictures, and fields are skipped.

  • Decodes \uN Unicode escapes, \'HH hex escapes, and special-character control words (\par, \tab, \emdash, smart quotes).

  • Non-translatable destinations are preserved as skeleton for round-trips.
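The escape decoding listed in these notes can be sketched in a few lines of Python. This is an illustration under stated assumptions, not neokapi's code: the names `SPECIAL_WORDS`, `ESCAPE`, and `decode_rtf_run` are hypothetical, the code page is assumed to be Windows-1252, each \uN is assumed to be followed by at most one fallback character (the RTF default of \uc1), and any control word not in the table is simply dropped. Group braces are not handled here.

```python
import re

# Special-character control words and the characters they stand for.
SPECIAL_WORDS = {
    "par": "\n",
    "tab": "\t",
    "emdash": "\u2014",
    "endash": "\u2013",
    "lquote": "\u2018",
    "rquote": "\u2019",
    "ldblquote": "\u201c",
    "rdblquote": "\u201d",
}

ESCAPE = re.compile(
    r"\\u(-?\d+) ?(?:\\'[0-9a-fA-F]{2}|[^\\{}\r\n])?"  # \uN plus one fallback char
    r"|\\'([0-9a-fA-F]{2})"                             # \'HH hex escape
    r"|\\([a-z]+)(-?\d+)? ?"                            # any other control word
    r"|\\([\\{}])"                                      # escaped \, { or }
    r"|[\r\n]+"                                         # raw line breaks are ignored
)


def decode_rtf_run(run: str, codepage: str = "cp1252") -> str:
    """Decode the escapes in one run of RTF text (assumes \\uc1)."""

    def replace(match: re.Match) -> str:
        unicode_value, hex_byte, word, _param, literal = match.groups()
        if unicode_value is not None:
            # N is a signed 16-bit value; masking maps negatives to 0..65535.
            return chr(int(unicode_value) & 0xFFFF)
        if hex_byte is not None:
            return bytes.fromhex(hex_byte).decode(codepage, errors="replace")
        if word is not None:
            return SPECIAL_WORDS.get(word, "")  # unknown control words are dropped
        if literal is not None:
            return literal
        return ""  # raw CR/LF

    decoded = ESCAPE.sub(replace, run)
    # Join UTF-16 surrogate halves that arrived as two separate \uN escapes.
    return decoded.encode("utf-16-le", "surrogatepass").decode("utf-16-le", "replace")


print(decode_rtf_run(r"\ldblquote Caf\'e9\rdblquote"))  # prints “Café”
```

The fallback character after \uN exists for readers that do not understand Unicode escapes, so a Unicode-aware reader consumes and discards it, which is what the first pattern does.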

Limitations

  • Header/footer, annotation (comment), and bookmark text are not extracted by default.

  • The neokapi extraction toggles have no upstream Okapi parameter and are not surfaced as configurable schema properties in this reference.
