Encoding Convert tool
The Encoding Convert tool validates and normalizes block text against a target character encoding. It round-trips the text — encoding to the target charset and decoding back to UTF-8 — which surfaces and replaces characters the target cannot represent. The chosen encoding name is recorded in a block property so a downstream writer can emit the document in that encoding.
Before converting, the tool can decode escape sequences found in the input —
numeric character references, HTML character entity references, and
Java-style \uXXXX escapes — so that the real characters are evaluated
against the target encoding. A target encoding name is required, and a target
locale is required when applying to the target.
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
applySource | boolean | false | Apply encoding conversion to source text |
applyTarget | boolean | true | Apply encoding conversion to target text |
escapeAll | boolean | false | Escape all extended (non-ASCII) characters in output |
reportUnsupported | boolean | true | Report characters not supported by the target encoding |
targetEncoding | string | Target encoding name (e.g. utf-8 or iso-8859-1 or shift-jis) | |
targetLocale | string | Target locale for processing | |
unescapeCER | boolean | true | Unescape HTML character entity references (e.g. á) when reading input |
unescapeJava | boolean | true | Unescape Java-style \\uXXXX escape sequences when reading input |
unescapeNCR | boolean | true | Unescape numeric character references (e.g. á) when reading input |
Configure these parameters interactively and copy the flow-step YAML on the Tool Reference.
Examples
Normalize targets to ISO-8859-1
Validate target text against Latin-1 before writing.
targetEncoding: iso-8859-1 targetLocale: fr-FR
Decode entities, then normalize to Shift-JIS
Unescape HTML entities in input and convert to a Japanese encoding.
targetEncoding: shift-jis targetLocale: ja-JP unescapeCER: true
Processing notes
Operates on translatable blocks only; non-translatable blocks pass through unchanged.
The target encoding name is written to a block property for downstream writers.
Limitations
Conversion validates representability by round-tripping through the encoding; characters the encoding cannot represent are replaced.
← Back to the Tool Reference