Gå til hovedinnhold

Encoding Detect tool

The Encoding Detect tool inspects the source text of every translatable block and records its encoding characteristics on the block's properties: a detected label (ascii, utf-8, or unknown) plus boolean flags for whether the text is valid UTF-8 and whether it is ASCII-only. Text that is ASCII-only is labelled ascii; otherwise valid UTF-8 is labelled utf-8; anything else is unknown.

The tool takes no configuration. It is useful for spotting non-ASCII content that may need particular care in downstream formats or fonts, and for confirming that text is well-formed UTF-8.

IDencoding-detect
SourceBuilt-in
Categoryanalysis
Cardinalitymonolingual

This tool has no configurable parameters.

Examples

Detect source encoding

The tool takes no parameters; add it to a flow with an empty configuration.

{}

Processing notes

  • Operates on translatable blocks; non-translatable structure passes through unchanged.

Limitations

  • Examines source text only; target translations are not classified.

  • Detection is limited to ASCII / UTF-8 validity, not a full charset-detection heuristic.

← Back to the Tool Reference