Tool Reference
Processing tools transform content as it streams through a flow — translating, validating, analyzing, and converting blocks. Tools are grouped by category below. Select one to read its documentation and configure its parameters live; the YAML output drops into a flow step. Each tool has a shareable link.
Two sources, one grid. Each card is tagged by where the tool comes from; the Built-in/Okapi bridge filter narrows by that tag:
- Built-in (54) — the native processing tools maintained in neokapi, such as
word-count,pseudo-translate, andqa-check. - Okapi bridge (0) — the pipeline steps exposed by the optional Okapi bridge plugin, for compatibility with the Java Okapi Framework.
A handful of tools exist in both sources (a built-in and an Okapi twin share an id, e.g. word-count). To keep their static pages distinct, the Okapi twin's page is suffixed with -okapi — /reference/tools/word-count for the built-in and /reference/tools/word-count-okapi for the bridge.
54 of 54 tools
analysis10
AI Entity ExtractBuilt-inDetect named entities and terminology candidates in source content using an LLM.analysis8 paramsAI Terminology ExtractionBuilt-inExtract candidate terminology from source text using an LLM provider.analysis4 paramsCharacter CountBuilt-inCount the characters in each block's source and target text, with and without spaces.analysis3 paramsCharacters ListingBuilt-inCollect the distinct characters used across source and target text, with their frequencies.analysis2 paramsEncoding DetectBuilt-inClassify the source text of each block as ASCII, UTF-8, or unknown.analysisRepetition AnalysisBuilt-inIdentify repeated source segments across blocks for translation-memory leverage.analysis1 paramScoping ReportBuilt-inClassify each block into a scoping category from upstream repetition and match status.analysisSegment CountBuilt-inCount the source and target segments in each translatable block.analysisTranslation ComparisonBuilt-inCompare two target translations of the same source and report whether they match.analysis6 paramsWord CountBuilt-inCount the words in each block's source and target text for scoping and pricing.analysis3 params
quality13
AI Brand Voice CheckBuilt-inCheck text against a brand voice profile using an LLM and score brand compliance.quality4 paramsAI QA CheckBuilt-inCheck translation quality with an LLM and record the issues it finds.quality4 paramsAI ReviewBuilt-inReview each translation with an LLM, producing a score, assessment, and suggestion.quality3 paramsBrand Vocabulary CheckBuilt-inFlag forbidden, competitor, and preferred-term violations from a brand voice profile.qualityCharacters CheckBuilt-inCheck target text for forbidden, missing, out-of-charset, or corrupted characters.quality5 paramsDo-Not-Translate CheckBuilt-inVerify that do-not-translate terms survive verbatim into the target.quality1 paramInconsistency CheckBuilt-inFlag identical source strings translated differently, or different sources sharing one translation.quality5 paramsLength CheckBuilt-inValidate target length against absolute limits and source/target ratio thresholds.quality12 paramsPattern CheckBuilt-inVerify that configured regex patterns are preserved or absent in target text.qualityPlaceholder CheckBuilt-inVerify interpolation placeholders and numbered tags are preserved in the target.quality1 paramQA CheckBuilt-inRun a suite of rule-based quality checks comparing source and target text.quality26 paramsTerminology CheckBuilt-inVerify that required target terms appear when their source terms are present.quality1 paramXML ValidationBuilt-inCheck that the source and/or target text of each block is well-formed XML.quality4 params
text-processing22
Batch CollectorBuilt-inCollect blocks into fixed-size batches before forwarding them downstream.text-processing1 paramBOM ConvertBuilt-inAdd or remove the Unicode byte-order mark (BOM) on document layers.text-processing2 paramsCase TransformBuilt-inChange the letter case of source and/or target text to upper, lower, or title case.text-processing3 paramsCreate TargetBuilt-inCreate target segment containers for a locale, optionally seeded with the source text.text-processing4 paramsEncoding ConvertBuilt-inNormalize text through a target character encoding and unescape escape sequences on input.text-processing9 paramsExternal CommandBuilt-inRun an external command-line program on block text and use its stdout as the result.text-processing7 paramsFull Width ConvertBuilt-inConvert between half-width and full-width Latin characters and punctuation for CJK content.text-processing7 paramsInline Codes RemoveBuilt-inStrip inline-code runs (placeholders and paired codes) from segments, leaving plain text.text-processing5 paramsLayer ProcessorBuilt-inApply format-specific tool chains to embedded child layers within a document.text-processingLine Break ConvertBuilt-inNormalize line endings in block text to LF, CRLF, or CR.text-processing3 paramsProperties SetBuilt-inSet or modify key-value properties on blocks.text-processing3 paramsRedactBuilt-inReplace sensitive spans with protected placeholders before processing.text-processing4 paramsRemove TargetBuilt-inRemove target segments from blocks, by locale or by selected text-unit IDs.text-processing4 paramsScriptBuilt-inRun a user-supplied ES5 JavaScript script over each part to inspect or modify block text.text-processing4 paramsSearch and ReplaceBuilt-inFind and replace text in blocks, with optional regular-expression matching and flags.text-processing7 paramsSegmentationBuilt-inSplit block text into sentence-level segments using SRX-style rules.text-processing17 paramsSpan ClassifyBuilt-inReclassify generic markup inline codes into semantic vocabulary types.text-processingTag ProtectBuilt-inIdentify tags and placeholders in source text and mark them for downstream protection.text-processing1 paramUnredactBuilt-inRestore original values into redacted content after processing.text-processingURI ConvertBuilt-inEncode or decode URI percent-escape sequences in block text.text-processing4 paramsWhitespace CorrectBuilt-inNormalize and fix whitespace issues in target translations, including punctuation spacing.text-processing12 paramsXSLT TransformBuilt-inApply regex-based tag and text transformation rules to block text.text-processing4 params
translation9
AI TranslateBuilt-inTranslate block content using a configured LLM provider.translation6 paramsDeepL TranslateBuilt-inTranslate block content using DeepL.translation5 paramsDiff LeverageBuilt-inReuse translations from a previous document version for unchanged source blocks.translation2 paramsGoogle TranslateBuilt-inTranslate block content using Google Translate.translation5 paramsMicrosoft Translator TranslateBuilt-inTranslate block content using Microsoft Translator.translation5 paramsModernMT TranslateBuilt-inTranslate block content using ModernMT.translation5 paramsMyMemory TranslateBuilt-inTranslate block content using MyMemory.translation5 paramsPseudo TranslateBuilt-inGenerate deterministic pseudo-translations for localization-readiness testing.translation3 paramsTM LeverageBuilt-inPre-fill translations from a translation memory using exact and fuzzy matching.translation8 params