$ man unicode-normalize
/unicode-normalize
PRICE / CALL
$0.005
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
wordmintCATEGORY
uncategorized
STATUS
● live
NAME
unicode-normalize — normalizes unicode text and flags lookalike or hidden characters used in spoofing and phishing
SYNOPSIS
POST https://x402.agentutility.ai/unicode-normalize
Content-Type: application/json
X-PAYMENT: <signed-transferWithAuthorization>
{ ... }↳ first call →
402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.DESCRIPTION
Normalizes Unicode text and flags lookalike or hidden characters used in spoofing and phishing. Normalizes to NFC (default), NFD, NFKC, or NFKD, classifies every codepoint by script (Latin / Cyrillic / Greek / Hebrew / Arabic / CJK / Hangul / etc.), flags Cyrillic / Greek / Latin Extended homoglyphs (the Cyrillic 'а' that looks like Latin 'a', etc.) with their position, codepoint, and the ASCII char they impersonate, and surfaces hidden / formatting characters like zero-width spaces, RTL overrides, and BOMs. Use it for homoglyph detection, IDN spoof checks, invisible-character and zero-width scans, and phishing detection.
INPUT — request schema
| property | type | description | req? |
|---|---|---|---|
| text | string | Input text to normalize and analyze. Up to 100000 chars. | required |
| form | string | Unicode normalization form. Default 'NFC'. enum: NFC · NFD · NFKC · NFKD | optional |
OUTPUT — response shape
| field | type | description |
|---|---|---|
| normalized | string | Input text after Unicode normalization to the requested form (NFC by default). |
| form | string | Normalization form applied: NFC, NFD, NFKC, or NFKD. |
| scripts_detected | string | List of Unicode scripts found in the input (Latin, Cyrillic, Greek, CJK, etc.). |
| homoglyph_warnings | string | Array of suspicious lookalike codepoints with position, codepoint, and the ASCII char they impersonate. |
| hidden_chars | string | Array of invisible or formatting chars found (zero-width spaces, RTL overrides, BOMs) with positions. |
| is_mixed_script | string | True when input mixes scripts in a way that suggests IDN spoofing or phishing. |
| source | string | Origin tag for the result, e.g. local JS normalization. |
EXAMPLES — two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/unicode-normalize \
-H 'Content-Type: application/json' \
-d '{ }'first response =
402 Payment Required with payment requirements; sign + retry with X-PAYMENT.EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster npx -y @agentutility/mcp-<cluster> # Required: EVM private key with USDC on Base export X402_PRIVATE_KEY=0x... # Then call the unicode-normalize tool from your MCP-aware agent.
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
- tags
- wordmintunicode-normalizehomoglyph-detectionidn-spoofzero-width-charsphishing-detectionnfc-nfkclookalike-chars
- methods
- POST
- cluster
- wordmint
- price
- $0.005 USDC per call
ADJACENT — other endpoints in wordmint
| endpoint | description | price |
|---|---|---|
| brand-tagline | Generates brand taglines and slogans for launch pages, X bios, email copy, and product cards. | $0.005 |
| brand-tagline-generate | Generates tagline options for a brand or startup from its name, concept, audience, and tone. | $0.005 |
| card-resolve | Normalizes free-form graded card text into a canonical card object. | $0.005 |
| content-simhash | Fingerprints text with a 64-bit SimHash for near-duplicate detection, computed entirely locally. | $0.005 |
| cron-parse | Cron parser. | $0.005 |
| detect-language | Language detector / language identification. | $0.005 |
| dictionary-define | Looks up English word definitions with pronunciation, part of speech, and synonyms. | $0.005 |
| embedding-similarity | Measures how semantically similar two strings are: embeds both via Venice (default model: text-embedding-bge-m3) and returns the cosine s… | $0.005 |
SEE ALSO