$ man unicode-normalize

/unicode-normalize

PRICE / CALL
$0.005
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
wordmint
CATEGORY
uncategorized
STATUS
live
NAME
unicode-normalize - normalizes Unicode text and flags lookalike or hidden characters used in spoofing and phishing
SYNOPSIS
POST https://x402.agentutility.ai/unicode-normalize
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign a USDC transferWithAuthorization, then retry with the X-PAYMENT header.
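The call-then-retry flow above can be sketched as a small helper. This is a minimal sketch, not the service's client: `send` and `sign` are hypothetical callables supplied by the caller (an HTTP transport and a wallet signer), and the code assumes the 402 response body is JSON describing the payment requirements. How the X-PAYMENT value is actually produced is left entirely to `sign`.

```python
import json
from typing import Callable, Dict, Tuple

# Hypothetical transport: takes (headers, body bytes), returns (status, response text).
Send = Callable[[Dict[str, str], bytes], Tuple[int, str]]
# Hypothetical signer: takes the parsed 402 payment requirements,
# returns the string to place in the X-PAYMENT header.
Sign = Callable[[dict], str]


def call_with_payment(send: Send, sign: Sign, payload: dict) -> dict:
    """POST once; on 402, sign the payment requirements and retry once."""
    body = json.dumps(payload).encode("utf-8")
    headers = {"Content-Type": "application/json"}

    status, text = send(headers, body)
    if status == 402:
        # Assumption: the 402 body is JSON with the payment requirements.
        requirements = json.loads(text)
        paid_headers = dict(headers)
        paid_headers["X-PAYMENT"] = sign(requirements)
        status, text = send(paid_headers, body)

    if status != 200:
        raise RuntimeError(f"unexpected status {status}")
    return json.loads(text)
```

Passing the transport and signer in as arguments keeps the retry logic independent of any particular HTTP library or wallet, and makes it easy to exercise with fakes.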
DESCRIPTION

Normalizes Unicode text and flags lookalike or hidden characters used in spoofing and phishing. The text is normalized to NFC (default), NFD, NFKC, or NFKD, and every codepoint is classified by script (Latin / Cyrillic / Greek / Hebrew / Arabic / CJK / Hangul / etc.). Cyrillic, Greek, and Latin Extended homoglyphs (for example, the Cyrillic 'а' that looks like Latin 'a') are flagged with their position, codepoint, and the ASCII character they impersonate. Hidden and formatting characters such as zero-width spaces, RTL overrides, and BOMs are surfaced as well. Use it for homoglyph detection, IDN spoof checks, invisible-character and zero-width scans, and phishing detection.
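To make the terms in the description concrete, here is a local illustration using Python's standard `unicodedata` module. It is a sketch of the concepts only and is not the endpoint's implementation: the function names are hypothetical, hidden characters are approximated as the Unicode "Cf" (format) category, and the script guess is a rough heuristic based on the first word of each character's Unicode name.

```python
import unicodedata

FORMS = ("NFC", "NFD", "NFKC", "NFKD")


def normalize_forms(text: str) -> dict:
    """Return the text under each of the four normalization forms."""
    return {form: unicodedata.normalize(form, text) for form in FORMS}


def find_hidden_chars(text: str) -> list:
    """List format-category (Cf) characters, e.g. zero-width spaces,
    RTL overrides, and BOMs, with their positions."""
    return [
        {"position": i, "codepoint": f"U+{ord(ch):04X}"}
        for i, ch in enumerate(text)
        if unicodedata.category(ch) == "Cf"
    ]


def rough_scripts(text: str) -> set:
    """Heuristic script guess: first word of each letter's Unicode name."""
    scripts = set()
    for ch in text:
        if ch.isalpha():
            name = unicodedata.name(ch, "")
            if name:
                scripts.add(name.split()[0].title())
    return scripts


# "e" + combining acute composes under NFC; the "fi" ligature (U+FB01)
# is only expanded by the compatibility forms NFKC and NFKD.
forms = normalize_forms("e\u0301 \ufb01")
print(forms["NFC"] == "\u00e9 \ufb01", forms["NFKC"] == "\u00e9 fi")

# Cyrillic U+0430 hidden among Latin letters yields two scripts.
print(rough_scripts("p\u0430ypal"))
```

The strings use `\u` escapes so that the lookalike and invisible characters stay visible in the source.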

INPUT request schema
property  type    req?      description
text      string  required  Input text to normalize and analyze. Up to 100000 chars.
form      string  optional  Unicode normalization form. Default 'NFC'. enum: NFC · NFD · NFKC · NFKD
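A request body that follows the schema above can be built and checked before spending a paid call. This is a minimal sketch: `build_request_body` is a hypothetical helper, and it assumes the 100000-character limit is counted in Unicode code points (which is what Python's `len` measures); the service may count differently.

```python
import json

ALLOWED_FORMS = ("NFC", "NFD", "NFKC", "NFKD")
MAX_TEXT_CHARS = 100_000  # limit from the request schema


def build_request_body(text: str, form: str = "NFC") -> str:
    """Validate the inputs against the schema and return the JSON body."""
    if not isinstance(text, str):
        raise ValueError("text must be a string")
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    if form not in ALLOWED_FORMS:
        raise ValueError(f"form must be one of {ALLOWED_FORMS}")
    # json.dumps escapes non-ASCII by default, so lookalike and
    # invisible characters show up as \uXXXX in logs.
    return json.dumps({"text": text, "form": form})


print(build_request_body("p\u0430ypal.com", "NFKC"))
```

Rejecting an invalid `form` or oversized `text` locally avoids paying for a call that the schema suggests would be refused.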
OUTPUT response shape
field               type    description
normalized          string  Input text after Unicode normalization to the requested form (NFC by default).
form                string  Normalization form applied: NFC, NFD, NFKC, or NFKD.
scripts_detected    string  List of Unicode scripts found in the input (Latin, Cyrillic, Greek, CJK, etc.).
homoglyph_warnings  string  Array of suspicious lookalike codepoints with position, codepoint, and the ASCII char they impersonate.
hidden_chars        string  Array of invisible or formatting chars found (zero-width spaces, RTL overrides, BOMs) with positions.
is_mixed_script     string  True when input mixes scripts in a way that suggests IDN spoofing or phishing.
source              string  Origin tag for the result, e.g. local JS normalization.
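A caller will typically reduce the response to a yes/no signal. The sketch below is one hedged way to do that; `summarize` and its helpers are hypothetical. Because the table lists every field as a string while the descriptions mention arrays and a boolean, the code accepts either native JSON values or their string-encoded forms, and it makes no assumption about the key names inside each warning entry.

```python
import json


def _as_list(value) -> list:
    """Accept a real list or a JSON-encoded list; anything else is empty."""
    if isinstance(value, list):
        return value
    if isinstance(value, str) and value.strip():
        try:
            parsed = json.loads(value)
        except ValueError:
            return []
        return parsed if isinstance(parsed, list) else []
    return []


def _as_bool(value) -> bool:
    """Accept a real boolean or the strings 'true' / 'True'."""
    if isinstance(value, str):
        return value.strip().lower() == "true"
    return bool(value)


def summarize(response: dict) -> dict:
    """Count findings and derive a single 'suspicious' flag."""
    warnings = _as_list(response.get("homoglyph_warnings"))
    hidden = _as_list(response.get("hidden_chars"))
    mixed = _as_bool(response.get("is_mixed_script", False))
    return {
        "homoglyphs": len(warnings),
        "hidden": len(hidden),
        "mixed_script": mixed,
        "suspicious": bool(warnings) or bool(hidden) or mixed,
    }
```

Treating any one of the three signals as suspicious is a conservative choice for this sketch; a stricter or looser policy may fit better depending on how much multilingual text the caller expects.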
EXAMPLES two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/unicode-normalize \
  -H 'Content-Type: application/json' \
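  -d '{"text": "p\u0430ypal.com", "form": "NFC"}'
The first response is 402 Payment Required with payment requirements; sign and retry with X-PAYMENT. The sample text contains a Cyrillic 'а' written as the JSON escape \u0430.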
EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster
npx -y @agentutility/mcp-<cluster>

# Required: EVM private key with USDC on Base
export X402_PRIVATE_KEY=0x...

# Then call the unicode-normalize tool from your MCP-aware agent.
The MCP server handles payment automatically; your coding agent just calls the tool by name.
METADATA
tags
wordmint · unicode-normalize · homoglyph-detection · idn-spoof · zero-width-chars · phishing-detection · nfc-nfkc · lookalike-chars
methods
POST
cluster
wordmint
price
$0.005 USDC per call
ADJACENT other endpoints in wordmint
endpoint                price   description
brand-tagline           $0.005  Generates brand taglines and slogans for launch pages, X bios, email copy, and product cards.
brand-tagline-generate  $0.005  Generates tagline options for a brand or startup from its name, concept, audience, and tone.
card-resolve            $0.005  Normalizes free-form graded card text into a canonical card object.
content-simhash         $0.005  Fingerprints text with a 64-bit SimHash for near-duplicate detection, computed entirely locally.
cron-parse              $0.005  Cron parser.
detect-language         $0.005  Language detector / language identification.
dictionary-define       $0.005  Looks up English word definitions with pronunciation, part of speech, and synonyms.
embedding-similarity    $0.005  Measures how semantically similar two strings are: embeds both via Venice (default model: text-embedding-bge-m3) and returns the cosine s…
SEE ALSO
agentutility · wordmint · x402 · mcp · llms.txt · registry.json · bazaar.x402.org