$ man semantic-chunk

/semantic-chunk

PRICE / CALL  $0.005 (USDC · base mainnet · scheme: exact)
METHOD        POST
CLUSTER       wordmint
CATEGORY      uncategorized
STATUS        live
NAME
semantic-chunk splits long text into chunks for rag pipelines, with three modes: 'fixed' (hard char-count windows with overlap), 'sentence' (greedy pack…
SYNOPSIS
POST https://x402.agentutility.ai/semantic-chunk
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign a USDC transferWithAuthorization, then retry with the X-PAYMENT header.
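The two-step handshake described above (unpaid request, 402 response, signed retry) can be sketched in Python. This is a minimal illustration, not the service's client: `call_with_payment`, `send`, and `sign` are hypothetical names, the transport and the signer are injected by the caller, and the sketch assumes the 402 body is JSON. Producing the actual signed authorization is left to whatever wallet tooling you use.

```python
import json
from typing import Callable, Dict, Tuple

Response = Tuple[int, str]  # (HTTP status, response body as text)
Send = Callable[[Dict[str, str], bytes], Response]
Sign = Callable[[dict], str]


def call_with_payment(send: Send, sign: Sign, payload: dict) -> Response:
    """POST once unpaid; on 402, sign the requirements and retry with X-PAYMENT.

    `send` performs the POST to the endpoint and returns (status, body).
    `sign` turns the parsed 402 body into the X-PAYMENT header value.
    Both are hypothetical hooks supplied by the caller.
    """
    body = json.dumps(payload).encode("utf-8")
    headers = {"Content-Type": "application/json"}

    status, text = send(headers, body)
    if status != 402:
        # Already paid for, or failed for a reason unrelated to payment.
        return status, text

    # Assumption: the 402 body is JSON describing the payment requirements.
    requirements = json.loads(text)

    paid_headers = dict(headers)
    paid_headers["X-PAYMENT"] = sign(requirements)
    return send(paid_headers, body)
```

Keeping the transport and signer as parameters lets the retry logic be exercised offline with stubs; in real use `send` would wrap an HTTP client pointed at https://x402.agentutility.ai/semantic-chunk.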
DESCRIPTION

Splits long text into chunks for RAG pipelines, with three modes: 'fixed' (hard char-count windows with overlap), 'sentence' (greedy pack of sentences up to chunk_size), 'paragraph' (split on blank lines, never pack across paragraphs). Returns each chunk's text, start/end character offsets, and char count. Use it as a semantic chunker, text splitter, RAG chunker, or sentence + paragraph aware chunking-with-overlap tool.
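To make the three modes concrete, here is a small local sketch in Python. It is an approximation written from the description above, not the endpoint's implementation: the sentence-boundary regex, the handling of oversized sentences, and the chunk key names (`text`, `start`, `end`, `chars`) are all assumptions.

```python
import re


def _make(text, start, end):
    # Key names are illustrative; the service's exact field names are not documented here.
    return {"text": text[start:end], "start": start, "end": end, "chars": end - start}


def chunk_fixed(text, chunk_size=500, overlap=50):
    """Hard character windows; each window starts (chunk_size - overlap) after the last."""
    overlap = min(overlap, chunk_size - 1)  # cap so the step stays positive
    step = chunk_size - overlap
    chunks, start = [], 0
    while start < len(text):
        end = min(start + chunk_size, len(text))
        chunks.append(_make(text, start, end))
        if end == len(text):
            break
        start += step
    return chunks


def chunk_sentence(text, chunk_size=500):
    """Greedily pack whole sentences into chunks of at most chunk_size characters."""
    # Assumption: a sentence ends at ., ! or ? followed by whitespace.
    spans, pos = [], 0
    for m in re.finditer(r"[.!?]+\s+", text):
        spans.append((pos, m.end()))
        pos = m.end()
    if pos < len(text):
        spans.append((pos, len(text)))

    chunks, cur_start, cur_end = [], None, None
    for s, e in spans:
        if cur_start is not None and e - cur_start > chunk_size:
            chunks.append(_make(text, cur_start, cur_end))
            cur_start = None
        if cur_start is None:
            cur_start = s  # a single oversized sentence becomes its own chunk
        cur_end = e
    if cur_start is not None:
        chunks.append(_make(text, cur_start, cur_end))
    return chunks


def chunk_paragraph(text):
    """Split on blank lines; paragraphs are never merged."""
    chunks, pos = [], 0
    for m in re.finditer(r"\n\s*\n", text):
        if m.start() > pos:
            chunks.append(_make(text, pos, m.start()))
        pos = m.end()
    if pos < len(text):
        chunks.append(_make(text, pos, len(text)))
    return chunks
```

For example, `chunk_fixed("abcdefghij", chunk_size=4, overlap=1)` yields the windows `abcd`, `defg`, and `ghij`, each sharing one character with its neighbour.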

INPUT  request schema
property    type    req?      description
text        string  required  Text to split. Up to 1,000,000 chars.
chunk_size  number  optional  Target chunk size in characters. Range [50, 20000]. Default 500.
overlap     number  optional  Overlap between chunks in characters. Default 50. Capped at chunk_size - 1.
mode        string  optional  Splitting strategy. Default 'fixed'. enum: fixed · sentence · paragraph
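Because every call is paid, it can be worth validating the request body locally before sending it. The sketch below applies the documented limits and defaults; `build_request` is a hypothetical helper, and raising on out-of-range values is a client-side choice (how the server treats them is not stated here).

```python
MODES = ("fixed", "sentence", "paragraph")
MAX_TEXT_CHARS = 1_000_000
MIN_CHUNK, MAX_CHUNK = 50, 20_000


def build_request(text, chunk_size=500, overlap=50, mode="fixed"):
    """Return a JSON-ready request body, enforcing the documented limits."""
    if not isinstance(text, str) or not text:
        raise ValueError("text is required and must be a non-empty string")
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    if not MIN_CHUNK <= chunk_size <= MAX_CHUNK:
        raise ValueError(f"chunk_size must be in [{MIN_CHUNK}, {MAX_CHUNK}]")
    if overlap < 0:
        raise ValueError("overlap must be non-negative")
    if mode not in MODES:
        raise ValueError(f"mode must be one of {MODES}")

    # The schema caps overlap at chunk_size - 1; mirror that cap locally.
    overlap = min(overlap, chunk_size - 1)
    return {"text": text, "chunk_size": chunk_size, "overlap": overlap, "mode": mode}
```

Calling `build_request("some text")` with no other arguments produces a body that uses the documented defaults: `chunk_size` 500, `overlap` 50, `mode` 'fixed'.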
OUTPUT  response shape
field        type    description
chunks       string  Array of chunk objects, each with text, start offset, end offset, and char count.
chunk_count  string  Number of chunks produced from the input text.
mode         string  Chunking mode used: 'fixed', 'sentence', or 'paragraph'.
chunk_size   string  Target maximum character size per chunk as applied.
overlap      string  Character overlap between adjacent chunks (fixed mode) or 0 otherwise.
text_chars   string  Total character length of the input text before chunking.
source       string  Identifier for the chunker, e.g. 'semantic-chunk' / local splitter tag.
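A quick sanity check on a response is to confirm that each chunk's offsets slice back to its text. The sketch below assumes chunk objects use the keys `text`, `start`, and `end`, with `end` exclusive; the schema above names the fields only loosely, so adjust the keys to match what the endpoint actually returns. `verify_chunks` is a hypothetical helper.

```python
def verify_chunks(source_text, response):
    """Return a list of human-readable problems; an empty list means consistent."""
    problems = []
    chunks = response["chunks"]

    # int() tolerates chunk_count arriving as either a number or a numeric string.
    if int(response["chunk_count"]) != len(chunks):
        problems.append("chunk_count does not match the number of chunks")

    for i, chunk in enumerate(chunks):
        # Assumed keys: 'text', 'start', 'end' (end exclusive).
        if source_text[chunk["start"]:chunk["end"]] != chunk["text"]:
            problems.append(f"chunk {i}: offsets do not reproduce its text")
    return problems
```

An empty result means the offsets can be trusted for mapping retrieved chunks back to positions in the original document.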
EXAMPLES  two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/semantic-chunk \
  -H 'Content-Type: application/json' \
  -d '{"text": "First sentence. Second sentence.", "chunk_size": 500, "overlap": 50, "mode": "sentence"}'
The first response is 402 Payment Required, carrying the payment requirements; sign them and retry with the X-PAYMENT header.
EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster
npx -y @agentutility/mcp-<cluster>

# Required: EVM private key with USDC on Base
export X402_PRIVATE_KEY=0x...

# Then call the semantic-chunk tool from your MCP-aware agent.
The MCP server handles payment automatically; your coding agent just calls the tool by name.
METADATA
tags     wordmint · text-splitter · rag-chunking · sentence-aware · paragraph-aware · chunk-overlap · semantic-chunker
methods  POST
cluster  wordmint
price    $0.005 USDC per call
ADJACENT  other endpoints in wordmint
endpoint                price   description
brand-tagline           $0.005  Generates brand taglines and slogans for launch pages, X bios, email copy, and product cards.
brand-tagline-generate  $0.005  Generates tagline options for a brand or startup from its name, concept, audience, and tone.
card-resolve            $0.005  Normalizes free-form graded card text into a canonical card object.
content-simhash         $0.005  Fingerprints text with a 64-bit SimHash for near-duplicate detection, computed entirely locally.
cron-parse              $0.005  Cron parser.
detect-language         $0.005  Language detector / language identification.
dictionary-define       $0.005  Looks up English word definitions with pronunciation, part of speech, and synonyms.
embedding-similarity    $0.005  Measures how semantically similar two strings are: embeds both via Venice (default model: text-embedding-bge-m3) and returns the cosine s…
SEE ALSO
agentutility · wordmint · x402 · mcp · llms.txt · registry.json · bazaar.x402.org