$ man semantic-chunk
/semantic-chunk
PRICE / CALL
$0.005
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
wordmint
CATEGORY
uncategorized
STATUS
● live
NAME
semantic-chunk — splits long text into chunks for rag pipelines, with three modes: 'fixed' (hard char-count windows with overlap), 'sentence' (greedy pack…
SYNOPSIS
POST https://x402.agentutility.ai/semantic-chunk
Content-Type: application/json
X-PAYMENT: <signed-transferWithAuthorization>
{ ... }
↳ first call → 402 Payment Required. Sign USDC transferWithAuthorization, retry with the X-PAYMENT header.
DESCRIPTION
Splits long text into chunks for RAG pipelines, with three modes: 'fixed' (hard char-count windows with overlap), 'sentence' (greedy pack of sentences up to chunk_size), 'paragraph' (split on blank lines, never pack across paragraphs). Returns each chunk's text, start/end character offsets, and char count. Use it as a semantic chunker, text splitter, RAG chunker, or sentence + paragraph aware chunking-with-overlap tool.
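To make the three modes concrete, here is a minimal local sketch in Python. It is an illustration of the described behaviour, not the service's implementation. Assumptions: in 'fixed' mode each window starts `chunk_size - overlap` characters after the previous one; sentences end at `.`, `!`, or `?` (a naive rule that also splits abbreviations); a sentence longer than `chunk_size` becomes its own oversized chunk; 'paragraph' mode emits one chunk per paragraph regardless of size; end offsets are exclusive. The key names `text`, `start`, `end`, and `chars` are hypothetical.

```python
import re


def span(text, start, end):
    # Hypothetical chunk shape; the service's real key names are not documented here.
    return {"text": text[start:end], "start": start, "end": end, "chars": end - start}


def chunk_fixed(text, chunk_size=500, overlap=50):
    """Hard character windows that share `overlap` characters with the previous window."""
    overlap = max(0, min(overlap, chunk_size - 1))  # documented cap: chunk_size - 1
    step = chunk_size - overlap                     # assumed stride between window starts
    chunks, start = [], 0
    while start < len(text):
        end = min(start + chunk_size, len(text))
        chunks.append(span(text, start, end))
        if end == len(text):
            break
        start += step
    return chunks


def chunk_sentence(text, chunk_size=500):
    """Greedily pack whole sentences until the next one would exceed chunk_size."""
    chunks, start, end = [], None, None
    # Naive sentence spans: text up to a run of terminators, or the trailing remainder.
    for m in re.finditer(r"[^.!?]*[.!?]+|[^.!?]+$", text):
        if start is not None and m.end() - start > chunk_size:
            chunks.append(span(text, start, end))
            start = None
        if start is None:
            start = m.start()
        end = m.end()
    if start is not None:
        chunks.append(span(text, start, end))
    return chunks


def chunk_paragraph(text):
    """One chunk per paragraph; paragraphs are separated by blank lines."""
    chunks, pos = [], 0
    for sep in re.finditer(r"\n\s*\n", text):
        if sep.start() > pos:
            chunks.append(span(text, pos, sep.start()))
        pos = sep.end()
    if pos < len(text):
        chunks.append(span(text, pos, len(text)))
    return chunks
```

The small sizes that are handy for experimenting locally (for example `chunk_fixed("abcdefghij", 4, 1)`) fall below the endpoint's documented minimum `chunk_size` of 50, so they only apply to this sketch.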
INPUT — request schema
| property | type | description | req? |
|---|---|---|---|
| text | string | Text to split. Up to 1,000,000 chars. | required |
| chunk_size | number | Target chunk size in characters. Range [50, 20000]. Default 500. | optional |
| overlap | number | Overlap between chunks in characters. Default 50. Capped at chunk_size - 1. | optional |
| mode | string | Splitting strategy. Default 'fixed'. enum: fixed · sentence · paragraph | optional |
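The documented limits can be checked on the client before paying for a call. The following is a rough sketch, assuming Python; `build_payload` is a hypothetical helper, not part of any published SDK. Whether the server rejects or silently clamps out-of-range values is not stated above, so this sketch raises for `text`, `chunk_size`, and `mode`, and clamps `overlap` because the table describes it as capped. Rejecting a negative `overlap` is also an assumption.

```python
MODES = ("fixed", "sentence", "paragraph")
MAX_TEXT_CHARS = 1_000_000


def build_payload(text, chunk_size=500, overlap=50, mode="fixed"):
    """Return a request body dict that respects the documented input limits."""
    if not isinstance(text, str):
        raise ValueError("text must be a string")
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError("text exceeds 1,000,000 characters")
    if not 50 <= chunk_size <= 20000:
        raise ValueError("chunk_size must be in [50, 20000]")
    if overlap < 0:
        # Assumption: negative overlap is treated as invalid.
        raise ValueError("overlap must be non-negative")
    if mode not in MODES:
        raise ValueError("mode must be one of: " + ", ".join(MODES))
    return {
        "text": text,
        "chunk_size": chunk_size,
        "overlap": min(overlap, chunk_size - 1),  # documented cap
        "mode": mode,
    }
```

For example, `build_payload("some long text", chunk_size=50, overlap=80)` yields an `overlap` of 49.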
OUTPUT — response shape
| field | type | description |
|---|---|---|
| chunks | array | Array of chunk objects, each with text, start offset, end offset, and char count. |
| chunk_count | number | Number of chunks produced from the input text. |
| mode | string | Chunking mode used: 'fixed', 'sentence', or 'paragraph'. |
| chunk_size | number | Target maximum character size per chunk as applied. |
| overlap | number | Character overlap between adjacent chunks (fixed mode) or 0 otherwise. |
| text_chars | number | Total character length of the input text before chunking. |
| source | string | Identifier for the chunker, e.g. 'semantic-chunk' / local splitter tag. |
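A caller may want to sanity-check a decoded response before handing chunks to an indexer. The sketch below, in Python, relies only on the top-level fields listed in the table; `check_response` is a hypothetical helper. It coerces counts with `int()` so it works whether they arrive as JSON numbers or as numeric strings, and it deliberately does not assume the key names inside each chunk object, which are not given here.

```python
def check_response(resp):
    """Validate the top-level fields of a decoded response and return a summary."""
    chunks = resp["chunks"]
    if not isinstance(chunks, list):
        raise ValueError("chunks should be an array of chunk objects")
    count = int(resp["chunk_count"])
    if count != len(chunks):
        raise ValueError(f"chunk_count is {count} but {len(chunks)} chunks were returned")
    if resp["mode"] not in ("fixed", "sentence", "paragraph"):
        raise ValueError("unexpected mode: " + repr(resp["mode"]))
    return {
        "mode": resp["mode"],
        "chunk_count": count,
        "chunk_size": int(resp["chunk_size"]),
        "overlap": int(resp["overlap"]),
        "text_chars": int(resp["text_chars"]),
        "source": resp["source"],
    }
```

A mismatch between `chunk_count` and the length of `chunks` usually points to a truncated or partially parsed body, which is why the sketch treats it as an error rather than a warning.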
EXAMPLES — two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/semantic-chunk \
-H 'Content-Type: application/json' \
-d '{ }'
first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.
EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster
npx -y @agentutility/mcp-<cluster>
# Required: EVM private key with USDC on Base
export X402_PRIVATE_KEY=0x...
# Then call the semantic-chunk tool from your MCP-aware agent.
The MCP server handles payment automatically; your coding agent just calls the tool by name.
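For callers that use neither curl nor MCP, the two-step flow (unpaid call, 402, signed retry) might look like the Python sketch below, which uses only the standard library. The signing step is left to a caller-supplied `sign` function, a hypothetical callable that turns the 402 response into an `X-PAYMENT` header value; producing a real `transferWithAuthorization` signature is out of scope here. That the payment requirements arrive in the 402 response body is an assumption. `build_request`, `send`, and `call_with_payment` are illustrative names.

```python
import json
import urllib.error
import urllib.request

ENDPOINT = "https://x402.agentutility.ai/semantic-chunk"


def build_request(payload, x_payment=None):
    """Build the POST request; the X-PAYMENT header is added only on the retry."""
    headers = {"Content-Type": "application/json"}
    if x_payment is not None:
        headers["X-PAYMENT"] = x_payment
    data = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(ENDPOINT, data=data, headers=headers, method="POST")


def send(req):
    """Send the request and return (status, body); a 402 is returned, not raised."""
    try:
        with urllib.request.urlopen(req, timeout=30) as resp:
            return resp.status, resp.read().decode("utf-8")
    except urllib.error.HTTPError as err:
        return err.code, err.read().decode("utf-8")


def call_with_payment(payload, sign, transport=send):
    """Try once unpaid; on 402, sign the requirements and retry with X-PAYMENT."""
    status, body = transport(build_request(payload))
    if status == 402:
        # `sign` is hypothetical: it receives the 402 body (assumed to carry the
        # payment requirements) and returns the X-PAYMENT header value.
        status, body = transport(build_request(payload, x_payment=sign(body)))
    return status, body
```

The `transport` parameter exists so the retry logic can be exercised offline with a stub before any funds are involved.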
METADATA
- tags: wordmint, text-splitter, rag-chunking, sentence-aware, paragraph-aware, chunk-overlap, semantic-chunker
- methods: POST
- cluster: wordmint
- price: $0.005 USDC per call
ADJACENT — other endpoints in wordmint
| endpoint | description | price |
|---|---|---|
| brand-tagline | Generates brand taglines and slogans for launch pages, X bios, email copy, and product cards. | $0.005 |
| brand-tagline-generate | Generates tagline options for a brand or startup from its name, concept, audience, and tone. | $0.005 |
| card-resolve | Normalizes free-form graded card text into a canonical card object. | $0.005 |
| content-simhash | Fingerprints text with a 64-bit SimHash for near-duplicate detection, computed entirely locally. | $0.005 |
| cron-parse | Cron parser. | $0.005 |
| detect-language | Language detector / language identification. | $0.005 |
| dictionary-define | Looks up English word definitions with pronunciation, part of speech, and synonyms. | $0.005 |
| embedding-similarity | Measures how semantically similar two strings are: embeds both via Venice (default model: text-embedding-bge-m3) and returns the cosine s… | $0.005 |
SEE ALSO