$ man ocr

/ocr

PRICE / CALL
$0.0025
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
mediakit
CATEGORY
uncategorized
STATUS
live
NAME
ocr - runs OCR on scanned PDFs and image-based documents, returning clean Markdown or plain text.
Alias of pdf-to-markdown; reuses the canonical handler.
SYNOPSIS
POST https://x402.agentutility.ai/ocr
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign USDC transferWithAuthorization, retry with the X-PAYMENT header.
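The two-step flow above (unpaid call, 402, signed retry) can be sketched in Python as follows. This is a minimal sketch, not the service's client: `send` and `sign` are hypothetical callables supplied by the caller, standing in for an HTTP transport and a wallet signer, and the sketch assumes a successful paid call returns HTTP 200.

```python
from typing import Callable, Dict, Tuple

# Hypothetical transport: takes (headers, json_body), returns (status, response_text).
Send = Callable[[Dict[str, str], str], Tuple[int, str]]
# Hypothetical signer: takes the 402 response text (the payment requirements)
# and returns the value to place in the X-PAYMENT header.
Sign = Callable[[str], str]


def call_with_payment(send: Send, sign: Sign, body: str) -> str:
    """Send once unpaid; on 402, sign the returned requirements and retry once."""
    headers = {"Content-Type": "application/json"}
    status, text = send(headers, body)
    if status == 200:
        return text
    if status != 402:
        raise RuntimeError(f"unexpected HTTP {status} on first call")

    # Retry the same body with the signed payment attached.
    paid_headers = dict(headers)
    paid_headers["X-PAYMENT"] = sign(text)
    status, text = send(paid_headers, body)
    if status != 200:
        raise RuntimeError(f"paid retry failed with HTTP {status}")
    return text
```

Keeping the transport and signer injectable means the retry logic can be exercised without a network connection or a funded wallet.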
DESCRIPTION

Runs OCR on scanned PDFs and image-based documents, returning clean Markdown or plain text. The Datalab Marker engine preserves layout, tables, and math. Input is limited to 30 pages. Typical uses: a general OCR API, scanned-document extraction, and image-PDF to text conversion.

INPUT request schema
property | type | description | req?
pdf_url | string | Public URL of a PDF file (http or https). Must be directly fetchable, not behind auth or a viewer redirect. Max 30 pages. | required
output_format | string | 'markdown' (default, best for LLM downstream), 'html' (preserves more layout structure), or 'json' (per-page blocks with type + bbox). enum: markdown · html · json | optional
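As an illustration of the request schema, here is a small Python sketch that builds and checks the JSON body before sending it. The helper name `build_ocr_request` is hypothetical. The checks cover only what the schema states (http/https URL, the three enum values); they cannot verify that the URL is publicly fetchable or that the PDF is within the page limit.

```python
import json
from urllib.parse import urlparse

# Enum values listed in the request schema.
ALLOWED_FORMATS = ("markdown", "html", "json")


def build_ocr_request(pdf_url: str, output_format: str = "markdown") -> str:
    """Return the JSON request body, rejecting inputs the schema rules out."""
    parsed = urlparse(pdf_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("pdf_url must be an absolute http or https URL")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"output_format must be one of {ALLOWED_FORMATS}")
    return json.dumps({"pdf_url": pdf_url, "output_format": output_format})
```

The returned string can be passed directly as the POST body.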
OUTPUT response shape
field | type | description
markdown | string | Extracted text from the document in Markdown format, preserving headings, tables, and math layout.
page_count | string | Number of pages processed from the input PDF or image document.
source_url | string | URL of the source PDF or image file that was passed in for OCR processing.
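A response with the three fields above could be parsed along these lines. This is a sketch under assumptions: the `OcrResult` type and `parse_ocr_response` helper are hypothetical, and `page_count` is converted to an integer for convenience even though the table documents it as a string. The table only describes the Markdown response, so the shape for `html` or `json` output may differ.

```python
import json
from dataclasses import dataclass


@dataclass
class OcrResult:
    markdown: str
    page_count: int
    source_url: str


def parse_ocr_response(text: str) -> OcrResult:
    """Parse the documented response fields, failing loudly if any is absent."""
    data = json.loads(text)
    missing = [k for k in ("markdown", "page_count", "source_url") if k not in data]
    if missing:
        raise ValueError(f"response is missing fields: {missing}")
    return OcrResult(
        markdown=data["markdown"],
        # Documented as a string; int() accepts both "3" and 3.
        page_count=int(data["page_count"]),
        source_url=data["source_url"],
    )
```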
EXAMPLES two ways to call
EXAMPLE 1 · curl
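# pdf_url is required; the URL below is a placeholder for a publicly fetchable PDF
curl -X POST https://x402.agentutility.ai/ocr \
  -H 'Content-Type: application/json' \
  -d '{ "pdf_url": "https://example.com/scan.pdf", "output_format": "markdown" }'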
first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.
EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster
npx -y @agentutility/mcp-<cluster>

# Required: EVM private key with USDC on Base
export X402_PRIVATE_KEY=0x...

# Then call the ocr tool from your MCP-aware agent.
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
tags
ocr · mediakit · pdf-extraction · scanned-documents · image-to-text · markdown-extraction · table-extraction · datalab-marker
methods
POST
cluster
mediakit
price
$0.0025 USDC per call
ADJACENT other endpoints in mediakit
endpoint | description | price
convert-pdf | Converts PDFs to Markdown, HTML, JSON, or structured text with the Datalab Marker AI pipeline, preserving headings, tables, equations, an… | $0.0025
pdf-parser-api | Parses a public PDF URL into Markdown, HTML, or JSON blocks with layout-aware text, headings, tables, and equations. | $0.0025
pdf-text-extractor | Extracts clean Markdown, HTML, or structured JSON from digital or scanned PDFs while preserving reading order, tables, and equations. | $0.0025
pdf-to-markdown | Converts digital or scanned PDFs to clean Markdown with AI-powered, layout-aware extraction on the Datalab Marker engine. | $0.0025
pdf-to-markdown-api | Converts a public PDF URL into clean Markdown, HTML, or structured JSON while preserving headings, tables, equations, and reading order. | $0.0025
pdf-to-text | Extracts text from digital or scanned PDFs, preserving reading order across multi-column layouts with an AI + OCR pipeline (Datalab Marker). | $0.0025
pdf-to-text-api | Extracts text from digital and scanned PDFs as Markdown, plain text, HTML, or JSON with layout-aware reading order. | $0.0025
compress-pdf | PDF compressor / PDF size reducer. | $0.005
SEE ALSO
agentutility · mediakit · x402 · mcp · llms.txt · registry.json · bazaar.x402.org