Skip to content
clusters: prooflayer · edgemarket · edgefinance · synthforge · mediakit · wordmint · webprobe · locale · comppoint · rollforge · bestiary · statline · matchpoint · retail · agentops · browserworkflow · modelrouter · compose
$ man video-to-subtitles

/video-to-subtitles

agentutility / mediakit / video-to-subtitles
PRICE / CALL
$0.02
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
mediakit
CATEGORY
media
STATUS
live
NAME
video-to-subtitles generates subtitles from video with whisper v3, word-wrapped and ready for vlc / premiere / ffmpeg
SYNOPSIS
POST https://x402.agentutility.ai/video-to-subtitles
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.
DESCRIPTION

Generates subtitles from video with Whisper v3, word-wrapped and ready for VLC / Premiere / FFmpeg. Auto-detects language and can translate to English. Use it as a video subtitle generator, auto-subtitle and closed captions tool, SRT generator, VTT generator, video CC endpoint, or accessibility captions source.

INPUTrequest schema
propertytypedescriptionreq?
media_urlstringrequired
formatstring
enum: srt · vtt
optional
languagestringoptional
taskstring
enum: transcribe · translate
optional
max_chars_per_linenumberoptional
OUTPUTresponse shape
fieldtypedescription
subtitlesstringFull subtitle file content as a string.
formatstringEcho of the format used.
mime_typestringMIME type for the subtitle format ('application/x-subrip' or 'text/vtt').
cue_countnumberNumber of subtitle cues generated.
duration_secondsnumberSource media duration.
detected_languagesarrayLanguages auto-detected in the audio.
taskstringEcho of the task performed.
source_urlstringEcho of the input URL.
EXAMPLEStwo ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/video-to-subtitles \
  -H 'Content-Type: application/json' \
  -d '{ }'
first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.
EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster
npx -y @agentutility/mcp-<cluster>

# Required: EVM private key with USDC on Base
export X402_PRIVATE_KEY=0x...

# Then call the video-to-subtitles tool from your MCP-aware agent.
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
tags
subtitlessrtvttcaptionswhispertranscription
env
FAL_KEY_TRANSCRIBE
methods
POST
cluster
mediakit
price
$0.02 USDC per call
ADJACENTother endpoints in mediakit
endpointdescriptionprice
add-watermarkAdd watermark to PDF, image, or video.$0.02
audio-loudnormAudio loudness normalizer (EBU R128 LUFS).$0.02
csv-to-jsonlConverts CSV or TSV data into JSON, JSONL/NDJSON, or column-oriented arrays.$0.02
image-translateImage translator.$0.02
image-upscaleUpscales an image 2x or 4x via Venice's image/upscale endpoint (default model: venice-sd35).$0.02
image-watermarkImage watermark / add text or logo watermark to image files.$0.02
mp4-to-mp3Converts MP4, MOV, WebM, MKV, AVI, M4V, and FLV video files to MP3 via CloudConvert, with selectable bitrate (96/128/192 kbps).$0.02
mp4-to-mp3-apiConverts MP4, MOV, WEBM, MKV, AVI, M4V, or FLV URLs into hosted MP3 output with selectable 96, 128, or 192 kbps bitrate.$0.02
SEE ALSO
agentutility · mediakit · x402 · mcp · llms.txt · registry.json · bazaar.x402.org