$ man video-to-subtitles
/video-to-subtitles
PRICE / CALL
$0.02
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
mediakitCATEGORY
media
STATUS
● live
NAME
video-to-subtitles — generates subtitles from video with whisper v3, word-wrapped and ready for vlc / premiere / ffmpeg
SYNOPSIS
POST https://x402.agentutility.ai/video-to-subtitles
Content-Type: application/json
X-PAYMENT: <signed-transferWithAuthorization>
{ ... }↳ first call →
402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.DESCRIPTION
Generates subtitles from video with Whisper v3, word-wrapped and ready for VLC / Premiere / FFmpeg. Auto-detects language and can translate to English. Use it as a video subtitle generator, auto-subtitle and closed captions tool, SRT generator, VTT generator, video CC endpoint, or accessibility captions source.
INPUT — request schema
| property | type | description | req? |
|---|---|---|---|
| media_url | string | — | required |
| format | string | — enum: srt · vtt | optional |
| language | string | — | optional |
| task | string | — enum: transcribe · translate | optional |
| max_chars_per_line | number | — | optional |
OUTPUT — response shape
| field | type | description |
|---|---|---|
| subtitles | string | Full subtitle file content as a string. |
| format | string | Echo of the format used. |
| mime_type | string | MIME type for the subtitle format ('application/x-subrip' or 'text/vtt'). |
| cue_count | number | Number of subtitle cues generated. |
| duration_seconds | number | Source media duration. |
| detected_languages | array | Languages auto-detected in the audio. |
| task | string | Echo of the task performed. |
| source_url | string | Echo of the input URL. |
EXAMPLES — two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.agentutility.ai/video-to-subtitles \
-H 'Content-Type: application/json' \
-d '{ }'first response =
402 Payment Required with payment requirements; sign + retry with X-PAYMENT.EXAMPLE 2 · mcp
# Install the MCP package for this endpoint's cluster npx -y @agentutility/mcp-<cluster> # Required: EVM private key with USDC on Base export X402_PRIVATE_KEY=0x... # Then call the video-to-subtitles tool from your MCP-aware agent.
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
- tags
- subtitlessrtvttcaptionswhispertranscription
- env
- FAL_KEY_TRANSCRIBE
- methods
- POST
- cluster
- mediakit
- price
- $0.02 USDC per call
ADJACENT — other endpoints in mediakit
| endpoint | description | price |
|---|---|---|
| add-watermark | Add watermark to PDF, image, or video. | $0.02 |
| audio-loudnorm | Audio loudness normalizer (EBU R128 LUFS). | $0.02 |
| csv-to-jsonl | Converts CSV or TSV data into JSON, JSONL/NDJSON, or column-oriented arrays. | $0.02 |
| image-translate | Image translator. | $0.02 |
| image-upscale | Upscales an image 2x or 4x via Venice's image/upscale endpoint (default model: venice-sd35). | $0.02 |
| image-watermark | Image watermark / add text or logo watermark to image files. | $0.02 |
| mp4-to-mp3 | Converts MP4, MOV, WebM, MKV, AVI, M4V, and FLV video files to MP3 via CloudConvert, with selectable bitrate (96/128/192 kbps). | $0.02 |
| mp4-to-mp3-api | Converts MP4, MOV, WEBM, MKV, AVI, M4V, or FLV URLs into hosted MP3 output with selectable 96, 128, or 192 kbps bitrate. | $0.02 |
SEE ALSO