POST /v1/transcribe
curl --request POST \
  --url https://api.spectropic.ai/v1/transcribe \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "url": "https://example.com/file.mp3",
  "numSpeakers": 2,
  "language": "en",
  "vocabulary": "Spectropic, AI, LLama, Mistral, Whisper.",
  "webhook": "https://example.com/webhook"
}'
{
  "status": "<string>"
}

Creates a new transcription job. The finished transcript is sent to the provided webhook URL.

This endpoint accepts either a remote file referenced by URL or a local file uploaded as multipart form data.

Transcribe a Remote File (from URL)

If you have a media file accessible via a URL, you can provide the URL to the file in the request body with the header Content-Type set to application/json.

Typically you would use this method if you have a file stored in a cloud storage service such as Amazon S3.

Make sure the URL to the file is publicly accessible; otherwise, our endpoint cannot read the file.
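As an illustration, the remote-file request could be built in Python with only the standard library. This is a minimal sketch rather than an official client: the helper names build_remote_request and send are hypothetical, and the token and URLs are placeholders.

```python
import json
import urllib.request

API_URL = "https://api.spectropic.ai/v1/transcribe"


def build_remote_request(token: str, file_url: str, webhook_url: str) -> urllib.request.Request:
    """Build (but do not send) a JSON transcription request for a remote file."""
    payload = {"url": file_url, "webhook": webhook_url}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling send(build_remote_request("<token>", "https://example.com/file.mp3", "https://example.com/webhook")) would submit the job; the transcript itself arrives later at the webhook, not in this response.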

Transcribe a Local File

If you have a media file stored locally, you can provide the file in the request body with the header Content-Type set to multipart/form-data.

Typically you would use this method if you have a file stored on a local machine or device.
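A local upload could be sketched as follows. The form field names are not listed on this page, so this example assumes the file goes in a field named "file" and that the webhook is passed as a form field named "webhook", mirroring the JSON body; verify both before relying on it. The helper names are hypothetical.

```python
import uuid
import urllib.request

API_URL = "https://api.spectropic.ai/v1/transcribe"


def encode_multipart(fields: dict, file_field: str, filename: str, content: bytes):
    """Encode text fields plus one file as multipart/form-data.

    Returns the encoded body and the matching Content-Type header value.
    """
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(f"--{boundary}\r\n".encode("utf-8"))
        parts.append(f'Content-Disposition: form-data; name="{name}"\r\n\r\n'.encode("utf-8"))
        parts.append(f"{value}\r\n".encode("utf-8"))
    parts.append(f"--{boundary}\r\n".encode("utf-8"))
    parts.append(
        f'Content-Disposition: form-data; name="{file_field}"; filename="{filename}"\r\n'.encode("utf-8")
    )
    parts.append(b"Content-Type: application/octet-stream\r\n\r\n")
    parts.append(content + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_local_request(token: str, filename: str, content: bytes, webhook_url: str) -> urllib.request.Request:
    """Build (but do not send) a multipart upload request for a local file."""
    # Assumption: field names "file" and "webhook" are illustrative, not documented here.
    body, content_type = encode_multipart({"webhook": webhook_url}, "file", filename, content)
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={"Authorization": f"Bearer {token}", "Content-Type": content_type},
        method="POST",
    )
```

The file bytes can be read with pathlib.Path("recording.mp3").read_bytes() and the resulting request sent with urllib.request.urlopen. Note that this reads the whole file into memory, which may be impractical for very large uploads.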

Receiving the transcript (Webhook)

The webhook URL is where the finished transcript will be sent. The transcript will be sent as a JSON object in the request body.

Make sure the webhook URL is publicly accessible; otherwise, our endpoint cannot send the transcript.

The webhook request body follows the Transcript schema, as specified on the Transcript Schema page.
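On the receiving side, a webhook endpoint only needs to accept a POST request and parse its JSON body. The sketch below uses Python's built-in http.server and stores each parsed body in a list. It assumes the transcript is delivered via POST and that a 2xx reply is an acceptable acknowledgement, neither of which is stated on this page; the handler and function names are hypothetical.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Parsed webhook bodies, in arrival order.
received_transcripts: list = []


class TranscriptWebhookHandler(BaseHTTPRequestHandler):
    def do_POST(self) -> None:
        length = int(self.headers.get("Content-Length", 0))
        try:
            transcript = json.loads(self.rfile.read(length))
        except ValueError:
            # Covers malformed JSON and undecodable bytes.
            self.send_response(400)
            self.end_headers()
            return
        received_transcripts.append(transcript)
        # Assumption: any 2xx status acknowledges delivery.
        self.send_response(204)
        self.end_headers()

    def log_message(self, format, *args) -> None:
        # Silence the default per-request logging.
        pass


def make_server(port: int = 8000) -> HTTPServer:
    """Create a server that listens on all interfaces at the given port."""
    return HTTPServer(("", port), TranscriptWebhookHandler)
```

Running make_server().serve_forever() behind a publicly reachable address would let the service deliver transcripts to it. A production receiver would normally also validate the payload against the Transcript schema and process it asynchronously.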

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

url
string
required

Direct URL of an audio or video file. Maximum file size is 2 GB. The URL must be publicly accessible.

numSpeakers
integer

Number of speakers in the transcript. If null, the number of speakers is detected automatically.

language
string

Language of the transcript as an ISO 639-1 code, such as "en" or "de". If null, the language is detected automatically.

vocabulary
string

Vocabulary hints for the transcript, similar to the "initial_prompt" parameter of Whisper. Provide acronyms, names, and foreign words.

webhook
string
required

URL to which the transcript will be sent once it is ready. The URL must be publicly accessible.
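The body fields above can be assembled with a small helper that includes optional fields only when they are set. This is a sketch under two assumptions not confirmed on this page: that omitting an optional field is equivalent to sending null, and that the client-side checks (a positive speaker count, a two-letter language code) match what the service accepts. The function name build_payload is hypothetical.

```python
from typing import Optional


def build_payload(
    url: str,
    webhook: str,
    num_speakers: Optional[int] = None,
    language: Optional[str] = None,
    vocabulary: Optional[str] = None,
) -> dict:
    """Assemble the JSON body, leaving out optional fields that are not set."""
    if not url or not webhook:
        raise ValueError("url and webhook are required")
    payload = {"url": url, "webhook": webhook}
    if num_speakers is not None:
        # Assumption: a speaker count below 1 is never meaningful.
        if num_speakers < 1:
            raise ValueError("num_speakers must be at least 1")
        payload["numSpeakers"] = num_speakers
    if language is not None:
        # ISO 639-1 codes are two letters, for example "en" or "de".
        if len(language) != 2 or not language.isalpha():
            raise ValueError("language must be a two-letter ISO 639-1 code")
        payload["language"] = language.lower()
    if vocabulary:
        payload["vocabulary"] = vocabulary
    return payload
```

For example, build_payload("https://example.com/file.mp3", "https://example.com/webhook", language="en") leaves numSpeakers out, so the number of speakers would be detected automatically.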

Response

200 - application/json
status
string