{
  "numSpeakers": 2,
  "language": "en",
  "inferenceStart": "2022-01-01T00:00:00Z",
  "inferenceEnd": "2022-01-01T00:05:00Z",
  "segments": [
    {
      "start": 0.1,
      "end": 5,
      "text": "<string>",
      "speaker": "<string>",
      "avg_logprob": -0.5,
      "words": [
        {
          "start": 0.1,
          "end": 0.5,
          "word": "<string>",
          "probability": 0.9
        }
      ]
    }
  ]
}

Details of the transcript object schema. This is what the webhook will send to your endpoint.

numSpeakers
integer

Number of speakers in the transcript. Either detected value if input was null or value is equal to user input

language
string

Language of the transcript. Either detected value if input was null or value is equal to user input. In form of ISO 639-1 code, like "en" or "de"

inferenceStart
string

Timestamp of the start of the inference

inferenceEnd
string

Timestamp of the end of the inference

segments
object[]

Array of segments, each segment contains spoken text, speaker, start and end time in seconds with decimal and a list of word objects (view below)