Create Transcription
Returns a docId plus a pre-signed uploadUrl. Upload your .mp3 file to that URL with an HTTP PUT request to begin processing. The URL is valid for 1 hour.
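The upload step can be sketched roughly as follows, using only the Python standard library. The helper names (`upload_url_expired`, `build_upload_request`, `upload_audio`) are illustrative, not part of the Xosum API. The `audio/mpeg` Content-Type is an assumption, since this page does not say which headers the pre-signed URL expects.

```python
import urllib.request
from datetime import datetime, timedelta, timezone
from typing import Optional

# From the text above: the pre-signed URL is valid for 1 hour.
UPLOAD_URL_TTL = timedelta(hours=1)


def upload_url_expired(issued_at: datetime, now: Optional[datetime] = None) -> bool:
    """Return True once the pre-signed URL has outlived its 1-hour window."""
    if now is None:
        now = datetime.now(timezone.utc)
    return now - issued_at >= UPLOAD_URL_TTL


def build_upload_request(upload_url: str, audio: bytes) -> urllib.request.Request:
    """Build (but do not send) the PUT request that carries the .mp3 bytes."""
    return urllib.request.Request(
        upload_url,
        data=audio,
        method="PUT",
        # Assumption: audio/mpeg is accepted; the docs do not name a Content-Type.
        headers={"Content-Type": "audio/mpeg"},
    )


def upload_audio(upload_url: str, path: str) -> int:
    """Send the file at `path` to `upload_url` and return the HTTP status code."""
    with open(path, "rb") as f:
        request = build_upload_request(upload_url, f.read())
    with urllib.request.urlopen(request) as response:
        return response.status
```

Recording the time at which the uploadUrl was received lets a client call `upload_url_expired` before a delayed upload, so it can request a new transcription when the window has passed.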
The type field controls how the resulting transcript is shaped:
- single_voice: plain transcript with no speaker labels. Use for monologues, dictations, voicemails, or any single-speaker audio.
- phone_call: diarized transcript with inline Խոսնակ 1: / Խոսնակ 2: speaker labels (Armenian for "Speaker 1" / "Speaker 2"), pinned to exactly two speakers. Use for two-party conversations (support, sales, interview calls).
There are two ways to retrieve the finished transcript:
- Webhook (push): Xosum POSTs the transcript to a URL you host. See Receive Transcription.
- Polling (pull): call GET /api/getTranscription with the docId until status is transcribed. Recommended when you can't expose a publicly reachable webhook receiver.
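The polling option can be sketched as a small loop. This is a minimal illustration, not an official client. `fetch` is a caller-supplied function that wraps GET /api/getTranscription and returns the decoded JSON response. How the docId is passed and what the response contains beyond status are not specified on this page, so they are left to that function. The interval and timeout defaults are arbitrary choices.

```python
import time
from typing import Any, Callable, Dict


def poll_transcription(
    fetch: Callable[[str], Dict[str, Any]],
    doc_id: str,
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch(doc_id) until its "status" is "transcribed" or time runs out."""
    deadline = time.monotonic() + timeout_s
    while True:
        result = fetch(doc_id)
        if result.get("status") == "transcribed":
            return result
        # Give up if the next attempt would land past the deadline.
        if time.monotonic() + interval_s > deadline:
            raise TimeoutError(f"{doc_id} not transcribed after {timeout_s}s")
        sleep(interval_s)
```

Passing `fetch` and `sleep` in as parameters keeps the loop independent of any particular HTTP library and makes it easy to exercise with a stub.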
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
The type field: the kind of audio being transcribed. This value also controls diarization in the resulting transcript:
- single_voice returns a plain transcript with no speaker labels. Use it for monologues, dictations, voicemails, and other single-speaker recordings.
- phone_call returns a diarized transcript with inline speaker labels (Խոսնակ 1: / Խոսնակ 2:) and is pinned to exactly two speakers. Use it for two-party conversations such as support or sales calls.
Allowed values: single_voice, phone_call
Optional metadata to attach to the transcription (e.g., user IDs, external info).
Example:
{
  "phonenumber": "+37400000000",
  "agent": "King Ruzi"
}
Optional. The ID of a QA Checklist to run an automated analysis on the transcript. You can find the Checklist ID in the "QA Ստուգացուցակներ" (QA Checklists) section of the Xosum.am web app. The results of the analysis are available on the "History" page of the web app once the transcription is complete.
Example:
"a1B2c3D4e5F6g7H8i9J0"
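As a rough illustration of how the header and body described above fit together, the sketch below builds both without sending anything. Only the type field name and its two allowed values come from this page. The JSON keys `metadata` and `checklistId` are hypothetical placeholders, because the extracted reference does not show the real field names, so both are exposed as parameters. The header name `Authorization` is the conventional one for Bearer tokens and is assumed here.

```python
from typing import Any, Dict, Optional

# The two values documented for the type field.
ALLOWED_TYPES = ("single_voice", "phone_call")


def auth_headers(token: str) -> Dict[str, str]:
    """Headers for a JSON request authenticated with a Bearer token."""
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }


def build_create_body(
    audio_type: str,
    metadata: Optional[Dict[str, Any]] = None,
    checklist_id: Optional[str] = None,
    metadata_key: str = "metadata",      # hypothetical key name
    checklist_key: str = "checklistId",  # hypothetical key name
) -> Dict[str, Any]:
    """Assemble the request body, rejecting undocumented type values."""
    if audio_type not in ALLOWED_TYPES:
        raise ValueError(f"type must be one of {ALLOWED_TYPES}, got {audio_type!r}")
    body: Dict[str, Any] = {"type": audio_type}
    # Optional fields are left out entirely when not provided.
    if metadata is not None:
        body[metadata_key] = metadata
    if checklist_id is not None:
        body[checklist_key] = checklist_id
    return body
```

Validating the type value on the client side catches a misspelled value before the request is sent.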
