Public Beta Caveats and Limitations

Transcription

PII redaction is only available in en-US.

The following limitations apply when fetching media stored in a third-party location (specified by a MediaUrl).

Currently, Voice Intelligence does not send an X-Twilio-Signature header for media fetch requests. As a result, media stored in Twilio Assets must be made public.
If you use external recordings, Basic authentication on MediaUrls is not supported. If you store the recordings on S3, use a presigned URL. If you store them on Azure Blob Storage, use a Shared Access Signature (SAS).
A MediaUrl that responds with an HTTP status code other than 200 results in a transcription request that does not complete.
Requests to fetch third-party media are performed exactly once. There is currently no retry behavior.
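
Because the fetch happens once and a non-200 response leaves the request incomplete, it can help to check a MediaUrl yourself before submitting it. The following is a minimal sketch, not part of any Twilio SDK: the function names are hypothetical, and it only approximates the fetch with a single unauthenticated GET using the Python standard library.

```python
import urllib.error
import urllib.request
from urllib.parse import urlsplit


def url_problems(url):
    """Return problems visible in the URL itself (empty list if none)."""
    parts = urlsplit(url)
    if parts.username or parts.password:
        return ["URL embeds Basic authentication credentials, which are not supported"]
    return []


def response_problems(status):
    """Return problems with the HTTP status (empty list if none)."""
    if status != 200:
        return [f"server answered {status}, not 200"]
    return []


def preflight(url, timeout=10.0):
    """Send one unauthenticated GET and report likely causes of a failed fetch.

    Note: urllib follows redirects, so a redirecting URL is judged by its
    final status. Whether Voice Intelligence follows redirects is not stated
    in this document.
    """
    problems = url_problems(url)
    if problems:
        return problems
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            status = response.status
    except urllib.error.HTTPError as err:
        status = err.code
    except urllib.error.URLError as err:
        return [f"could not reach the server: {err.reason}"]
    return response_problems(status)
```

An empty list from `preflight("https://example.com/recording.wav")` means only that this one request succeeded. It does not guarantee that a later fetch, for example after a presigned URL or SAS token expires, will also succeed.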

Voice Intelligence does not perform speaker diarization on recordings. Using mono recordings results in lower transcription accuracy.

Recordings shorter than two seconds are not transcribed.

At most two Participants can be overridden in the Channel object of the Transcript resource.

Voice Intelligence only processes recordings or third-party media that are under 1 GB.
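
The duration, channel, and size limits above can be checked locally before a recording is submitted. The sketch below makes several assumptions: the recording is an uncompressed WAV file on disk (this page does not list supported formats), "1 GB" is read conservatively as 10**9 bytes, and `recording_problems` is a hypothetical helper name.

```python
import os
import wave

MIN_SECONDS = 2.0          # recordings under two seconds are not transcribed
MAX_BYTES = 1_000_000_000  # assumption: "1 GB" interpreted as 10**9 bytes


def recording_problems(path):
    """Return a list of limitations a local WAV file would run into."""
    problems = []
    if os.path.getsize(path) >= MAX_BYTES:
        problems.append("file is not under 1 GB and will not be processed")
    with wave.open(path, "rb") as wav:
        seconds = wav.getnframes() / wav.getframerate()
        channels = wav.getnchannels()
    if seconds < MIN_SECONDS:
        problems.append(f"recording lasts {seconds:.2f} s and will not be transcribed")
    if channels == 1:
        problems.append("mono recording: expect lower transcription accuracy")
    return problems
```

The mono entry is a warning rather than a hard failure, since mono recordings are still transcribed, only less accurately.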