Batch Syntheses - List
Gets the list of batch syntheses.
GET {endpoint}/avatar/batchsyntheses?api-version=2024-08-01
GET {endpoint}/avatar/batchsyntheses?skip={skip}&maxpagesize={maxpagesize}&api-version=2024-08-01
URI Parameters
| Name | In | Required | Type | Description |
|---|---|---|---|---|
|
endpoint
|
path | True |
string |
Supported Cognitive Services endpoints (protocol and hostname, for example: https://eastus.api.cognitive.microsoft.com). |
|
api-version
|
query | True |
string minLength: 1 |
The API version to use for this operation. |
|
maxpagesize
|
query |
integer (int32) |
The maximum number of result items to include in a single response. |
|
|
skip
|
query |
integer (int32) |
The number of result items to skip. |
Request Header
| Name | Required | Type | Description |
|---|---|---|---|
| Ocp-Apim-Subscription-Key | True |
string |
Provide your cognitive services subscription key here. |
Responses
| Name | Type | Description |
|---|---|---|
| 200 OK |
Success |
|
| Other Status Codes |
An error occurred. Headers x-ms-error-code: string |
Security
Ocp-Apim-Subscription-Key
Provide your cognitive services subscription key here.
Type:
apiKey
In:
header
Examples
Create a project
Sample request
GET {endpoint}/avatar/batchsyntheses?api-version=2024-08-01
Sample response
{
"value": [
{
"id": "talking-avatar-batch-synthesis-job-2",
"status": "Succeeded",
"createdDateTime": "2024-03-15T10:31:43.7134142Z",
"lastActionDateTime": "2024-03-15T10:31:48.1575987",
"inputKind": "PlainText",
"customVoices": {},
"properties": {
"timeToLiveInHours": 744,
"sizeInBytes": 493133,
"durationInMilliseconds": 3530,
"succeededCount": 1,
"failedCount": 0,
"billingDetails": {
"neuralCharacters": 42,
"talkingAvatarDurationSeconds": 3
}
},
"synthesisConfig": {
"voice": "en-us-JennyNeural"
},
"avatarConfig": {
"talkingAvatarCharacter": "lisa",
"talkingAvatarStyle": "casual-sitting",
"videoFormat": "Mp4",
"videoCodec": "hevc",
"subtitleType": "soft_embedded",
"bitrateKbps": 2000,
"customized": false
},
"outputs": {
"result": "https://stttssvcuse.blob.core.windows.net/batchsynthesis-output/3e8138d91be947bd9e076a4cf58a20f8/talking-avatar-batch-synthesis-job-2/0001.mp4?[token redacted]",
"summary": "https://stttssvcuse.blob.core.windows.net/batchsynthesis-output/3e8138d91be947bd9e076a4cf58a20f8/talking-avatar-batch-synthesis-job-2/summary.json?[token redacted]"
}
},
{
"id": "talking-avatar-batch-synthesis-job-1",
"status": "Succeeded",
"createdDateTime": "2024-03-15T10:28:23.7702311Z",
"lastActionDateTime": "2024-03-15T10:28:46.363789",
"inputKind": "PlainText",
"customVoices": {},
"properties": {
"timeToLiveInHours": 744,
"sizeInBytes": 1060360,
"durationInMilliseconds": 3530,
"succeededCount": 1,
"failedCount": 0,
"billingDetails": {
"neuralCharacters": 34,
"talkingAvatarDurationSeconds": 3
}
},
"synthesisConfig": {
"voice": "zh-CN-XiaoxiaoNeural"
},
"avatarConfig": {
"talkingAvatarCharacter": "lisa",
"talkingAvatarStyle": "casual-sitting",
"videoFormat": "Webm",
"videoCodec": "vp9",
"subtitleType": "soft_embedded",
"backgroundColor": "transparent",
"videoCrop": {
"topLeft": {
"x": 100,
"y": 100
},
"bottomRight": {
"x": 1820,
"y": 1080
}
},
"bitrateKbps": 2000,
"customized": false
},
"outputs": {
"result": "https://stttssvcuse.blob.core.windows.net/batchsynthesis-output/3e8138d91be947bd9e076a4cf58a20f8/talking-avatar-batch-synthesis-job-1/0001.webm?[token redacted]",
"summary": "https://stttssvcuse.blob.core.windows.net/batchsynthesis-output/3e8138d91be947bd9e076a4cf58a20f8/talking-avatar-batch-synthesis-job-1/summary.json?[token redacted]"
}
}
],
"nextLink": "https://eastus.api.cognitive.microsoft.com/avatar/batchsyntheses/?maxpagesize=2&skip=2&api-version=2024-08-01"
}
Definitions
| Name | Description |
|---|---|
|
Batch |
Avatar configuration. |
|
Batch |
Coordinate definition |
|
Batch |
Only keep the region defined by this object in original Avatar output video. |
|
Batch |
Batch synthesis task. |
|
Batch |
Batch synthesis billing details. |
|
Batch |
Text-to-speech configuration for plain text input. |
|
Batch |
Batch Synthesis Error. |
|
Batch |
Batch Synthesis input. |
|
Batch |
Batch Synthesis output. |
|
Batch |
Detailed properties of batch synthesis task. |
| Error |
The error object. |
|
Error |
Top-level error code |
|
Error |
Error response. |
|
Inner |
An object containing more specific information about the error. |
|
Input |
Input kind. |
|
Paginated |
Paginated batch synthesis task list |
| Status |
Status of a resource. |
BatchAvatarConfig
Avatar configuration.
| Name | Type | Description |
|---|---|---|
| avatarPosition |
Coordinate definition |
|
| avatarSize |
Coordinate definition |
|
| backgroundColor |
string |
The talking avatar background color. |
| backgroundImage |
string |
The talking avatar background image. |
| bitrateKbps |
integer (int64) |
The bitrate of video output settings for talking avatar. |
| customized |
boolean |
Whether the avatar character is a custom avatar or not. |
| resolution |
Coordinate definition |
|
| subtitleType |
string |
The talking avatar subtitle type. |
| talkingAvatarCharacter |
string minLength: 1 |
The talking avatar character name. e.g., Lisa. |
| talkingAvatarStyle |
string |
The talking avatar character style. e.g., casual-sitting. |
| videoCodec |
string |
The output video codec. |
| videoCrop |
Only keep the region defined by this object in original Avatar output video. |
|
| videoFormat |
string |
The output video format. |
BatchAvatarCoordinateDefinition
Coordinate definition
| Name | Type | Description |
|---|---|---|
| x |
number (double) |
Gets or sets the x-axis. |
| y |
number (double) |
Gets or sets the y-axis. |
BatchAvatarVideoCropDefinition
Only keep the region defined by this object in original Avatar output video.
| Name | Type | Description |
|---|---|---|
| bottomRight |
Coordinate definition |
|
| topLeft |
Coordinate definition |
BatchSynthesis
Batch synthesis task.
| Name | Type | Description |
|---|---|---|
| avatarConfig |
Avatar configuration. |
|
| createdDateTime |
string (date-time) |
The time-stamp when the object was created. |
| customVoices |
object |
The custom voice map of voice name and deployment ID . |
| description |
string |
The description of the batch synthesis task. |
| id |
string minLength: 3maxLength: 64 pattern: ^[a-zA-Z0-9][a-zA-Z0-9._-]{1,62}[a-zA-Z0-9]$ |
Resource id |
| inputKind |
Input kind. |
|
| inputs |
Text or SSML inputs, it's allowed up to 1000 items. |
|
| lastActionDateTime |
string (date-time) |
The time-stamp when the current status was entered. |
| outputs |
Batch Synthesis output. |
|
| properties |
Detailed properties of batch synthesis task. |
|
| status |
Status of a resource. |
|
| synthesisConfig |
Text-to-speech configuration for plain text input. |
BatchSynthesisBillingDetails
Batch synthesis billing details.
| Name | Type | Description |
|---|---|---|
| aoaiCharacters |
integer (int64) |
The number of billable characters for AOAI voices in the input file. |
| aoaiHDCharacters |
integer (int64) |
The number of billable characters for AOAI HD voices in the input file. |
| customNeuralCharacters |
integer (int64) |
The number of billable characters for custom neural voices in the input file. |
| neuralCharacters |
integer (int64) |
The number of billable characters for prebuilt neural voices in the input file. |
| talkingAvatarDurationSeconds |
integer (int64) |
The duration of the talking avatar generated video. |
BatchSynthesisConfig
Text-to-speech configuration for plain text input.
| Name | Type | Description |
|---|---|---|
| pitch |
string |
Pitch for PlainText input. For supported values, see https://learn.microsoft.com/azure/ai-services/speech-service/speech-synthesis-markup-voice#adjust-prosody. |
| rate |
string |
Speaking rate for PlainText input. For supported values, see https://learn.microsoft.com/azure/ai-services/speech-service/speech-synthesis-markup-voice#adjust-prosody. |
| style |
string |
Style name for PlainText input. |
| voice |
string minLength: 1 |
Voice name for PlainText input. You can find platform voice list here https://learn.microsoft.com/azure/ai-services/speech-service/language-support?tabs=tts. |
| volume |
string |
Volume for PlainText input. For supported values, see https://learn.microsoft.com/azure/ai-services/speech-service/speech-synthesis-markup-voice#adjust-prosody. |
BatchSynthesisError
Batch Synthesis Error.
| Name | Type | Description |
|---|---|---|
| code |
string |
Top level error code |
| message |
string |
Error message |
BatchSynthesisInput
Batch Synthesis input.
| Name | Type | Description |
|---|---|---|
| content |
string minLength: 1 |
Input content. It could be SSML or plain text. |
BatchSynthesisOutputs
Batch Synthesis output.
| Name | Type | Description |
|---|---|---|
| result |
string |
sas url of the result file. It would be relative path when "destinationContainerUrl" is specified. |
| summary |
string |
Job summary file path will be provided here. |
BatchSynthesisProperties
Detailed properties of batch synthesis task.
| Name | Type | Default value | Description |
|---|---|---|---|
| billingDetails |
Batch synthesis billing details. |
||
| destinationContainerUrl |
string (uri) |
The requested destination container. It is an URL with Service adhoc SAS to a writeable container in Azure Blob storage. |
|
| destinationPath |
string |
The destination path in destination container, default to null, it only works when "DestinationContainerUrl" is specified, the final path prefix of outputs should be {DestinationPath}/{TaskId}/, or {TaskId} if not specified. |
|
| durationInMilliseconds |
integer (int32) |
The generated video duration in milliseconds. |
|
| error |
Batch Synthesis Error. |
||
| failedCount |
integer (int32) |
The number of failed videos. |
|
| sizeInBytes |
integer (int64) |
The generated video size in bytes. |
|
| succeededCount |
integer (int32) |
The number of succeeded videos. |
|
| timeToLiveInHours |
integer (int32) |
774 |
How long the batch task will be kept in the system after it has completed. Once the batch task reaches the time to live after completion (succeeded or failed) it will be automatically deleted. The longest supported duration is 744 hours (31 days). The default value is 744 hours. |
Error
The error object.
| Name | Type | Description |
|---|---|---|
| code |
Top-level error code |
|
| details |
Error[] |
Additional supportive details regarding the error and/or expected policies. |
| innererror |
An object containing more specific information about the error. |
|
| message |
string |
High level error message. |
| target |
string |
The source of the error. |
ErrorCode
Top-level error code
| Value | Description |
|---|---|
| BadRequest | |
| BadArgument | |
| Unauthorized | |
| Forbidden | |
| NotFound | |
| UnsupportedMediaType | |
| TooManyRequests | |
| InternalServerError | |
| ServiceUnavailable |
ErrorResponse
Error response.
| Name | Type | Description |
|---|---|---|
| error |
The error object. |
InnerError
An object containing more specific information about the error.
| Name | Type | Description |
|---|---|---|
| code |
string |
Detailed error code to help diagnostic. |
| innererror |
An object containing more specific information about the error. |
|
| message |
string |
Detailed error message. |
| target |
string |
The source of the error. |
InputKind
Input kind.
| Value | Description |
|---|---|
| PlainText | |
| SSML |
PaginatedBatchSynthesis
Paginated batch synthesis task list
| Name | Type | Description |
|---|---|---|
| nextLink |
string (uri) |
Link to next page. |
| value |
Batch synthesis tasks. |
Status
Status of a resource.
| Value | Description |
|---|---|
| NotStarted | |
| Running | |
| Succeeded | |
| Failed |