Deployments - Create Or Update
Creates or updates the specified deployment associated with the Cognitive Services account.
PUT https://management.azure.com/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.CognitiveServices/accounts/{accountName}/deployments/{deploymentName}?api-version=2024-10-01
URI Parameters
| Name | In | Required | Type | Description |
|---|---|---|---|---|
| accountName | path | True | string (minLength: 2, maxLength: 64, pattern: `^[a-zA-Z0-9][a-zA-Z0-9_.-]*$`) | The name of the Cognitive Services account. |
| deploymentName | path | True | string | The name of the deployment associated with the Cognitive Services account. |
| resourceGroupName | path | True | string (minLength: 1, maxLength: 90) | The name of the resource group. The name is case insensitive. |
| subscriptionId | path | True | string (minLength: 1) | The ID of the target subscription. |
| api-version | query | True | string (minLength: 1) | The API version to use for this operation. |
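The constraints in the table above can be checked on the client before a request is sent. The following is a minimal sketch, not an official helper: the function name `build_deployment_url` and its error messages are illustrative assumptions, while the host, path layout, and limits come from the request line and table above.

```python
import re
from urllib.parse import quote

# Pattern and length limits copied from the URI Parameters table.
ACCOUNT_NAME_PATTERN = re.compile(r"^[a-zA-Z0-9][a-zA-Z0-9_.-]*$")


def build_deployment_url(subscription_id, resource_group_name, account_name,
                         deployment_name, api_version="2024-10-01"):
    """Validate the path parameters and return the PUT URL (hypothetical helper)."""
    if len(subscription_id) < 1:
        raise ValueError("subscriptionId must not be empty")
    if not 1 <= len(resource_group_name) <= 90:
        raise ValueError("resourceGroupName must be 1 to 90 characters long")
    if not 2 <= len(account_name) <= 64:
        raise ValueError("accountName must be 2 to 64 characters long")
    if not ACCOUNT_NAME_PATTERN.fullmatch(account_name):
        raise ValueError("accountName does not match the required pattern")
    if len(api_version) < 1:
        raise ValueError("api-version must not be empty")

    # Percent-encode each path segment so unexpected characters cannot alter the path.
    def segment(value):
        return quote(value, safe="")

    return (
        "https://management.azure.com"
        f"/subscriptions/{segment(subscription_id)}"
        f"/resourceGroups/{segment(resource_group_name)}"
        "/providers/Microsoft.CognitiveServices"
        f"/accounts/{segment(account_name)}"
        f"/deployments/{segment(deployment_name)}"
        f"?api-version={segment(api_version)}"
    )
```

Validating locally only catches malformed names early; the service remains the authority on whether a name is accepted.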
Request Body
| Name | Type | Description |
|---|---|---|
| properties | DeploymentProperties | Properties of Cognitive Services account deployment. |
| sku | Sku | The resource model definition representing SKU |
| tags | object | Resource tags. |
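As an illustration of how the three top-level fields fit together, here is a small sketch that assembles a request body as a plain dictionary. The helper name `build_deployment_body` and its default values are assumptions made for this example (the defaults mirror the sample request on this page); the field names come from the table above and the DeploymentModel and Sku definitions.

```python
def build_deployment_body(model_format, model_name, model_version=None,
                          sku_name="Standard", capacity=1, tags=None):
    """Assemble a request body dictionary (hypothetical helper)."""
    model = {"format": model_format, "name": model_name}
    # The version is optional; when omitted, the service assigns a default.
    if model_version is not None:
        model["version"] = model_version

    body = {
        "sku": {"name": sku_name, "capacity": capacity},
        "properties": {"model": model},
    }
    # Only send tags when the caller actually supplied some.
    if tags:
        body["tags"] = dict(tags)
    return body
```

Calling `build_deployment_body("OpenAI", "ada", "1")` yields the same structure as the sample request body shown in the Examples section.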
Responses
| Name | Type | Description |
|---|---|---|
| 200 OK | Deployment | The deployment was created or updated successfully. |
| 201 Created | Deployment | The deployment was created successfully. |
| Other Status Codes | ErrorResponse | Error response describing why the operation failed. |
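A client typically branches on the status code: 200 and 201 carry a deployment object, and anything else carries an error response. The sketch below shows one way to do that with the standard library only. The names `DeploymentRequestError` and `parse_put_response` are invented for this example, and the fallback for non-JSON error bodies is an assumption about defensive handling rather than documented service behavior.

```python
import json


class DeploymentRequestError(Exception):
    """Raised for any status code other than 200 or 201 (hypothetical type)."""

    def __init__(self, status, code, message):
        super().__init__(f"HTTP {status}: {code}: {message}")
        self.status = status
        self.code = code
        self.message = message


def parse_put_response(status, body_text):
    """Return the deployment dictionary on success, raise on failure."""
    try:
        payload = json.loads(body_text) if body_text else {}
    except ValueError:
        # Assumption: tolerate a non-JSON body so the status code is still reported.
        payload = {}

    if status in (200, 201):
        return payload

    error = payload.get("error", {}) if isinstance(payload, dict) else {}
    raise DeploymentRequestError(status, error.get("code"), error.get("message"))
```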
Examples
PutDeployment
Sample request
PUT https://management.azure.com/subscriptions/subscriptionId/resourceGroups/resourceGroupName/providers/Microsoft.CognitiveServices/accounts/accountName/deployments/deploymentName?api-version=2024-10-01
{
  "sku": {
    "name": "Standard",
    "capacity": 1
  },
  "properties": {
    "model": {
      "format": "OpenAI",
      "name": "ada",
      "version": "1"
    }
  }
}
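The sample request above could be prepared with Python's standard library roughly as follows. This is a sketch under stated assumptions: the function name `build_put_request` is made up for this example, the bearer token is a placeholder that the caller must obtain separately, and the request object is only constructed here, not sent.

```python
import json
import urllib.request


def build_put_request(url, body, access_token):
    """Build (but do not send) a PUT request carrying a JSON body."""
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        method="PUT",
        headers={
            # access_token is a placeholder; acquiring it is outside this sketch.
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; a non-success status code then surfaces as `urllib.error.HTTPError`.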
Sample response
Status code: 200
{
  "id": "/subscriptions/subscriptionId/resourceGroups/resourceGroupName/providers/Microsoft.CognitiveServices/accounts/accountName/deployments/deploymentName",
  "name": "deploymentName",
  "type": "Microsoft.CognitiveServices/accounts/deployments",
  "sku": {
    "name": "Standard",
    "capacity": 1
  },
  "properties": {
    "model": {
      "format": "OpenAI",
      "name": "ada",
      "version": "1"
    },
    "provisioningState": "Succeeded"
  }
}
Status code: 201
{
  "id": "/subscriptions/subscriptionId/resourceGroups/resourceGroupName/providers/Microsoft.CognitiveServices/accounts/accountName/deployments/deploymentName",
  "name": "deploymentName",
  "type": "Microsoft.CognitiveServices/accounts/deployments",
  "sku": {
    "name": "Standard",
    "capacity": 1
  },
  "properties": {
    "model": {
      "format": "OpenAI",
      "name": "ada",
      "version": "1"
    },
    "provisioningState": "Accepted"
  }
}
Definitions
| Name | Description |
|---|---|
| CallRateLimit | The call rate limit Cognitive Services account. |
| createdByType | The type of identity that created the resource. |
| Deployment | Cognitive Services account deployment. |
| DeploymentCapacitySettings | Internal use only. |
| DeploymentModel | Properties of Cognitive Services account deployment model. |
| DeploymentModelVersionUpgradeOption | Deployment model version upgrade option. |
| DeploymentProperties | Properties of Cognitive Services account deployment. |
| DeploymentProvisioningState | Gets the status of the resource at the time the operation was called. |
| DeploymentScaleSettings | Properties of Cognitive Services account deployment model. (Deprecated, please use Deployment.sku instead.) |
| DeploymentScaleType | Deployment scale type. |
| ErrorAdditionalInfo | The resource management error additional info. |
| ErrorDetail | The error detail. |
| ErrorResponse | Error response |
| RequestMatchPattern | |
| Sku | The resource model definition representing SKU |
| SkuTier | This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT. |
| systemData | Metadata pertaining to creation and last modification of the resource. |
| ThrottlingRule | |
CallRateLimit
The call rate limit Cognitive Services account.
| Name | Type | Description |
|---|---|---|
| count | number | The count value of Call Rate Limit. |
| renewalPeriod | number | The renewal period in seconds of Call Rate Limit. |
| rules | ThrottlingRule[] | |
createdByType
The type of identity that created the resource.
| Value | Description |
|---|---|
| User | |
| Application | |
| ManagedIdentity | |
| Key | |
Deployment
Cognitive Services account deployment.
| Name | Type | Description |
|---|---|---|
| etag | string | Resource Etag. |
| id | string | Fully qualified resource ID for the resource. Example: `/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName}` |
| name | string | The name of the resource. |
| properties | DeploymentProperties | Properties of Cognitive Services account deployment. |
| sku | Sku | The resource model definition representing SKU |
| systemData | systemData | Metadata pertaining to creation and last modification of the resource. |
| tags | object | Resource tags. |
| type | string | The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts" |
DeploymentCapacitySettings
Internal use only.
| Name | Type | Description |
|---|---|---|
| designatedCapacity | integer (int32), minimum: 0 | The designated capacity. |
| priority | integer (int32), minimum: 0 | The priority of this capacity setting. |
DeploymentModel
Properties of Cognitive Services account deployment model.
| Name | Type | Description |
|---|---|---|
| callRateLimit | CallRateLimit | The call rate limit Cognitive Services account. |
| format | string | Deployment model format. |
| name | string | Deployment model name. |
| publisher | string | Deployment model publisher. |
| source | string | Optional. Deployment model source ARM resource ID. |
| sourceAccount | string (arm-id) | Optional. Source of the model, another Microsoft.CognitiveServices accounts ARM resource ID. |
| version | string | Optional. Deployment model version. If no version is specified, a default version is assigned. The default version differs between models and might change when a new version becomes available for a model. The default version for a model can be found with the list models API. |
DeploymentModelVersionUpgradeOption
Deployment model version upgrade option.
| Value | Description |
|---|---|
| OnceNewDefaultVersionAvailable | |
| OnceCurrentVersionExpired | |
| NoAutoUpgrade | |
DeploymentProperties
Properties of Cognitive Services account deployment.
| Name | Type | Description |
|---|---|---|
| callRateLimit | CallRateLimit | The call rate limit Cognitive Services account. |
| capabilities | object | The capabilities. |
| capacitySettings | DeploymentCapacitySettings | Internal use only. |
| currentCapacity | integer (int32) | The current capacity. |
| dynamicThrottlingEnabled | boolean | Whether dynamic throttling is enabled. |
| model | DeploymentModel | Properties of Cognitive Services account deployment model. |
| parentDeploymentName | string | The name of parent deployment. |
| provisioningState | DeploymentProvisioningState | Gets the status of the resource at the time the operation was called. |
| raiPolicyName | string | The name of RAI policy. |
| rateLimits | ThrottlingRule[] | |
| scaleSettings | DeploymentScaleSettings | Properties of Cognitive Services account deployment model. (Deprecated, please use Deployment.sku instead.) |
| versionUpgradeOption | DeploymentModelVersionUpgradeOption | Deployment model version upgrade option. |
DeploymentProvisioningState
Gets the status of the resource at the time the operation was called.
| Value | Description |
|---|---|
| Accepted | |
| Creating | |
| Deleting | |
| Moving | |
| Failed | |
| Succeeded | |
| Disabled | |
| Canceled | |
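Because a create request can return while the deployment is still in a state such as Accepted, callers often poll until the state settles. The sketch below is one possible polling loop. It rests on two assumptions that this page does not confirm: that Succeeded, Failed, and Canceled are the states worth treating as final, and that the caller supplies a hypothetical `fetch` callable returning the current deployment as a dictionary.

```python
import time

# Assumption: these states are treated as final; this page does not say which ones are.
TERMINAL_STATES = frozenset({"Succeeded", "Failed", "Canceled"})


def wait_for_deployment(fetch, interval_seconds=5.0, max_attempts=60, sleep=time.sleep):
    """Call fetch() until the deployment reaches a state in TERMINAL_STATES.

    fetch is a caller-supplied function (hypothetical) that returns the
    deployment as a dictionary shaped like the sample response.
    """
    for attempt in range(max_attempts):
        deployment = fetch()
        state = deployment.get("properties", {}).get("provisioningState")
        if state in TERMINAL_STATES:
            return deployment
        # Do not sleep after the final attempt; fall through to the timeout.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"deployment still not settled after {max_attempts} attempts")
```

The `sleep` parameter exists so tests can substitute a stub instead of waiting in real time.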
DeploymentScaleSettings
Properties of Cognitive Services account deployment model. (Deprecated, please use Deployment.sku instead.)
| Name | Type | Description |
|---|---|---|
| activeCapacity | integer (int32) | Deployment active capacity. This value might be different from |
| capacity | integer (int32) | Deployment capacity. |
| scaleType | DeploymentScaleType | Deployment scale type. |
DeploymentScaleType
Deployment scale type.
| Value | Description |
|---|---|
| Standard | |
| Manual | |
ErrorAdditionalInfo
The resource management error additional info.
| Name | Type | Description |
|---|---|---|
| info | object | The additional info. |
| type | string | The additional info type. |
ErrorDetail
The error detail.
| Name | Type | Description |
|---|---|---|
| additionalInfo | ErrorAdditionalInfo[] | The error additional info. |
| code | string | The error code. |
| details | ErrorDetail[] | The error details. |
| message | string | The error message. |
| target | string | The error target. |
ErrorResponse
Error response
| Name | Type | Description |
|---|---|---|
| error | ErrorDetail | The error object. |
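Since an error detail can carry further error details inside `details`, a readable message usually requires walking the structure recursively. The following sketch flattens an error response into indented lines. The function names and the output layout are illustrative choices for this example, not part of the API.

```python
def flatten_error(error, depth=0):
    """Return one indented line per error detail, visiting nested details depth first."""
    line = "  " * depth + f"{error.get('code', '')}: {error.get('message', '')}"
    target = error.get("target")
    if target:
        line += f" (target: {target})"

    lines = [line]
    # details may be missing or null, so fall back to an empty list.
    for child in error.get("details") or []:
        lines.extend(flatten_error(child, depth + 1))
    return lines


def summarize_error_response(response):
    """Render the error object of an error response as a multi-line string."""
    return "\n".join(flatten_error(response.get("error", {})))
```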
RequestMatchPattern
| Name | Type | Description |
|---|---|---|
| method | string | |
| path | string | |
Sku
The resource model definition representing SKU
| Name | Type | Description |
|---|---|---|
| capacity | integer (int32) | If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted. |
| family | string | If the service has different generations of hardware for the same SKU, that can be captured here. |
| name | string | The name of the SKU, for example P3. It is typically a letter and number code. |
| size | string | The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code. |
| tier | SkuTier | This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT. |
SkuTier
This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.
| Value | Description |
|---|---|
| Free | |
| Basic | |
| Standard | |
| Premium | |
| Enterprise | |
systemData
Metadata pertaining to creation and last modification of the resource.
| Name | Type | Description |
|---|---|---|
| createdAt | string (date-time) | The timestamp of resource creation (UTC). |
| createdBy | string | The identity that created the resource. |
| createdByType | createdByType | The type of identity that created the resource. |
| lastModifiedAt | string (date-time) | The timestamp of resource last modification (UTC). |
| lastModifiedBy | string | The identity that last modified the resource. |
| lastModifiedByType | createdByType | The type of identity that last modified the resource. |
ThrottlingRule
| Name | Type | Description |
|---|---|---|
| count | number | |
| dynamicThrottlingEnabled | boolean | |
| key | string | |
| matchPatterns | RequestMatchPattern[] | |
| minCount | number | |
| renewalPeriod | number | |