Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
The Custom Web API skill allows you to extend AI enrichment by calling out to a Web API endpoint providing custom operations. Similar to built-in skills, a Custom Web API skill has inputs and outputs. Depending on the inputs, your Web API receives a JSON payload when the indexer runs, and outputs a JSON payload as a response, along with a success status code. The response is expected to have the outputs specified by your custom skill. Any other response is considered an error and no enrichments are performed. The structure of the JSON payload is described further down in this document.
The Custom Web API skill is also used in the implementation of Azure OpenAI On Your Data feature. If Azure OpenAI is configured for role-based access and you get 403 Forbidden calls when creating the vector index, verify that Azure AI Search has a system assigned identity and runs as a trusted service on Azure OpenAI.
Note
The indexer retries twice for certain standard HTTP status codes returned from the Web API. These HTTP status codes are:
- 502 Bad Gateway
- 503 Service Unavailable
- 429 Too Many Requests
@odata.type
Microsoft.Skills.Custom.WebApiSkill
Skill parameters
Parameters are case-sensitive.
| Parameter name | Description | 
|---|---|
| uri | The URI of the Web API to which the JSON payload is sent. Only the https URI scheme is allowed. | 
| authResourceId | (Optional) A string that if set, indicates that this skill should use a system managed identity on the connection to the function or app hosting the code. This property takes an application (client) ID or app's registration in Microsoft Entra ID, in any of these formats: api://<appId>,<appId>/.default,api://<appId>/.default. This value is used to scope the authentication token retrieved by the indexer, and is sent along with the custom Web skill API request to the function or app. Setting this property requires that your search service is configured for managed identity and your Azure function app is configured for a Microsoft Entra sign in. To use this parameter, call the API withapi-version=2023-10-01-Preview. | 
| authIdentity | (Optional) A user-managed identity used by the search service for connecting to the function or app hosting the code. You can use either a system or user managed identity. To use a system managed identity, leave authIdentityblank. | 
| httpMethod | The method to use while sending the payload. Allowed methods are PUTorPOST | 
| httpHeaders | A collection of key-value pairs where the keys represent header names and values represent header values that are sent to your Web API along with the payload. The following headers are prohibited from being in this collection: Accept,Accept-Charset,Accept-Encoding,Content-Length,Content-Type,Cookie,Host,TE,Upgrade,Via. | 
| timeout | (Optional) When specified, indicates the timeout for the http client making the API call. It must be formatted as an XSD "dayTimeDuration" value (a restricted subset of an ISO 8601 duration value). For example, PT60Sfor 60 seconds. If not set, a default value of 30 seconds is chosen. The timeout can be set to a maximum of 230 seconds and a minimum of 1 second. | 
| batchSize | (Optional) Indicates how many "data records" (see JSON payload structure below) is sent per API call. If not set, a default of 1000 is chosen. We recommend that you make use of this parameter to achieve a suitable tradeoff between indexing throughput and load on your API. | 
| degreeOfParallelism | (Optional) When specified, indicates the number of calls the indexer makes in parallel to the endpoint you provide. You can decrease this value if your endpoint is failing under pressure, or raise it if your endpoint can handle the load. If not set, a default value of 5 is used. The degreeOfParallelismcan be set to a maximum of 10 and a minimum of 1. | 
Skill inputs
There are no predefined inputs for this skill. The inputs are any existing field, or any node in the enrichment tree that you want to pass to your custom skill.
Skill outputs
There are no predefined outputs for this skill. Be sure to define an output field mapping in the indexer if the skill's output should be sent to a field in the search index.
Sample definition
{
  "@odata.type": "#Microsoft.Skills.Custom.WebApiSkill",
  "description": "A custom skill that can identify positions of different phrases in the source text",
  "uri": "https://contoso.count-things.com",
  "batchSize": 4,
  "context": "/document",
  "inputs": [
    {
      "name": "text",
      "source": "/document/content"
    },
    {
      "name": "language",
      "source": "/document/languageCode"
    },
    {
      "name": "phraseList",
      "source": "/document/keyphrases"
    }
  ],
  "outputs": [
    {
      "name": "hitPositions"
    }
  ]
}
Sample input JSON structure
This JSON structure represents the payload that is sent to your Web API. It always follows these constraints:
- The top-level entity is called - valuesand is an array of objects. The number of such objects are at most the- batchSize.
- Each object in the - valuesarray has:- A - recordIdproperty that is a unique string, used to identify that record.
- A - dataproperty that is a JSON object. The fields of the- dataproperty correspond to the "names" specified in the- inputssection of the skill definition. The values of those fields are from the- sourceof those fields (which could be from a field in the document, or potentially from another skill).
 
{
    "values": [
      {
        "recordId": "0",
        "data":
           {
             "text": "Este es un contrato en Inglés",
             "language": "es",
             "phraseList": ["Este", "Inglés"]
           }
      },
      {
        "recordId": "1",
        "data":
           {
             "text": "Hello world",
             "language": "en",
             "phraseList": ["Hi"]
           }
      },
      {
        "recordId": "2",
        "data":
           {
             "text": "Hello world, Hi world",
             "language": "en",
             "phraseList": ["world"]
           }
      },
      {
        "recordId": "3",
        "data":
           {
             "text": "Test",
             "language": "es",
             "phraseList": []
           }
      }
    ]
}
Sample output JSON structure
The "output" corresponds to the response returned from your Web API. The Web API should only return a JSON payload (verified by looking at the Content-Type response header) and should satisfy the following constraints:
- There should be a top-level entity called - values, which should be an array of objects.
- The number of objects in the array should be the same as the number of objects sent to the Web API. 
- Each object should have: - A - recordIdproperty.
- A - dataproperty, which is an object where the fields are enrichments matching the "names" in the- outputand whose value is considered the enrichment.
- An - errorsproperty, an array listing any errors encountered that is added to the indexer execution history. This property is required, but can have a- nullvalue.
- A - warningsproperty, an array listing any warnings encountered that is added to the indexer execution history. This property is required, but can have a- nullvalue.
 
- The ordering of objects in the - valuesin either the request or response isn't important. However, the- recordIdis used for correlation so any record in the response containing a- recordId, which wasn't part of the original request to the Web API is discarded.
{
    "values": [
        {
            "recordId": "3",
            "data": {
            },
            "errors": [
              {
                "message" : "'phraseList' should not be null or empty"
              }
            ],
            "warnings": null
        },
        {
            "recordId": "2",
            "data": {
                "hitPositions": [6, 16]
            },
            "errors": null,
            "warnings": null
        },
        {
            "recordId": "0",
            "data": {
                "hitPositions": [0, 23]
            },
            "errors": null,
            "warnings": null
        },
        {
            "recordId": "1",
            "data": {
                "hitPositions": []
            },
            "errors": null,
            "warnings": [
              {
                "message": "No occurrences of 'Hi' were found in the input text"
              }
            ]
        },
    ]
}
Error cases
In addition to your Web API being unavailable, or sending out non-successful status codes the following are considered erroneous cases:
- If the Web API returns a success status code but the response indicates that it isn't - application/jsonthen the response is considered invalid and no enrichments are performed.
- If there are invalid records (for example, - recordIdis missing or duplicated) in the response- valuesarray, no enrichment is performed for the invalid records. It's important to adhere to the Web API skill contract when developing custom skills. You can refer to this example provided in the Power Skill repository that follows the expected contract.
For cases when the Web API is unavailable or returns an HTTP error, a friendly error with any available details about the HTTP error is added to the indexer execution history.