GPT-5-series models not supporting data_sources (Azure AI Search) in chat/completions API

Question

GPT-5-series models not supporting data_sources (Azure AI Search) in chat/completions API

Gokulamurthy Purushothaman 0

When using the GPT-5 family models (gpt-5, gpt-5-chat) via the Azure OpenAI chat/completions API, the request fails validation whenever the data_sources parameter (used for Azure AI Search / “On Your Data” integration) is included. This same pattern works correctly with GPT-4o-mini and other GPT-4 variants. I'm unable to do the same in "Playgrounds" under the gpt-5 models, "datasource" parameters were missing.

Expected Behavior

The data_sources parameter should be accepted and processed by GPT-5-series deployments, just as it is for GPT-4o-mini, allowing Azure AI Search (Cognitive Search) to be used as an external grounding source for contextual chat responses.

Steps to Reproduce

Create a GPT-5-chat deployment on Azure OpenAI (2025-01-01-preview).
Call the /chat/completions endpoint with a valid message payload.
Add a data_sources array referencing an existing Azure AI Search index.
Observe the 400 validation error or loss of grounding support.

Sample response:

{

"id": "15281be3-9414-4d7b-a96d-39b05cf600b7",

"model": "gpt-5-chat",

"created": 1761091658,

"object": "extensions.chat.completion",

"choices": [

{

  "index": 0,

  "finish_reason": "stop",

  "message": {

    "role": "assistant",

    "content": "The requested information is not available in the retrieved data. Please try another query or topic.",

    "context": { "citations": [] }

  }

}

],

"usage": { "prompt_tokens": 3175, "completion_tokens": 21, "total_tokens": 3196 }

}

SRILAKSHMI C 8,275 Reputation points Microsoft External Staff Moderator

2025-10-23T04:38:53.7233333+00:00

Hi Gokulamurthy Purushothaman,

Did you get any chance to review the below response. Do let me know if you have any further queries.

Thank you!

2 answers

Your answer

SRILAKSHMI C 8,275 Reputation points Microsoft External Staff Moderator

2025-10-23T04:38:53.7233333+00:00

Hi Gokulamurthy Purushothaman,

Did you get any chance to review the below response. Do let me know if you have any further queries.

Thank you!

Answer 1

Hello Gokulamurthy Purushothaman,

You're correct in observing that GPT-5 preview models (gpt-5, gpt-5-chat) currently have limited support for the data_sources parameter that enables Azure AI Search / "On Your Data" integration. This is a known limitation with the preview release.

Current Status:

The GPT-5 models in the 2025-01-01-preview API version are in early preview and do not yet have full feature parity with GPT-4 models. The data_sources parameter and Azure AI Search grounding capabilities are among the features still being rolled out for GPT-5 series.

Why This Happens:

Preview Limitations: The GPT-5 models are in preview status, and Microsoft is gradually enabling features as they validate performance and compatibility
API Version Dependencies: The extensions API (extensions.chat.completion) that handles data_sources integration may not be fully implemented for GPT-5 yet
Feature Rollout Strategy: Microsoft typically releases new models with core capabilities first, then adds advanced features like RAG (Retrieval-Augmented Generation) integration

Current Workarounds:

Use GPT-4o or GPT-4o-mini: For production workloads requiring Azure AI Search integration, continue using GPT-4o-mini or GPT-4 models which fully support data_sources
Implement Custom RAG: You can implement your own retrieval logic by:
- Querying Azure AI Search directly
- Including retrieved context in your system message or user prompt
- Sending the enriched prompt to GPT-5
Monitor Preview Updates: Since this is a preview API, capabilities are being added regularly

Example Custom RAG Pattern:


# Query Azure Search separately

search_results = azure_search_client.search(query)

context = "\n".join([doc['content'] for doc in search_results])

# Include context in prompt

messages = [

    {"role": "system", "content": f"Answer based on this context: {context}"},

    {"role": "user", "content": user_query}

]

# Call GPT-5 without data_sources parameter

response = openai_client.chat.completions.create(

    model="gpt-5-chat",

    messages=messages

)

Next Steps:

Check Azure Updates: Monitor the Azure OpenAI What's New page for announcements about GPT-5 feature availability
Review API Changelog: The 2025-01-01-preview API is still evolving, and data_sources support may be added in upcoming preview versions
Provide Feedback: Report this through Azure Portal feedback to help prioritize the feature

Expected Timeline:

While I don't have specific dates, data_sources support for GPT-5 models will likely be added as the preview progresses toward general availability. For now, GPT-4o-mini offers excellent performance with full Azure AI Search integration if you need that capability immediately.

Best Regards,

Jerald Felix

Gokulamurthy Purushothaman 0 Reputation points

2025-10-24T03:07:49.94+00:00

Hi Jerald Felix,

Thank you for the detailed clarification and confirmation regarding the current GPT-5 preview limitations. It’s helpful to know that the data_sources (Azure AI Search / On-Your-Data) integration is still in rollout for GPT-5-series models under the 2025-01-01-preview API.

I’ve noticed that the GPT-4.1-mini model fully supports the data_sources parameter and, according to the Azure roadmap, is expected to remain available and supported until April 2026. Would it be advisable for us to upgrade our RAG (product chat) workloads to GPT-4.1-mini for now and continue using that until GPT-5 models achieve feature parity with Azure AI Search integration?

Also, if there’s any preview program or tentative timeline for enabling data_sources with GPT-5 / GPT-5-Chat, please let me know — we’d be interested in participating or testing once it becomes available.

Appreciate your guidance and support on this.

Kind regards,
Gokul
SRILAKSHMI C 8,275 Reputation points Microsoft External Staff Moderator

2025-10-24T12:00:55.15+00:00

Hi Gokulamurthy Purushothaman,

Thank you for the follow-up and for confirming the details on your end you’re absolutely correct in your understanding.

At this time, the GPT-5 and GPT-5-chat models (under the 2025-01-01-preview API) are still in the early preview phase and do not yet support the data_sources (Azure AI Search / On-Your-Data) integration. Microsoft is actively working to bring full feature parity with GPT-4-series models as the preview matures, but this capability has not yet been enabled for GPT-5.

Recommended Approach:

Yes, your plan to migrate or continue your RAG workloads using GPT-4.1-mini is both practical and advisable at this stage.

GPT-4.1-mini offers full support for data_sources, Azure AI Search grounding, and “On Your Data” integration.

It will remain supported until at least April 2026, based on the current Azure roadmap, providing stability for ongoing production workloads.

It also maintains a strong balance between performance, cost, and latency for RAG-based applications.

You can continue building and optimizing your product chat or RAG solution on GPT-4.1-mini and plan for an eventual migration to GPT-5 once the following are confirmed:

Feature parity (data_sources and Azure AI Search integration)

API version stabilization (beyond preview)

GA release timeline published by Microsoft

Upcoming GPT-5 Integration & Preview Access

There’s currently no publicly announced timeline or preview enrollment program specifically for enabling data_sources with GPT-5-series models. However:

The Azure OpenAI team is gradually enabling these integrations across preview APIs.

Please keep monitor the Azure OpenAI “What’s New” page and Azure Updates for announcements on GPT-5 feature releases and regional availability.

Thank you!
SRILAKSHMI C 8,275 Reputation points Microsoft External Staff Moderator

2025-10-27T10:56:31.3966667+00:00

Hi Gokulamurthy Purushothaman,

We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet. In case if you have any resolution please do share that same with the community as it can be helpful to others. Otherwise, will respond with more details and we will try to help.

Thank you!

Answer 2

Hi! We have an EastUS2 deployment of gpt-5-mini that shows as GenerallyAvailable (see screenshot below) but also unable to use RAG / Azure AI Search with the Chat Completions API via Microsoft.SemanticKernel version 1.66.

Using the "SetNewMaxCompletionTokensEnabled = true" setting still causes "Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead"

Also setting MaxTokens = 2000 and/or ReasoningEffort = ChatReasoningEffortLevel.Low causes "Validation error at #/reasoning_effort: Extra inputs are not permitted\nValidation error at #/max_completion_tokens: Extra inputs are not permitted"

Using apiVersion: "2025-04-01-preview" produces the same results as above.

Note: Using "SetNewMaxCompletionTokensEnabled = true" without specifying an AzureChatDataSource works great!

What are we missing?
User's image

Share via

GPT-5-series models not supporting data_sources (Azure AI Search) in chat/completions API

2 answers

Your answer