401 Authentication Error When Deploying Managed Compute Endpoint

Freddy Mei 20 Reputation points
2025-10-21T22:40:59.9266667+00:00

I’m deploying an endpoint using Managed Compute mode for the gpt-5-mini model, and the deployment fails with a 401 “Server failed to authenticate the request” error. From the logs, it appears that the container is attempting to download the model from an Azure Blob Storage location, but authentication to that storage account is failing.

The blob storage seems to be an internal Azure-managed resource, so I don’t have direct access or visibility to modify permissions or regenerate credentials.

Has anyone encountered this issue before? Are there any guidelines or configuration steps to ensure the managed endpoint can successfully authenticate when retrieving the model? Any guidance would be greatly appreciated.

Please let me know if additional details about the error message or deployment are needed.

User's image

User's image

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
{count} votes

1 answer

Sort by: Most helpful
  1. Jerald Felix 7,520 Reputation points
    2025-10-22T02:50:25.4266667+00:00

    Hey Freddy,

    That's a frustrating one! I've seen this authentication issue pop up a few times with managed compute deployments, especially with the newer GPT-5 models. The 401 error you're seeing is usually related to the managed identity not having the right permissions to access Azure's internal blob storage.

    Here's what's likely happening: When you deploy a model on managed compute, Azure creates a system-assigned managed identity for your endpoint that needs to download the model files from Microsoft's internal storage. Sometimes this identity doesn't get the right permissions automatically.

    Quick things to try:

    1. Check your workspace permissions - Make sure your user account has "Contributor" or "Owner" role on the AI Foundry workspace (not just the resource group)
    2. Try a different region - I know it's annoying, but sometimes certain regions have temporary issues with the automatic permission assignment. If you're not tied to a specific region, try deploying in East US or West Europe
    3. Wait and retry - Sometimes it's just a temporary glitch. Give it 15-20 minutes and try deploying again

    If those don't work:

    Open a support ticket and mention the specific error from your screenshot. The Azure AI team can manually fix the managed identity permissions on the backend - they've had to do this for several people I know.

    When you open the ticket, include:

    • Your workspace name and region
    • The exact model you're trying to deploy (gpt-5-mini)
    • The timestamp of when you tried the deployment
    • That screenshot you already have

    One more thing to check: Make sure you're not hitting any quota limits for GPT-5 models in your subscription. Sometimes quota issues can show up as weird authentication errors.

    Let me know if any of those work for you! This is definitely a known issue with the service, not something you did wrong.

    Best Regards,

    Jerald Felix


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.