CosmosDB for Mongo upgrade stuck for hours

William Souza 0

We are trying to scale an Azure Cosmos DB for MongoDB instance, but the operation has been stuck for hours. As an alternative, we tried to create another resource, but it also got stuck in the upgrade process.

The resource is also behaving strangely: CPU usage stays high even when our API traffic is low, and we cannot fix it because the upgrade process is not working.

Answer 1

Hello William Souza,

Sorry for the disruption with your Cosmos DB for MongoDB. Upgrades (e.g., scaling RU/s) that stay stuck for hours, new resources that hang, and unexplained high CPU during low traffic together point to backend throttling or regional capacity issues, which in my experience have been more common since the 2025 Mongo API updates. I have seen this in production; here is a concise path to diagnose and resolve it.

Quick Diagnostics

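Check Status: In the portal, open Cosmos DB > Your account and check whether the account status is stuck in "Updating". Under Metrics, the "Provisioned Throughput" metric shows whether the new value ever took effect. Resource health may show "Degraded" (platform-initiated).
Logs: Enable Diagnostic settings > Log Analytics, then look for failed operations in the control-plane logs (ControlPlaneRequests) and for request errors in the data-plane logs (DataPlaneRequests, or MongoRequests for the Mongo API). CLI: az cosmosdb show --resource-group <rg> --name <account> --query provisioningState (the CLI output lists provisioningState at the top level, not under "properties").
CPU Spike: For high CPU with low traffic, check for hot partitions and for slow or unindexed Mongo queries; Data Explorer in the portal lets you run suspect queries by hand.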
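
The provisioning-state check above can be scripted as a polling loop, so you can see exactly when (or whether) the account leaves "Updating". This is only a minimal sketch, assuming the az CLI is installed and logged in; the function name wait_for_provisioning and its default retry values are illustrative choices, not anything Azure prescribes.

```shell
# Poll the account's provisioningState until it is no longer "Updating".
# Args: resource group, account name,
#       max attempts (default 30), seconds between attempts (default 60).
# Prints the last observed state on stdout.
# Returns 0 once the state has left "Updating", 1 if it never did,
# and 2 if the az call itself failed.
wait_for_provisioning() {
  rg="$1"; account="$2"; attempts="${3:-30}"; interval="${4:-60}"
  i=0
  state="Unknown"
  while [ "$i" -lt "$attempts" ]; do
    state="$(az cosmosdb show --resource-group "$rg" --name "$account" \
      --query provisioningState --output tsv)" || return 2
    # Progress goes to stderr so stdout carries only the final state.
    echo "attempt $((i + 1)): provisioningState=$state" >&2
    if [ "$state" != "Updating" ]; then
      echo "$state"
      return 0
    fi
    i=$((i + 1))
    sleep "$interval"
  done
  echo "$state"
  return 1
}
```

For example, wait_for_provisioning <rg> <account> 10 120 checks every two minutes for up to twenty minutes. A non-zero exit after that is a reasonable point to stop waiting and escalate.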

Resolution Steps

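Retry with Limits: Resubmit the throughput change from the CLI with a low target: az cosmosdb mongodb database throughput update --resource-group <rg> --account-name <account> --name <db> --throughput 400 (start low). This does not cancel the in-flight operation, and it may be rejected while the account is still updating. Wait 30 mins; if it is still stuck and you are on a cluster-based deployment, scale to a single-node setup temporarily.
Failover or Region Switch: If the account is multi-region, change the failover priority under Global distribution to another region (e.g., East US 2). For new resources, deploy in a different region such as West Europe.
Optimize for CPU: Add indexes for frequent queries; use autoscale instead of fixed throughput (the lowest autoscale maximum is 1000 RU/s, which scales between 100 and 1000 RU/s). Monitor request unit consumption, since throttled requests and their retries can add load.
Escalate: Open a support ticket: Help + support > New request > Technical > Cosmos DB > Scaling. Choose the severity that matches your business impact (Severity C is the lowest, for minimal impact); include the account ID and the upgrade timestamp. In my experience these are often resolved within 1-2 hours once support force-completes the operation on the backend.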
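
For the autoscale suggestion above, here is a rough sketch of what the switch could look like from the CLI. It assumes an RU-based account where throughput is provisioned at the database level; for collection-level throughput the analogous az cosmosdb mongodb collection throughput commands would apply instead. The function names are hypothetical helpers, and these calls may be rejected while the account is still stuck in "Updating".

```shell
# Hypothetical helper: print the database's current throughput settings.
# Args: resource group, account name, database name.
show_db_throughput() {
  az cosmosdb mongodb database throughput show \
    --resource-group "$1" --account-name "$2" --name "$3" \
    --output json
}

# Hypothetical helper: migrate a database from manual to autoscale throughput,
# then set the autoscale maximum (1000 RU/s is the lowest allowed maximum).
# Args: resource group, account name, database name, max RU/s (default 1000).
switch_db_to_autoscale() {
  max_ru="${4:-1000}"
  az cosmosdb mongodb database throughput migrate \
    --resource-group "$1" --account-name "$2" --name "$3" \
    --throughput-type autoscale &&
  az cosmosdb mongodb database throughput update \
    --resource-group "$1" --account-name "$2" --name "$3" \
    --max-throughput "$max_ru"
}
```

Running show_db_throughput <rg> <account> <db> before and after the switch shows whether the change actually took effect.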

Workaround: If this is urgent, export the data with mongodump and restore it into a new account. Check status.azure.com for ongoing outages.
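
The export workaround could be sketched roughly as follows, assuming the MongoDB Database Tools (mongodump and mongorestore) are installed and that the old account still accepts reads. The function name copy_database and the default dump directory are illustrative; the connection strings come from each account's Connection strings page and should not include a database name, since the database is passed separately.

```shell
# Copy one database from the stuck account into a new account.
# Args: source connection string, target connection string,
#       database name, dump directory (default ./dump).
copy_database() {
  src_uri="$1"; dst_uri="$2"; db="$3"; dump_dir="${4:-./dump}"
  # Dump only the named database from the source account...
  mongodump --uri="$src_uri" --db="$db" --out="$dump_dir" &&
  # ...then restore only that database's namespaces into the target account.
  mongorestore --uri="$dst_uri" --nsInclude="$db.*" "$dump_dir"
}
```

Restoring at full speed can itself hit throttling on an RU-based target, so it may help to provision the new account generously for the import and scale it down afterwards.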

Best Regards,

Jerald Felix

Answer 2

William Souza 0

Hi @Jerald Felix MCT

Thank you for your reply. In the end, we fixed it by restoring a backup to a new account. We already deleted the old resources, but they were still stuck as 'updating'.

Kalyani Kondavaradala 3,310 Reputation points Microsoft External Staff Moderator

2025-10-28T13:11:25.98+00:00
Hi William Souza,

Good to know that the workaround got your database back. Could you please check once more and confirm whether your old server is in a dropping state or still in the Updating state?

az cosmosdb show --name <account-name> --resource-group <resource-group>

Could you also confirm which region your server is in?

Thanks!

Kalyani
Kalyani Kondavaradala 3,310 Reputation points Microsoft External Staff Moderator

2025-10-29T09:55:16.2066667+00:00

Hi William Souza,

Just checking in: could you please confirm the details requested in the comments section? This will help us better understand your scenario and provide the most appropriate solution for your issue.

Looking forward to your response.

Thanks,

Kalyani
