Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Find out about recent changes in the Azure Well-Architected Framework.
September 2025
New articles
- Develop a disaster recovery plan for multi-region deployments: Find recommendations for building a disaster recovery (DR) plan for multi-region Azure deployments. This guide explains key terminology, how to classify workloads by criticality, and how to align recovery strategies with business impact. It covers DR planning essentials like communication plans, runbooks, and escalation paths, and provides practical advice for optimizing recovery costs. Step-by-step actions and validation methods are included for backup and restore, active-passive (cold and warm standby), and active-active deployments. Regular testing and continuous improvement are emphasized to ensure business continuity.
- Architecture best practices for Azure Databricks: Find recommendations for designing and operating Azure Databricks workloads by using Well-Architected Framework principles. Learn how to apply reliability, security, cost optimization, operational excellence, and performance efficiency to Spark, Delta Lake, Unity Catalog, and MLflow solutions.
- Architecture strategies for using availability zones and regions: Learn how to choose between deploying workloads across Azure availability zones or regions to meet reliability, resiliency, cost, and performance goals. This guide explains the differences between locally redundant, zonal (pinned), zone-redundant, and multi-region deployments, and describes the trade-offs for each approach. It provides practical recommendations for aligning deployment strategies with business requirements like risk tolerance, data residency, user location, budget, and complexity. Example scenarios and guidance for combining multi-zone and multi-region architectures are included for mission-critical solutions.
Updated articles
- Architecture pattern for mission-critical workloads on Azure: We simplified the baseline architecture section by removing extra lines, card outlines, and clickable links on diagrams.
- Health modeling for workloads: We added a section about Azure Monitor health models. It describes how they simplify health modeling with built-in alerting, visualizations, and easy integration. We also included a screenshot and updated related links to reference Azure Monitor health models.
- Reliability design principles: We clarified workload scope and team commitments, added clear instructions for requirements and solution boundaries, highlighted reliability for user flows, emphasized time horizons and dependencies, and fixed wording in the resilience section.
- Test and evaluate AI workloads on Azure: We refreshed the guidance for testing and evaluating AI workloads, added new recommendations for validation methods, and clarified best practices for continuous improvement.
- Architecture strategies for designing for redundancy: We updated headings in the redundancy documentation to improve clarity and consistency. We also expanded the redundancy guidance, added new recommendations for zone and region-level redundancy, and clarified trade-offs for different approaches.
- Architecture best practices for Azure Machine Learning: We refreshed the guidance for Azure Machine Learning, added new recommendations for reliability, security, and cost optimization, and improved tips for operational excellence.
- Architecture strategies for designing and creating a monitoring system: We expanded the observability guidance with new recommendations for network monitoring, alerting, and diagnostics, and clarified best practices for operational excellence.
- Architecture strategies for designing a reliable monitoring and alerting strategy: We refreshed the monitoring and alerting strategy guidance, added new recommendations for monitoring network traffic to improve reliability, and expanded best practices for using Azure tools in incident response.
- Architecture best practices for Azure Files: We revised the Azure Files guide to clarify terminology, update redundancy and billing model details, add guidance for SSD file shares and metadata caching, and include total cost of ownership (TCO) resources. We also added guidance about using the Azure File Sync Arc extension for hybrid environments.
Azure feature updates
This month, we incorporated newly released Azure features from the Azure updates feed into our guidance. The most significant examples are highlighted below.
- Architecture best practices for Azure App Service (Web Apps): Added IPv6 support considerations for scaling and clarified related network protocol guidance.
- Architecture best practices for Azure Firewall: Incorporated guidance for Resource Health monitoring, explicit proxy configuration, customer-controlled maintenance, and change tracking capabilities.
- Architecture best practices for Azure Application Gateway v2: Integrated new AI-powered threat analysis and response capabilities.
- Architecture best practices for Azure Virtual Network: Added recommendations for centralized IP address management and integrated diagnostics with Azure Network Watcher VM Network Troubleshooter.
- Architecture best practices for Azure SQL Database: Updated zone redundancy guidance to reflect improved physical separation features for high availability.
- Architecture best practices for Azure Traffic Manager: Enhanced health probing and failover practices based on recent platform improvements.
- Architecture best practices for Azure Event Hubs: Added support for geo-replication and custom dead-letter queues.
Retired articles
We retired the following articles this month. The content was outdated and no longer aligned with the Azure Well-Architected Framework.
- Azure OpenAI Service service guide
- Azure Cache for Redis service guide
August 2025
Updated articles
- Workload team personas for AI workloads: We have updated this article to reflect the latest advances in generative AI, with a focus on agentic solutions. We added a section about agentic personas to introduce the agentic personas and their role in AI workloads. We also expanded the example personas to include both human and automated agent roles in development and operations. The persona templates and examples now reflect dynamic, cross-system, and just-in-time access needs, emphasizing governance, accountability, and the unique requirements of agentic systems.
- Architecture best practices for Log Analytics: We expanded and restructured the guidance for Log Analytics workspaces and introduced detailed recommendations, checklists, and best practices for the Well-Architected Framework pillars.
- Performance Efficiency design principles: We made the guidance clearer and more actionable. We simplified technical language and streamlined recommendations to help teams align performance goals with business needs. These changes make it easier to plan, build, and maintain systems that perform reliably over time.
- Architecture design diagrams: We updated the guidance about diagramming practices and architecture diagram types by adding new recommendations for clarity, accessibility, version control, and layered visuals. We also introduced new diagram categories and provided more detailed descriptions of diagram purposes and best practices.
- Architecture strategies for implementing automation: We added Azure tools that you can use to automate tasks for your workload. Learn about automated management capabilities for networking services, including Azure Firewall customer-controlled maintenance, Azure Firewall fully qualified domain name (FQDN) filtering in destination network address translation (DNAT) rules, and Azure Front Door managed certificates.
- Architecture strategies for networking and connectivity: We added Azure Network Security Perimeter to the list of Azure services that you can use to add defense-in-depth capabilities to your network.
- Architecture strategies for building a segmentation strategy: We added another network segmentation pattern: PaaS isolation. We recommend using Azure Network Security Perimeter with this pattern.
- Architecture best practices for Azure Firewall: We added Microsoft Security Copilot as a tool for threat investigation and analysis and information about ingestion-time transformation in Log Analytics to help you reduce costs. We also added configuration recommendations for the Operational Excellence pillar.
- Architecture best practices for Azure Front Door: We made changes to emphasize the use of AI-powered security capabilities, including Security Copilot integration for web application firewall event analysis. We also added guidance about managed wildcard Transport Layer Security (TLS) certificates.
- Architecture best practices for Azure Kubernetes Service (AKS): We added guidance for AKS clusters, including recommendations for HTTP proxy support, custom certificate authority integration, and Azure CNI static block allocation to improve compliance, security, and network management.
- Architecture best practices for Azure Virtual Network: We added recommendations for using Network Security Perimeter for PaaS service isolation, centralized IP address management with Azure Virtual Network Manager, and integrated diagnostics with Network Watcher VM Network Troubleshooter.
- Architecture best practices for Azure Application Gateway v2: We removed the specific examples and feature comparisons between Azure Front Door and Application Gateway. The guidance now focuses on using WAF policies and locking down Application Gateway to receive traffic only from Azure Front Door. We also clarified language and updated references in the guide. We improved section headings and checklists to make guidance more accurate and easier to follow.
- Architecture best practices for Azure SQL Database: We added sharding guidance, clarified zone redundancy setup, and introduced automated backup recommendations to strengthen reliability and recovery for Azure SQL Database. We also improved cost optimization and performance recommendations, added advice to use native SQL functions, and noted the latest Azure updates.
- Architecture best practices for Azure Disk Storage: We added a section that describes the design trade-offs that you might have to make if you use the approaches in the pillar checklists, including guidance about performance versus cost for Azure Premium SSD, Azure Ultra Disk Storage, Azure Standard SSD, and Azure Standard HDD. We also added recommendations for just-in-time capacity provisioning and dynamic disk expansion without downtime.
- Complete an Azure Well-Architected Review assessment: We added a tip about selecting Core Well-Architected Review in the Azure Well-Architected Review assessment. We also added a section about specialized Well-Architected review assessments for specific technologies and workloads.
- Azure Well-Architected Framework workloads: We expanded the definition of workloads to include custom code and AI models, emphasized architectural practices like decomposing workloads and addressing technical debt, and added sections about the organization of workload teams, dependencies, budgeting, and continuous improvement. We also added guidance about shared responsibilities and governance within cloud environments.
July 2025
Updated articles
- Responsible AI in Azure workloads: We refreshed this article with new guidance for agentic AI systems, including safeguards for retrieval and autonomous agents, auditability, and human oversight. Content safety recommendations now include watermarking, metadata tagging, and clear disclosure of AI-generated media. Ethical updates emphasize the ability to contest AI decisions and ensure transparency in system changes. These additions support more responsible and secure deployment of advanced AI solutions.
- Architecture best practices for Azure Local: We updated this guidance to reflect the latest platform version (2311), refreshed documentation links, clarified pricing details, clarified role definitions, and simplified terminology. We also improved how we describe security, monitoring, and licensing features to make the guidance easier to follow and more relevant to current deployments.
- Application design for AI workloads on Azure We added a link to common AI agent orchestration patterns to help teams explore proven strategies before designing their own.
June 2025
Maturity models
This month, we introduced maturity models for the Azure Well-Architected Framework. Maturity models help you assess your current state and identify areas for improvement across the five pillars of the framework. Each model provides a structured approach to evaluate your workload's architecture and operations, enabling you to prioritize enhancements and track progress over time.
- Reliability maturity model
- Security maturity model
- Cost Optimization maturity model
- Operational Excellence maturity model
- Performance Efficiency maturity model
- Assessment
New article
- Sustainable design for AI workloads on Azure: Find guidance about how to build AI workloads sustainably by incorporating environmental considerations into model design, data design, and operations phases. Learn how to reduce energy consumption and carbon footprint while maintaining performance and business requirements for AI workloads on Azure.
Updated articles
- Design methodology for mission-critical workloads on Azure: We refreshed this article by restructuring the design methodology to focus on core design fundamentals through a principle-based approach. The article now emphasizes designing for reliability objectives, end-to-end automation, zero-downtime deployments, fast failure detection and recovery, and evolving with Azure. We streamlined the content to be more concise and actionable while maintaining focus on essential design principles.
- Mission-critical workloads: We restructured the mission-critical workloads overview article to improve user experience by adding a new How to use this guidance? section that provides step-by-step navigation instructions. We reorganized the content to improve the information hierarchy, moving from a reference-style presentation to a guided learning approach that walks users through the methodology, principles, and design areas systematically. We also renamed the Illustrative examples section to Reference architecture examples and added video content.
- Design principles of a mission-critical workload: We updated the Next step section. Both the link destination and link text changed from "Cross-cutting concerns" to "Architecture pattern" while keeping the same descriptive text about reviewing cross-cutting concerns for mission-critical workloads.
- Grounding data design for AI workloads on Azure: We refreshed our guidance on grounding data by making it clear that data can come from various sources, such as databases with vector indexes and external systems, not only traditional indexes. The updates reflect the benefits of larger context windows in newer models, clarifies previous terminology around fine-tuning data, and highlights the importance of validating grounding data through real-world queries. Other improvements include updated guidance about security trimming, support for multi-media embeddings, and new considerations for agentic solutions.
May 2025
Updated articles
- Recommendations for designing a reliable scaling strategy: Explore updated content including: Choosing the right technology for scaling; Automating scaling operations, including use of Infrastructure-as-Code; Selecting and optimizing "scale units"; Scaling data stores using sharding and partitioning, and optimizing partition strategies; Monitoring scaling operations and log analysis.
- Design a data partitioning strategy: This article can now be found under "Design guides."
- Cost Optimization design principles: Find actionable recommendations, such as treating different environments differently, using dynamic scaling, and collaborating with licensing teams. We added guidance on governance and cost guardrails and expanded examples and practical steps for budgeting, rate optimization, and maximizing resource utilization.
- Operational Excellence cloud design principles: We refreshed this article, consolidating guidance and updating safe deployment practices.
Service guides
This month, we made significant updates to some of our service guides. Here are the highlights:
- Architecture Best Practices for Azure Database for PostgreSQL: Find recommendations for features like high availability, geo-redundant backup, private networking, Microsoft Entra ID integration, cost modeling, automation, monitoring, and intelligent performance tuning.
- Architecture Best Practices for Azure Cosmos DB for NoSQL: Explore guidance updated to reflect current Azure Cosmos DB best practices, including new features, policy links, and recommendations for using Microsoft Entra ID, private endpoints, Azure Policy, and Azure Advisor.
April 2025
New articles
- Architecture Best Practices for Azure Container Apps: Explore key recommendations and design checklists for implementing Azure Container apps effectively and securely. This guide covers design principles, strategies, and recommendations for achieving architectural goals, including security, performance, and cost optimization.
Updated articles
- Application Delivery Considerations for Azure Virtual Desktop Workloads: We made significant updates to refactor and refresh the guidance about Azure Virtual Desktop including updated recommendations and best practices.
- Recommendations for designing a disaster recovery strategy: We added a new section on Azure Backup facilitation and updated the content to include new Azure Backup features.
Service guides
This month, we made significant updates to some of our service guides. Here are the highlights:
- Architecture Best Practices for Azure API Management: We refactored the guidance for Azure API Management to improve clarity and usability. The updated content includes refreshed design principles, strategies, and recommendations for achieving architectural goals.
- Architecture Best Practices for Azure NetApp Files: We added new recommendations for cost optimization and operational excellence. Explore the fully updated content including new recommendations for configuring Azure NetApp Files to protect your workloads.
- Architecture Best Practices for Azure SQL Database: We refreshed and expanded this guide to introduce new design considerations and recommendations, including guidance on using Azure SQL Database for secure data storage, managing database performance, and optimizing costs.
- Architecture Best Practices for Azure Virtual Machines and Scale Sets: We made updates to the guidance about Azure Virtual Machines and Scale Sets. Explore the fully updated content including new recommendations for configuring automatic recovery options to protect your workloads.
March 2025
Updated articles
- Architecture Best Practices for Azure Front Door: We added details on deployment strategies and the importance of caching static content to the design checklist. Recommendations were updated to include new links and advice on managing traffic, health probes, and optimizing caching.
- Design review checklist for Operational Excellence: We simplified the design review checklist for Operational Excellence to make the recommendations more concise, focused, and actionable.
February 2025
Updated articles
- Recommendations for self-healing and self-preservation: We updated this guide for easier readability by streamlining several sections. We also moved the guides for handling transient faults and developing background jobs into the design guide area to make the Reliability pillar easier to use.
- Design review checklist for Reliability: We simplified the design review checklist for Reliability to make the recommendations more concise, focused, and actionable.
Service guides
This month, we made significant updates to some of our service guides. Here are the highlights:
- Azure Front Door: We improved content clarity by rephrasing various sections and adding more detailed explanations. We updated links and references to other relevant Azure documentation and removed redundant or outdated information. We aligned content with the latest Azure Front Door features and best practices.
- Azure IoT Hub: We refreshed and expanded this guide to cover all Well-Architected pillars. We also introduced new design considerations and recommendations, including guidance on using Azure IoT Hub for secure device-to-cloud communication, managing device identities, and optimizing costs.
January 2025
Updated articles
- How to use the Azure Well-Architected Framework documentation: We added a new section on adopting a phased learning process, emphasizing the importance of improving workload quality iteratively.
Service guides
This month, we made significant updates to many of our service guides. Here are some highlights:
- Azure App Service (Web Apps): Added and updated several design considerations and recommendations, including guidance on redundancy, taking advantage of the App Service auto-heal feature, and leveraging the App Service Resiliency Score Report.
- Azure Functions: We refreshed and expanded this guide to cover all Well-Architected pillars. We also introduced new design considerations and recommendations, including guidance on the use of managed identities, network security controls, cost monitoring, CI/CD pipelines, and autoscaling.
- Azure Kubernetes Service (AKS): We refreshed this guide to improve the structure and presentation of the content, and to reflect the latest best practices for configuring Azure Kubernetes Service (AKS) to protect your workloads and data, optimize costs, and improve operational efficiency.
- Azure Load Balancer: We refreshed and expanded this guide to cover all Well-Architected pillars. We also introduced new design considerations and recommendations, including guidance on optimizing your load balancing configurations, using the Global tier to load balance across Azure regions, and how to improve operational efficiency.
- Azure Service Fabric: We refreshed this guide to improve the structure and presentation of the content and introduced new guidance to help you optimize your Service Fabric workloads.
- Azure Traffic Manager: We refreshed and expanded this guide to cover all Well-Architected pillars. We also introduced new design considerations and recommendations, including guidance on load balancing across regions, enhancing DNS security, and using diagnostic logs and traffic view dashboards to optimize and troubleshoot Traffic Manager profiles.
- Azure Virtual Network: We refreshed and expanded this guide to cover all Well-Architected pillars. We also introduced new design considerations and recommendations, including guidance on adding redundancy, segmenting networks for security, and monitoring traffic patterns. Use infrastructure as code in your deployments, optimize network traffic, and leverage Azure Network Watcher for performance insights.
December 2024
New articles
- Azure Well-Architected Framework Perspective on Azure Disk Storage: Find new design considerations and configuration recommendations for optimizing Azure Disk Storage within the Azure Well-Architected Framework. Explore guidance on key areas such as reliability, security, cost optimization, operational excellence, and performance efficiency, as well as strategies and best practices to enhance storage management for Azure Virtual Machines.
Azure IoT Hub workload guidance retirement
- This month, we announced the deprecation of Azure IoT Hub workload documentation in the repo. The content was outdated and no longer aligned to the Azure Well-Architected Framework.
November 2024
New articles
- Well-Architected Framework Perspective on Azure Monitor Application Insights: Explore design considerations and configuration recommendations for Azure Monitor Application Insights. Azure Monitor Application Insights is an extensible Application Performance Management (APM) service that helps you monitor the performance and usage of your live web applications. It provides real-time insights into your application's performance and user behavior, enabling you to detect and diagnose issues and understand what users actually do with your app.
New workload: AI on Azure
This month, we introduced new guidance for designing AI workloads on Azure. This documentation is appropriate for roles that are accountable for designing, building, and maintaining a solution for running AI workloads in a cloud environment. Use the AI Workloads on Azure documentation as your go-to resource to build and optimize AI solutions on Azure.
- AI Workloads on Azure
- Design Methodology for AI Workloads on Azure
- Design Principles for AI Workloads on Azure
- Application Design for AI Workloads on Azure
- Application Platform for AI Workloads on Azure
- Design Training Data for AI Workloads on Azure
- Grounding Data Design for AI Workloads on Azure
- Data Platform for AI Workloads on Azure
- MLOps and GenAIOps for AI Workloads on Azure
- AI Workload Operations on Azure
- Test and Evaluate AI Workloads on Azure
- Responsible AI in Azure Workloads
- Workload Team Personas Involved in AI Workloads
- AI Workload Assessment
New workload: Software as a service (SaaS) on Azure
This month, we added a new workload for SaaS on Azure. This documentation provides actionable and authoritative guidance that applies Well-Architected best practices as the technical foundation for building and operating a SaaS solution on Azure at-scale. Use the SaaS Workloads on Azure documentation to build scalable, performant, reliable, and secure SaaS solutions.
- SaaS Workloads
- Design Methodology for SaaS Workloads on Azure
- Design Principles of SaaS Workloads on Azure
- Billing and Cost Management for SaaS Workloads on Azure
- Governance for SaaS Workloads on Azure
- Resource Organization for SaaS Workloads on Azure
- Identity and Access Management for SaaS Workloads on Azure
- Compute for SaaS Workloads on Azure
- Networking for SaaS Workloads on Azure
- Data for SaaS Workloads on Azure
- DevOps Practices for SaaS Workloads on Azure
- Incident Management for SaaS Workloads on Azure
- Assessment Review Tool for SaaS Workloads on Azure
October 2024
Updated articles
Architecture decision record (ADR): We refreshed the guidance on what an ADR should include, including consistent elements like problem statements, options considered, and decision outcomes. Explore updates including a new section on suggested characteristics of an individual record with guidelines for maintaining consistent and useful ADRs.
Azure Well-Architected Framework perspective on Azure Application Gateway v2: We made significant updates to the guidance about Azure Application Gateway v2. Find important notes and links to additional resources for Azure Application Gateway configurations. Explore enhanced content with specific design principles, strategies, and recommendations for achieving architectural goals.
Azure Well-Architected Framework perspective on Azure ExpressRoute: We made significant updates to provide more comprehensive and structured guidance on Azure ExpressRoute, enhancing and expanding upon best practices, design principles, and optimization strategies. The detailed checklists and recommendations help in better planning and implementation, ensuring improved reliability, security, and cost efficiency.
We reviewed all tradeoff and design pattern articles for alignment with the content structure and to ensure that the guidance is up to date. Tradeoffs are an essential part of the Well-Architected Framework, as they help you understand the implications of design decisions on other pillars. Design patterns are reusable solutions to common problems that you might encounter when designing a workload. They help you understand how to design your workload to meet the goals of the Well-Architected Framework. Check out the updated articles:
- Architecture design patterns that support cost optimization
- Cost Optimization tradeoffs
- Architecture design patterns that support reliability
- Reliability tradeoffs
- Architecture design patterns that support security
- Security tradeoffs
- Architecture design patterns that support operational excellence
- Operational Excellence tradeoffs
- Architecture design patterns that support performance efficiency
- Performance Efficiency tradeoffs