Edit

Share via


Refactor mainframe applications with Astadia

Azure ExpressRoute
Azure Bastion
Azure Load Balancer
Azure Private Link
Azure Site Recovery

Astadia’s automated COBOL refactoring solution delivers cloud-enabled applications and databases that do the same things as their legacy counterparts. The refactored applications run as Azure applications in virtual machines provided by Azure Virtual Machines. Azure ExpressRoute makes them available to users, and Azure Load Balancer distributes the load.

Mainframe architecture

Here's a mainframe architecture that represents the kind of system that's suitable for the Astadia refactoring solution.

Diagram for a mainframe architecture that's suitable for Astadia refactoring.

Download a Visio file of this architecture.

Dataflow

  1. TN3270 and HTTP(S) user input arrives over TCP/IP.
  2. Mainframe input uses standard mainframe protocols.
  3. There are batch and online applications.
  4. Applications written in COBOL, PL/I, Assembler, and other languages run in an enabled environment.
  5. Data is held in files and in hierarchical, network, and relational databases.
  6. Commonly used services include program execution, I/O operations, error detection, and protection within the environment.
  7. Middleware and utility services manage tape storage, queueing, output, and web activity.
  8. Each operating system runs in its own partition.
  9. Partitions segregate different workloads or work types.

Azure architecture

Here's an Azure architecture to replace the mainframe functionality with refactored applications.

Architecture diagram for an Astadia refactoring solution.

Download a Visio file of this architecture.

Dataflow

  1. Input comes from remote clients and other users via ExpressRoute. TCP/IP is the primary way to connect to the system.
    • On-premises users access web-based applications over Transport Layer Security (TLS) port 443. The user interfaces stay the same to minimize end user retraining.
    • On-premises administrative access uses Azure Bastion hosts.
    • Azure users connect to the system via virtual network peering.
  2. Load Balancer manages access to the application compute clusters. Load Balancer supports scale-out compute resources to handle input. It operates at level-7, application level, or level-4, network level, depending on the application input.
  3. Astadia runtime libraries run refactored applications on Azure Virtual Machines. Compute resources use Azure Premium SSD or Azure Ultra Disk Storage managed disks with accelerated networking.
  4. Data services in the application clusters support multiple connections to persistent data sources. Azure Private Link provides private connectivity from inside the virtual network to Azure services. Data sources include data services such as Azure SQL Database and Azure PostgreSQL.
  5. Data storage is local-redundant or geo-redundant, depending on usage. It's a mixture of:
    • High-performance storage:
      • Premium SSD
      • Ultra Disk Storage
    • Azure Standard SSD, including blob, archive, and backup storage
  6. Azure data services provide scalable and highly available data storage that compute clusters share. The storage can be geo-redundant.
    • Azure Blob Storage serves as a landing zone for data from external data sources.
    • Azure Data Factory ingests data and synchronizes multiple Azure and external data sources.
  7. Azure Site Recovery provides disaster recovery for virtual machines (VMs) and container cluster components.
  8. Services like Microsoft Entra ID, Azure Networking, Azure DevOps, Azure Stream Analytics, Azure Databricks, GitHub, and Power BI are easily integrated with the modernized system.

Components

  • Azure Bastion is a fully managed service that provides Remote Desktop Protocol (RDP) or Secure Shell (SSH) connectivity to virtual network VMs from the Azure portal over TLS. Azure Bastion maximizes administrative access security by minimizing open ports. In this architecture, Azure Bastion provides secure administrative access to the VMs that host the refactored COBOL applications.

  • Azure Data Factory is an extract, transfer, and load (ETL) service for scale-out serverless data integration and data transformation. It provides a code-free UI for intuitive authoring and single-pane-of-glass monitoring and management. In this architecture, Data Factory provides data integration and transformation capabilities for migrating and processing mainframe data in the refactored solution.

  • Azure Storage is scalable, secure cloud storage for all your data, applications, and workloads. In this architecture, Azure Storage provides various storage options to support the data requirements of the refactored mainframe applications.

    • Azure Disk Storage is high-performance, durable block storage for business-critical applications. Azure managed disks are block-level storage volumes that are managed by Azure on VMs. The available types of disks are Ultra Disk Storage, Premium SSD, and Standard SSD. In this architecture, Azure Disk Storage provides high-performance storage for the refactored applications by using either Premium SSD or Ultra Disk Storage.

    • Azure Files is a fully managed file share service that provides fully managed file shares in the cloud that are accessed via the industry standard Server Message Block (SMB) protocol. Cloud and on-premises Windows, Linux, and macOS deployments share access by mounting file shares concurrently. In this architecture, Azure Files provides shared file storage that can be accessed by multiple instances of the refactored applications.

    • Azure NetApp Files is an enterprise-grade file share service that provides enterprise-grade Azure file shares that are powered by NetApp. Azure NetApp Files makes it easy for enterprises to migrate and run complex, file-based applications without changing code. In this architecture, Azure NetApp Files provides high-performance file storage for enterprise-grade refactored mainframe applications that require advanced file system capabilities.

    • Blob Storage is scalable and secure object storage for archives, data lakes, high-performance computing, machine learning, and cloud-native workloads. In this architecture, Blob Storage provides object storage for unstructured data and archival requirements of the refactored mainframe applications.

  • Azure Virtual Network is the fundamental building block of Azure private networks. VMs within virtual networks communicate securely with each other, with the internet, and with on-premises networks. A virtual network is like a traditional on-premises network, but with Azure infrastructure benefits like scalability, high availability, and isolation. In this architecture, Virtual Network provides the secure network foundation and connectivity for all components of the refactored mainframe system.

  • ExpressRoute is a connectivity service that extends on-premises networks into Azure over a private, dedicated fiber connection from a connectivity provider. ExpressRoute establishes connections to Microsoft cloud services like Azure and Microsoft 365. In this architecture, ExpressRoute provides secure, high-bandwidth connectivity between on-premises mainframe environments and the refactored applications that run on Azure.

  • Load Balancer is a network load balancer that distributes incoming traffic to the compute resource clusters. It uses configurable rules and other criteria to distribute the traffic. In this architecture, Load Balancer provides high availability and scalability for the refactored mainframe applications by distributing user traffic across multiple VM instances.

  • Private Link is a service that provides private connectivity from virtual networks to Azure services. Private Link simplifies network architecture and secures the connection between Azure endpoints by eliminating public internet exposure. In this architecture, Private Link enables secure, private connections between the refactored applications and Azure platform as a service (PaaS) services.

  • Azure has fully managed relational, NoSQL, and in-memory databases to fit modern application needs. Automated infrastructure management provides scalability, availability, and security. For an overview of the database types, see Types of Databases on Azure. In this architecture, Azure databases provide persistent data storage solutions to replace mainframe database systems.

    • Azure Cosmos DB is a fully managed, fast NoSQL database with open APIs for any scale. In this architecture, Azure Cosmos DB provides globally distributed NoSQL database services for refactored applications that require flexible, scalable data storage.

    • Azure Database for PostgreSQL is a fully managed database that's based on the open-source Postgres relational database engine. For applications that require greater scale and performance, the Hyperscale (Citus) deployment option scales queries across multiple machines by sharding them. In this architecture, Azure Database for PostgreSQL provides open-source relational database services for refactored applications that require PostgreSQL capabilities.

    • SQL Database is a fully managed database engine. SQL Database always runs on the latest stable version of SQL Server and a patched OS with high availability. Built-in database management capabilities include upgrading, patching, backups, and monitoring. In this architecture, SQL Database provides managed relational database services for refactored applications that require SQL Server compatibility.

  • Site Recovery is a disaster recovery service that mirrors VMs to a secondary Azure region for quick failover and disaster recovery if an Azure datacenter fails. In this architecture, Site Recovery provides business continuity and disaster recovery capabilities for the refactored mainframe applications.

  • Virtual Machines is a compute service that provides many sizes and types of on-demand, scalable VMs. By using Virtual Machines, you get the flexibility of virtualization, and you don't have to buy and maintain physical hardware. In this architecture, Virtual Machines hosts the refactored COBOL applications and provides the compute infrastructure for running the modernized workloads.

Scenario details

There are important reasons why companies should replace their COBOL and mainframe systems:

  • Scarcity of domain experience: Developers who understand COBOL and mainframe technology are retiring, and few developers are trained to replace them. The talent pool gets steadily smaller and the costs and risks of relying on COBOL rise.
  • Limited flexibility: COBOL and the underlying systems that support it weren't designed for modern cloud-based applications. They're inflexible and hard to integrate.
  • Exorbitant costs: IBM mainframe hardware and software costs are high. Licensing and maintenance fees for ancillary mainframe applications and databases are rising.

There is a way forward for COBOL and mainframe systems. Astadia’s automated COBOL refactoring solution delivers cloud-enabled applications and databases that do the same things as their legacy counterparts. The refactored applications run as Azure applications in virtual machines provided by Azure Virtual Machines. Azure ExpressRoute makes them available to users, and Azure Load Balancer distributes the load.

Refactoring reduces costs and allows for deeper integration and for customization to meet business requirements. The hassles and costs of COBOL and the mainframe give way to a new world of quality and scalability that includes:

  • Automated testing and quality assurance.
  • Docker and Kubernetes for containerized deployment and orchestration.

The refactoring solution creates applications that:

  • Are functionally equivalent to their original counterparts.
  • Are written in your choice of Java or C#.
  • Follow object-oriented concepts and paradigms.
  • Are easy to maintain.
  • Perform as well as the applications they replace, or better.
  • Are cloud-ready.
  • Are delivered using a standard DevOps toolchain and best practices.

The refactoring process includes flow normalization, code restructuring, data layer extraction, data remodeling, and packaging for reconstruction. It identifies cloned code and replaces it with shared objects for simpler maintenance and manageability. The process also identifies and removes dead code by analyzing data and control dependencies.

Java and C# developers adapt refactored applications for cloud optimization by using standard DevOps tools and continuous integration and continuous delivery (CI/CD) concepts. Such tools and methods aren’t available for mainframe applications. Optimization delivers efficiencies and business benefits such as elasticity, granular service definition, and easy integration with cloud-native services.

Potential use cases

Automated refactoring is available for most COBOL dialects and platforms, including z/OS, OpenVMS, and VME. Candidates for using it include organizations seeking to:

  • Modernize infrastructure and escape the high costs, limitations, and rigidity of mainframe systems.
  • Avoid the risks of shortages of COBOL and mainframe developers.
  • Reduce operational costs and capital expenditures.
  • Move mainframe workloads to the cloud without the costs and risks of prolonged manual rewrites.
  • Migrate mission-critical applications to the cloud while maintaining continuity with other on-premises applications.
  • Make their systems horizontally and vertically scalable.
  • Implement disaster recovery techniques.

Considerations

These considerations implement the pillars of the Azure Well-Architected Framework, which is a set of guiding tenets that can be used to improve the quality of a workload. For more information, see Microsoft Azure Well-Architected Framework.

Reliability

Reliability ensures your application can meet the commitments you make to your customers. For more information, see Design review checklist for Reliability.

  • The architecture uses Site Recovery to mirror VMs to a secondary Azure region for quick failover and disaster recovery if an Azure datacenter fails.
  • The auto-failover groups feature of SQL Database provides data protection by managing database replication and failover to the secondary region. For more information, see Auto-failover groups overview and best practices (Azure SQL Database).
  • Resiliency is built into this solution by using Load Balancer. If one presentation or transaction server fails, other servers run the workloads.
  • We recommend that you create availability sets for your VMs to increase availability. For more information, see Availability sets overview.
  • We recommend that you use geo-replication to increase reliability. For more information, see Azure Storage redundancy.

Security

Security provides assurances against deliberate attacks and the abuse of your valuable data and systems. For more information, see Design review checklist for Security.

  • This solution uses an Azure network security group to manage traffic to and from Azure resources. For more information, see Network security groups.
  • Private Link for Azure SQL Database provides a private, direct connection that's isolated to the Azure networking backbone and that runs between VMs and SQL Database.
  • Azure Bastion maximizes admin access security by minimizing open ports. It provides secure and seamless RDP/SSH connectivity to virtual network VMs directly from the Azure portal over TLS.

Cost Optimization

Cost Optimization is about looking at ways to reduce unnecessary expenses and improve operational efficiencies. For more information, see Design review checklist for Cost Optimization.

  • Azure avoids unnecessary costs by identifying the correct number of resource types, analyzing spending over time, and scaling in advance to meet business needs without overspending.
  • Azure minimizes costs by running on VMs. You can turn off the VMs that aren't being used, and provide a schedule for known usage patterns. For more information about cost optimization for VMs, see Virtual Machines.
  • The VMs in this architecture use either Premium SSD or Ultra Disk Storage. For more information about disk options and pricing, see Managed Disks pricing.
  • SQL Database optimizes costs with serverless compute and Hyperscale storage resources that automatically scale. For more information about SQL Database options and pricing, see Azure SQL Database pricing.
  • Use the Pricing calculator to estimate costs for your implementation of this solution.

Operational Excellence

Operational Excellence covers the operations processes that deploy an application and keep it running in production. For more information, see Design review checklist for Operational Excellence.

Refactoring not only supports faster cloud adoption, but also promotes adoption of DevOps and agile development principles. You have full flexibility in development and production deployment options.

Performance Efficiency

Performance Efficiency is the ability of your workload to scale to meet the demands placed on it by users in an efficient manner. For more information, see Design review checklist for Performance Efficiency.

This solution supports deployment in containers, VMs, or Virtual Machine Scale Sets. Containers and Virtual Machine Scale Sets, unlike VMs, scale out and in rapidly. Shifting the unit of scaling to containers optimizes infrastructure utilization.

Contributors

This article is maintained by Microsoft. It was originally written by the following contributors.

Principal author:

Other contributor:

Next steps

  • For more information, contact legacy2azure@microsoft.com.

Azure

Astadia website