It sounds like you’re diving into creating a low-level design document for a batch and streaming data platform on Azure. That's quite the task! Here are some components and considerations you might want to keep in mind for each element of your design:
Components to Consider
Data Sources - Identify all data sources including IoT devices, databases, and external APIs. Ensure that you have mechanisms to validate data integrity and handle data source unavailability.
Data Ingestion:
- Batch: Use tools like Azure Data Factory for scheduled data extraction.
- Streaming: Use Azure Event Hubs (or IoT Hub) as the real-time ingestion endpoint, with Azure Stream Analytics or Spark consuming from it; see the producer sketch below.
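For the streaming path, a minimal producer sketch in Python, assuming the azure-eventhub package; the hub name and the connection-string environment variable are placeholders:

```python
# Minimal sketch: publishing events to Azure Event Hubs for streaming ingestion.
import json
import os

from azure.eventhub import EventHubProducerClient, EventData

producer = EventHubProducerClient.from_connection_string(
    conn_str=os.environ["EVENTHUB_CONNECTION_STRING"],  # placeholder variable
    eventhub_name="telemetry",  # placeholder hub name
)

with producer:
    batch = producer.create_batch()
    batch.add(EventData(json.dumps({"device_id": "sensor-01", "temp_c": 21.4})))
    producer.send_batch(batch)  # one network call for the whole batch
```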
Data Storage - Design your data lake or storage solution.
- Delta Lake: Works well for both batch and streaming and provides ACID transactions on top of your data lake.
- Blob Storage / ADLS Gen2: Use for storing raw data and as the landing zone for batch processing.
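A minimal PySpark sketch of landing raw batch data in Delta Lake, assuming a Databricks (or delta-spark) runtime; the abfss paths and storage account are placeholders:

```python
# Minimal sketch: writing raw batch data to Delta Lake on ADLS Gen2.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("raw-to-delta").getOrCreate()

raw_df = spark.read.json("abfss://raw@<storage-account>.dfs.core.windows.net/events/")

# Append with ACID guarantees; concurrent readers see a consistent snapshot.
(raw_df.write
    .format("delta")
    .mode("append")
    .save("abfss://curated@<storage-account>.dfs.core.windows.net/events_delta/"))
```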
Data Processing:
- Batch Processing - Use Azure Databricks with scheduled notebooks, or orchestrate transformations with Azure Data Factory.
- Streaming Processing - Use Azure Stream Analytics, or Spark Structured Streaming in Databricks, for real-time data processing (see the sketch below).
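A minimal Structured Streaming sketch reading newly landed files; the schema and path are illustrative, and on Databricks you might read from Event Hubs or Kafka instead:

```python
# Minimal sketch: Spark Structured Streaming over files landing in cloud storage.
from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, StringType, DoubleType, TimestampType

spark = SparkSession.builder.appName("stream-demo").getOrCreate()

schema = StructType([
    StructField("device_id", StringType()),
    StructField("temp_c", DoubleType()),
    StructField("event_time", TimestampType()),
])

stream_df = (spark.readStream
    .schema(schema)  # streaming file sources require an explicit schema
    .json("abfss://landing@<storage-account>.dfs.core.windows.net/events/"))
```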
Data Transformation - Incorporate ETL processes tailored to your architecture, covering both batch transformations (e.g., aggregations, joins) and streaming transformations (real-time filtering, windowed aggregations); both styles are sketched below.
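A sketch of both transformation styles, building on raw_df and stream_df from the snippets above; table, column, and path names are illustrative:

```python
# Minimal sketch of batch and streaming transformations.
from pyspark.sql import functions as F

# Batch: enrich events with reference data via a join, then aggregate.
devices_df = spark.read.format("delta").load("/delta/devices")  # placeholder path
daily_avg = (raw_df.join(devices_df, "device_id")
    .groupBy("site", F.to_date("event_time").alias("day"))
    .agg(F.avg("temp_c").alias("avg_temp")))

# Streaming: real-time filtering plus a windowed aggregation. The watermark
# lets Spark drop state for data more than 10 minutes late.
windowed = (stream_df
    .filter(F.col("temp_c").isNotNull())
    .withWatermark("event_time", "10 minutes")
    .groupBy(F.window("event_time", "5 minutes"), "device_id")
    .agg(F.avg("temp_c").alias("avg_temp")))
```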
Error Handling and Failure Management:
- Retries - Implement retry policies for transient errors.
- Dead Letter Queue - Route messages that repeatedly fail processing to a dead-letter queue so they can be reviewed later; a combined sketch follows.
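A hand-rolled sketch of both patterns; TransientError, process_message, and send_to_dead_letter_queue are hypothetical hooks you would wire to your own logic and queue (e.g., a Service Bus dead-letter queue or a quarantine container in storage):

```python
# Minimal sketch: retries with exponential backoff, then dead-lettering.
import time


class TransientError(Exception):
    """Raised by process_message for errors worth retrying (placeholder)."""


def process_with_retries(message, max_attempts=3, base_delay_s=1.0):
    for attempt in range(1, max_attempts + 1):
        try:
            return process_message(message)  # hypothetical processing hook
        except TransientError:
            if attempt == max_attempts:
                break
            time.sleep(base_delay_s * 2 ** (attempt - 1))  # 1s, 2s, 4s, ...
    # Retries exhausted: park the message for later review instead of losing it.
    send_to_dead_letter_queue(message)  # hypothetical dead-letter hook
```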
Monitoring and Alerts - Use Azure Monitor to track the health of your pipelines and set up alerts for failures.
Data Quality Checks - Add validations to ensure data integrity and accuracy, and integrate checkpoints into your processing flows (see the sketch below).
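A minimal sketch of such checkpoints on the batch DataFrame from earlier; the required columns and the 1% null threshold are illustrative:

```python
# Minimal sketch: checkpoint-style validations before downstream processing.
required_cols = {"device_id", "event_time", "temp_c"}
missing = required_cols - set(raw_df.columns)
if missing:
    raise ValueError(f"Schema check failed, missing columns: {missing}")

total = raw_df.count()
null_ids = raw_df.filter(F.col("device_id").isNull()).count()
if total == 0 or null_ids / total > 0.01:  # allow at most 1% null keys
    raise ValueError(f"Quality check failed: {null_ids}/{total} rows lack device_id")
```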
Performance Optimization - Scale your Spark clusters based on workload, and use partitioning strategies to improve performance for both batch and streaming jobs (a partitioned write is sketched below).
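For example, partitioning Delta output by date lets both batch and streaming readers prune irrelevant files; the partition column and path here are illustrative:

```python
# Minimal sketch: date-partitioned Delta writes for partition pruning.
(raw_df.withColumn("event_date", F.to_date("event_time"))
    .write
    .format("delta")
    .mode("append")
    .partitionBy("event_date")  # queries filtering on event_date skip other partitions
    .save("/delta/events_partitioned"))
```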
Security and Compliance - Implement access controls and encryption for data at rest and in transit. Ensure compliance with the data handling regulations relevant to your business.
Documentation and Training - Make sure to document all components and provide guidelines for those who will maintain this architecture.
Restartability
For restartability, your system should:
- Maintain stateful information in workflows so that, on failure, processing resumes without data loss (e.g., checkpointing in Spark Structured Streaming; see the sketch after this list).
- Record the offsets of the last processed records in message-based systems so data can be replayed when necessary.
- Design batch jobs to be idempotent, enabling safe retries.
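A sketch of the first and third points, assuming the streaming aggregation from earlier; the paths and the event_id key are illustrative:

```python
# Minimal sketch: checkpointed streaming plus an idempotent batch upsert.
from delta.tables import DeltaTable

# 1) Streaming: the checkpoint durably records offsets and state, so a
#    restarted query resumes from the last committed batch without data loss.
query = (windowed.writeStream
    .format("delta")
    .outputMode("append")
    .option("checkpointLocation", "/checkpoints/windowed_agg")  # placeholder path
    .start("/delta/windowed_agg"))

# 2) Batch: MERGE keyed on a unique id makes re-runs safe, since replayed
#    rows update in place instead of duplicating.
target = DeltaTable.forPath(spark, "/delta/events_curated")
(target.alias("t")
    .merge(raw_df.alias("s"), "t.event_id = s.event_id")
    .whenMatchedUpdateAll()
    .whenNotMatchedInsertAll()
    .execute())
```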
For more details, refer to:
- Azure Stream Analytics: Real-time event processing reference architecture
- Azure HDInsight highly available solution architecture case study
- Big data architectures
- Business continuity and disaster recovery for cloud-scale analytics
I hope this information helps.
Kindly consider upvoting the comment if the information provided is helpful. This can assist other community members in resolving similar issues.