Edit

Share via


Data persistence in Azure Cache for Redis

Important

Azure Cache for Redis announced its retirement timeline for all SKUs. We recommend moving your existing Azure Cache for Redis instances to Azure Managed Redis as soon as you can.

For more details about the retirement:

If an Azure Cache for Redis cache failure occurs, data loss is possible when nodes are down. Redis persistence allows you to persist the data stored in cache instances. If there's a hardware failure, the cache instance rehydrates with data from the persistence file when it comes back online.

This article describes Redis persistence, and how to configure and manage data persistence in your Premium and Enterprise-tier Azure Redis cache instances. The data persistence feature isn't available in Basic or Standard tiers, and is in preview in Enterprise and Enterprise Flash tiers.

The ability to persist data is an important way to boost the durability of a cache instance, because it stores all cache data in memory. Persistence should be a key part of your Azure Redis high availability and disaster recovery strategy.

Important

The data persistence functionality provides resilience for unexpected Redis node failures. Data persistence isn't a data backup or point in time recovery (PITR) feature. If corrupted data is written to the Redis instance, the corrupted data is also persisted. To make backups of your Redis instance, use the Export feature.

Important

If you're using persistence on the Premium tier, check to see if your storage account has soft delete enabled before using the data persistence feature. Using data persistence with soft delete causes high storage costs. For more information, see Should I enable soft delete?

Scope of availability

Tier Basic, Standard Premium Enterprise, Enterprise Flash
Available No Yes Yes (preview)

Types of Redis data persistence

Azure Redis offers two types of data persistence, the Redis database (RDB) format and the Append-only File (AOF) format.

  • RDB persistence persists a snapshot of your cache in a binary format and saves it in an Azure Storage account. You configure the backup frequency to determine how often to persist the snapshot. If a catastrophic event occurs that disables both the primary and replica cache, the cache reconstructs automatically using the most recent snapshot. For more information, see RDB advantages and RDB disadvantages.

  • AOF persistence saves every write operation to a log, and saves the log once per second to an Azure Storage account. If a catastrophic event occurs that disables both the primary and replica caches, the cache reconstructs automatically using the stored write operations. For more information, see AOF advantages and AOF disadvantages.

Requirements and limitations

  • Data persistence functionality provides resilience for unexpected Redis node failures. Data persistence isn't a data backup or PITR feature. If corrupted data is written to the Redis instance, the corrupted data also persists. To back up your Redis instance, use the Export feature.

  • Azure Cache for Redis persistence features are intended to restore data automatically to the same cache after data loss. You can't import persisted data files to a new or existing cache.

    • To move data across caches, use the Import and Export data features.

    • To generate any backups of data that can be added to a new cache, you can use automated scripts using PowerShell or Azure CLI that export data periodically.

  • Persistence isn't supported with caches that use passive geo-replication or active geo-replication.

  • On the Premium tier, data is persisted directly to an Azure Storage account that you own and manage.

  • The storage account for Premium-tier data persistence must be in the same region as the cache instance. However, you can use a storage account in a different subscription to persist data if you use managed identity to connect to the storage account.

  • It's best to disable the soft delete feature on the storage account you use for Premium-tier data persistence. Using data persistence with soft delete causes high storage costs. For more information, see Pricing and billing and Should I enable soft delete?

  • RDB files are backed up to storage in the form of page blobs. Page blobs aren't supported in storage accounts with Hierarchical Namespace (HNS) enabled, such as Azure Data Lake Storage Gen2, so persistence tends to fail in those storage accounts.

  • On the Premium tier, AOF persistence isn't supported with multiple replicas.

Data encryption

Because Redis persistence creates data at rest, it's important to encrypt this data. Encryption options vary based on the Azure Redis tier you use.

For the Premium tier, data streams directly from the cache instance to Azure Storage when persistence is initiated. Azure Storage automatically encrypts data when persisting it, but you can use several encryption methods, including Microsoft-managed keys (MMKs), customer-managed keys (CMKs), and customer-provided keys. For more information, see Azure Storage encryption for data at rest and Customer-managed keys for Azure Storage encryption.

Set up data persistence

You can use the Azure portal, Azure Resource Manager (ARM) templates, PowerShell, or Azure CLI to create and set up data persistence for Premium or Enterprise tier Azure Redis caches.

Prerequisites

  • To create and add persistence to Azure Redis caches, you need write access and permissions to create Premium or Enterprise-level caches in an Azure subscription.
  • For Premium-tier caches, you need an Azure Storage account in the same region as your cache to store the cache data. If you use managed identity as the authentication method, you can use a storage account in a different subscription than your cache.
  • For the Azure PowerShell procedures, you need Azure PowerShell installed, or use Azure Cloud Shell with the PowerShell environment in the Azure portal.
  • For the Azure CLI procedures, you need Azure CLI installed, or use Azure Cloud Shell with the Bash environment in the Azure portal.

Set up data persistence in the Azure portal

In the Azure portal, you can set up data persistence when you create your Azure Redis Premium or Enterprise-level cache instance.

Note

You can also add persistence to a previously created cache by navigating to Data persistence under Settings in the left navigation menu for your cache.

  1. To create a Premium cache in the Azure portal, follow the instructions at Quickstart: Create an open-source Redis cache, and select Premium for the Cache SKU on the Basics tab.

    Screenshot that shows a form to create an Azure Cache for Redis resource.

  2. When you fill out the Advanced tab, select either RDB or AOF persistence for Backup file under Data persistence, and configure the relevant settings.

    Screenshot showing the settings for RDB data persistence.

    • For RDB, configure these settings:

      Setting Value Description
      Authentication Method Select Managed Identity or Storage Key Using managed identity allows you to use a storage account in a different subscription than your cache.
      Subscription Select the subscription that contains your managed identity. This item appears only if you chose Managed Identity authentication.
      Backup Frequency Select a backup interval: 15 minutes, 30 minutes, 60 minutes, 6 hours, 12 hours, or 24 hours. This interval starts counting down after the previous backup operation successfully completes. When the interval elapses, a new backup starts.
      Storage Account Select your storage account. The storage account must be in the same region as the cache. A Premium storage account is recommended because it has higher throughput.
      Storage Key Select either the Primary key or the Secondary key to use. This item appears only if you chose Storage Key authentication. If the storage key for your persistence storage account is regenerated, you must reconfigure the key from the Storage Key dropdown.
    • For AOF, configure these settings:

      Setting Value Description
      Authentication Method Select Managed Identity or Storage Key Using managed identity allows you to use a storage account in a different subscription than your cache.
      Subscription Select the subscription that contains your managed identity. This item appears only if you chose Managed Identity authentication.
      First Storage Account Select your storage account. The storage account must be in the same region as the cache. A Premium storage account is recommended because it has higher throughput.
      First Storage Key Select either the Primary key or Secondary key to use. This item appears only if you chose Storage Key authentication. If the storage key is regenerated, you must reconfigure the key from the Storage Key dropdown list.
      Second Storage Account Optionally select a secondary storage account. If you configure a secondary storage account, the writes to the replica cache are persisted to this second storage account.
      Second Storage Key Choose either the Primary key or Secondary key to use. This item appears only if you chose Storage Key authentication. If the storage key is regenerated, you must reconfigure the key.
  3. Complete all the tabs and finish creating the cache by following the rest of the instructions at Quickstart: Create an open-source Redis cache.

With RDB persistence, the first backup starts once the backup frequency interval elapses.

With AOF persistence, write operations to the cache save to the named storage account or accounts. If there's a catastrophic failure that takes down both the primary and replica caches, the stored AOF log is used to rebuild the cache.

Set up data persistence using Azure PowerShell

You can use Azure PowerShell to set up data persistence when you create an Azure Redis Premium or Enterprise-tier cache, or to add persistence to a previously created cache.

You can use the New-AzRedisCache command to create a new Azure Redis Premium-tier cache that uses data persistence.

To update existing caches to use data persistence, run the Set-AzRedisCache command. For instructions, see Add persistence to an existing cache.

Set up data persistence using Azure CLI

You can use Azure CLI to set up data persistence when you create an Azure Redis Premium or Enterprise-tier cache, or to add persistence to a previously created cache.

You can use the az redis create command to create a new Premium-tier cache that uses data persistence. For example:

az redis create --location westus2 --name MyRedisCache --resource-group MyResourceGroup --sku Premium --vm-size p1 --redis-configuration @"config_rdb.json"

To update an existing cache, use the az redis update command. For example:

az redis update --name MyRedisCache --resource-group MyResourceGroup --set "redisConfiguration.rdb-storage-connection-string"="BlobEndpoint=https//..." "redisConfiguration.rdb-backup-enabled"="true" "redisConfiguration.rdb-backup-frequency"="15" "redisConfiguration.rdb-backup-max-snapshot-count"="1"

Persistence FAQ

This section contains answers to commonly asked questions about Azure Redis cache persistence.

RDB persistence

AOF persistence

Can I enable persistence on a previously created cache?

Yes, you can configure persistence at cache creation and on existing Premium, Enterprise, or Enterprise Flash caches.

Can I enable AOF and RDB persistence at the same time?

No, you can enable RDB or AOF, but not both at once.

How does persistence work with geo-replication?

Data persistence doesn't work with geo-replication enabled.

Which persistence model should I choose?

AOF persistence writes to a log once per second, while RDB persistence saves backups based on the configured backup interval. RDB persistence has less effect on throughput and performance than AOF persistence.

Choose AOF persistence if your primary goal is to minimize data loss and you can handle a lower throughput for your cache. Choose RDB persistence if you wish to maintain optimal throughput on your cache but still want a mechanism for data recovery.

For more information, see RDB advantages, RDB disadvantages, AOF advantages, and AOF disadvantages.

Does AOF persistence affect throughput, latency, or performance of my cache?

AOF persistence affects throughput. Because AOF runs on both the primary and replica process, you see higher CPU and Server Load for a cache with AOF persistence than on an identical cache without AOF persistence. AOF offers the best consistency with the data in memory because each write and delete is persisted with only a few seconds of delay. The tradeoff is that AOF is more compute intensive.

As long as CPU and Server Load are both less than 90%, there's a penalty on throughput, but the cache operates normally. Above 90% CPU and Server Load, the throughput penalty can get higher, and the latency of all commands processed by the cache increases. Latency increases because AOF persistence runs on both the primary and replica process, increasing the load on the node in use, and putting persistence on the critical path of data.

What happens if I scale to a different size and a backup is restored that was made before the scaling operation?

  • If you scaled to a larger size, there's no effect.
  • If you scaled to a smaller size, and you have a custom databases setting that's greater than the databases limit for your new size, data in those databases isn't restored. For more information, see Is my custom databases setting affected during scaling?
  • If you scaled to a smaller size, and there isn't enough room in the smaller size to hold all the data from the last backup, keys are evicted during the restore process. Typically, keys are evicted using the allkeys-lru eviction policy.

Can I use the same storage account for persistence across two different caches?

No, you must use different storage accounts. Each cache must have its own storage account to set up for persistence.

Important

Also use separate storage accounts for persistence and performing periodic export operations on a cache.

Am I charged for the storage being used in data persistence?

  • For Premium caches, you're charged for the storage used per the pricing model of the storage account.
  • For Enterprise and Enterprise Flash caches, the managed disk storage is included in the price and doesn't incur extra charges.

How frequently does RDB and AOF persistence write to my blobs, and should I enable soft delete?

RDB and AOF persistence can write to your storage blobs as frequently as every hour, every few minutes, or every second. Soft delete quickly becomes expensive with the typical data sizes of a cache that also performs write operations every second. Enabling soft delete on a storage account also means Azure Redis can't minimize storage costs by deleting the old backup data.

It's best to avoid enabling soft delete on storage accounts you use for Azure Redis Premium-tier data persistence. For more information on soft delete costs, see Pricing and billing.

Can I change the RDB backup frequency after I create the cache?

Yes, you can change the backup frequency for RDB persistence by using the Azure portal, Azure CLI, or Azure PowerShell.

Why is there more than 60 minutes between backups when I have an RDB backup frequency of 60 minutes?

The RDB persistence backup frequency interval doesn't start until the previous backup process completes successfully. If the backup frequency is 60 minutes and it takes a backup process 15 minutes to complete, the next backup doesn't start until 75 minutes after the start time of the previous backup.

What happens to the old RDB backups when a new backup is made?

All RDB persistence backups, except for the most recent one, are automatically deleted. This deletion might not happen immediately, but older backups aren't persisted indefinitely. If you're using the Premium tier for persistence, and soft delete is turned on for your storage account, the existing backups continue to reside in the soft delete state.

When should I use a second storage account?

Use a second storage account for AOF persistence when you expect to have higher than usual SET operations on the cache. Using the secondary storage account helps ensure your cache doesn't reach storage bandwidth limits. This option is available only for Premium-tier caches.

How can I remove the second storage account?

You can remove the AOF persistence secondary storage account by setting the second storage account to be the same as the first storage account. To change the settings for existing caches, select Data persistence under Settings on the left navigation menu of your cache page. To disable persistence entirely, select Disabled on the Data persistence page.

What is a rewrite and how does it affect my cache?

When an AOF file becomes large enough, a rewrite is automatically queued on the cache. The rewrite resizes the AOF file with the minimal set of operations needed to create the current data set.

During rewrites, you can expect to reach performance limits sooner, especially when dealing with large datasets. Rewrites occur less often as the AOF file becomes larger, but take a significant amount of time when they occur.

What should I expect when scaling a cache with AOF enabled?

If the AOF file at the time of scaling is large, expect the scale operation to take longer than usual, because it reloads the file after scaling finishes. Also see What happens if I scale to a different size and a backup is restored that was made before the scaling operation?

How is my AOF data organized in storage?

When you use the Premium tier, data stored in AOF files is divided into multiple page blobs per shard. By default, half of the blobs are saved in the primary storage account and half are saved in the secondary storage account. Splitting the data across multiple page blobs and two different storage accounts improves performance.

If the peak rate of writes to the cache isn't high, this extra performance might not be needed. In that case, the secondary storage account configuration can be removed, and all the AOF files stored in the single primary storage account. The following table displays how many total page blobs each pricing tier uses.

Premium tier Blobs
P1 8 per shard
P2 16 per shard
P3 32 per shard
P4 40 per shard

When clustering is enabled, each shard in the cache has its own set of page blobs, per the preceding table. For example, a P2 cache with three shards distributes its AOF file across 48 page blobs: Sixteen blobs per shard, with three shards.

After a rewrite, two sets of AOF files exist in storage. Rewrites occur in the background and append to the first set of files. SET operations sent to the cache during the rewrite append to the second set of files.

If there's a failure during a rewrite, a backup is temporarily stored. The backup is promptly deleted after the rewrite finishes. If soft delete is turned on for your storage account, the soft delete setting applies and existing backups continue to stay in the soft delete state.

Does having firewall exceptions on the storage account affect persistence?

Yes. For persistence in the Premium tier, using firewall settings on the storage account can prevent the persistence feature from working.

You can check for errors in persisting data by viewing the Errors metric. This metric indicates if the cache is unable to persist data due to firewall restrictions on the storage account or other problems.

To use data persistence with a storage account that has a firewall set up, use managed identity based authentication to connect to storage. Using managed identity adds the cache instance to the trusted services list, making firewall exceptions easier to apply. If you authorize to the storage account using a key instead of managed identity, having firewall exceptions on the storage account tends to break the persistence process.

Can I have AOF persistence enabled if I have more than one replica?

With the Premium tier, you can't use AOF persistence with multiple replicas. In the Enterprise and Enterprise Flash tiers, replica architecture is more complicated, but AOF persistence is supported when Enterprise caches are used in zone redundant deployments.

How do I check if soft delete is enabled on my storage account?

In the Azure portal, select the storage account your cache uses for persistence, and select Data protection under Data management in its left navigation menu. On the Data protection page, check whether Enable soft delete for blobs is enabled. For more information on soft delete in Azure storage accounts, see Enable soft delete for blobs.

Can I use a storage account in a different subscription from the one where my cache is located?

You can choose a storage account in a different subscription only if you use managed identity as the storage account authentication method.

Learn more about Azure Cache for Redis features.