Overview

Aeron Cluster Standby brings improved availability, disaster recovery, and load management to your Aeron applications. It provides elevated capabilities of resilience and redundancy to critical systems, ensuring high availability and fault tolerance at all times, hence minimizing the impact of failures or outages on daily operations.

Key Features

  • Improved Availability: Ensures high availability by having standby nodes ready to take over operations in case of a failure or planned downtime.
  • Disaster Recovery: Reduces time to recovery and increases overall system availability by leveraging live cluster streaming and background snapshots.
  • Load Management: Standby nodes can run behind the active cluster without creating backpressure or latency issues, enabling features like snapshotting and offloading of slower operations.
  • Flexibility: Allows configurations that balance bandwidth, costs, and resilience, with options to route all traffic through a single node or have all nodes receive the same data.

Technical Details

  • Replicated State Machine Model: Aeron Cluster is based on a replicated state machine model where active cluster nodes work together to provide consensus on log entries.
  • Standby Node Operations: Each standby cluster node processes every message, ensuring internal state consistency with the primary cluster as quickly as the logic and network allow.
  • Data Loss Minimization: Data loss is limited to information in transit at the time of failure.

Operational Considerations

  • Network Layout: Users must be familiar with the network layout, bandwidth, and link costs before deploying.
  • Availability: Available as an Aeron Premium feature.

Datasheet

For a detailed overview of Aeron Cluster Standby, download the datasheet: