
Follower fetching in Aiven for Apache Kafka®

Follower fetching in Aiven for Apache Kafka allows consumers to retrieve data from the nearest replica instead of always fetching from the partition leader. This feature optimizes data fetching by leveraging Apache Kafka's rack awareness, which treats each availability zone (AZ) as a rack.

Benefits

  1. Reduced network costs: Fetching data from the closest replica minimizes inter-zone data transfers, reducing costs.
  2. Lower latency: Fetching from a nearby replica reduces the time it takes to receive data, improving overall performance.
Note: Follower fetching is supported on AWS (Amazon Web Services) and Google Cloud.

How it works

Aiven for Apache Kafka uses rack awareness to optimize data fetching and maintain availability. Each availability zone (AZ) is treated as a rack.

[Figure: Follower fetching]

Rack awareness

Rack awareness distributes partition replicas across different racks, which in Aiven for Apache Kafka correspond to availability zones within a cloud region. This distribution keeps copies of the data in separate AZs for fault tolerance. Each Apache Kafka broker has a broker.rack setting corresponding to its specific AZ:

  • AWS: Uses AZ IDs such as use1-az1
  • Google Cloud: Uses AZ names such as europe-west1-b

Aiven for Apache Kafka automatically manages the broker.rack setting, eliminating the need for manual configuration.
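As an illustration of what rack awareness provides, the following minimal sketch groups a partition's replicas by rack and checks whether they span more than one AZ. The broker IDs, the rack values, and both function names are hypothetical; this is not how Apache Kafka or Aiven implements replica placement.

```python
from collections import defaultdict

# Hypothetical broker-to-rack mapping. On Aiven, broker.rack is set
# automatically; the IDs and AZ values here are made up for illustration.
BROKER_RACKS = {1: "use1-az1", 2: "use1-az2", 3: "use1-az4"}


def replicas_by_rack(replica_broker_ids, broker_racks):
    """Group a partition's replica broker IDs by the rack (AZ) they run in."""
    grouped = defaultdict(list)
    for broker_id in replica_broker_ids:
        grouped[broker_racks[broker_id]].append(broker_id)
    return dict(grouped)


def survives_rack_loss(replica_broker_ids, broker_racks):
    """Return True if the replicas are spread over more than one rack,
    so that losing any single rack still leaves at least one replica."""
    return len(replicas_by_rack(replica_broker_ids, broker_racks)) > 1
```

A partition whose replicas sit on brokers 1 and 2 spans two AZs in this mapping, so one copy remains available if either AZ fails.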

Follower fetching mechanism

Follower fetching builds on rack awareness to allow consumers to fetch data from the nearest replica. Apache Kafka consumers use the client.rack setting to specify their AZ, ensuring they fetch data from the closest replica when possible.
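The selection logic can be sketched roughly as follows. This is a simplified illustration, not the broker's actual code: the Replica type, its fields, and select_read_replica are hypothetical names, and the sketch assumes that a consumer is served by an in-sync replica in its own rack when one exists and by the leader otherwise.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Replica:
    """Hypothetical view of one replica of a partition."""
    broker_id: int
    rack: Optional[str]   # the broker's broker.rack value
    is_leader: bool
    in_sync: bool
    log_end_offset: int


def select_read_replica(replicas: List[Replica], client_rack: Optional[str]) -> Replica:
    """Pick the replica a consumer should fetch from (simplified)."""
    leader = next(r for r in replicas if r.is_leader)
    if not client_rack:
        # Without client.rack the consumer always reads from the leader.
        return leader
    same_rack = [r for r in replicas if r.in_sync and r.rack == client_rack]
    if not same_rack:
        # No usable replica in the consumer's AZ: fall back to the leader.
        return leader
    # Prefer the most caught-up replica in the consumer's rack.
    return max(same_rack, key=lambda r: r.log_end_offset)
```

The fallback to the leader is what the phrase "when possible" refers to: a consumer in an AZ without an in-sync replica still receives data, but across AZs.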

Configuration settings

  • broker.rack: Identifies the AZ where each Apache Kafka broker is deployed and guides replica placement across AZs. Brokers in the same AZ share the same broker.rack value, such as use1-az1. Aiven for Apache Kafka sets this value automatically, so no manual configuration is needed.

  • client.rack: Set this on the Apache Kafka consumer to the AZ where the consumer runs, for example use1-az1 on AWS or europe-west1-b on Google Cloud. When the value matches the broker.rack of a broker that holds a replica, the consumer fetches from that replica in its own AZ when one is available, instead of always fetching from the partition leader.
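A minimal sketch of a consumer configuration that sets client.rack is shown below. The property names bootstrap.servers, group.id, and client.rack are standard Apache Kafka consumer properties; the helper name build_consumer_config, the service address, and the group name are hypothetical, and the TLS or SASL settings a real service requires are omitted.

```python
def build_consumer_config(bootstrap_servers, group_id, client_rack):
    """Build a consumer configuration that opts in to follower fetching.

    client_rack must use the same format as broker.rack: an AZ ID such as
    use1-az1 on AWS, or an AZ name such as europe-west1-b on Google Cloud.
    """
    if not client_rack:
        raise ValueError("client_rack is required for follower fetching")
    return {
        "bootstrap.servers": bootstrap_servers,
        "group.id": group_id,
        "client.rack": client_rack,
    }


# Hypothetical service address and consumer group, for illustration only.
config = build_consumer_config(
    "kafka-example.aivencloud.com:12345", "orders-readers", "use1-az1"
)
```

The resulting dictionary can be passed to a client library that accepts Kafka property names. On AWS, note that the value is the AZ ID (use1-az1), not the AZ name, so that it matches broker.rack.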

Follower fetching in Kafka Connect and MirrorMaker 2

Aiven for Apache Kafka® Connect and Aiven for Apache Kafka® MirrorMaker 2 apply rack awareness when follower fetching is enabled on the Aiven for Apache Kafka service. Each service sets a rack value based on the node’s availability zone (AZ) so that traffic is routed to the closest replica.

Kafka Connect

Kafka Connect assigns a rack value based on the AZ where each node runs. All source connectors on that node inherit this value unless a connector specifies its own consumer rack configuration.

MirrorMaker 2

MirrorMaker 2 assigns a rack value based on the node’s AZ when follower_fetching_enabled=true in the service configuration. A custom rack_id in the integration configuration overrides the AZ-based value. Rack awareness is not applied to integrations that connect to external Kafka clusters.
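The precedence described above can be sketched as a small function. This is one reading of the rules, not Aiven's implementation: resolve_mm2_rack and its parameter names are hypothetical, and the sketch assumes that a custom rack_id takes effect only when follower fetching is enabled and the cluster is not external, which the text does not state explicitly.

```python
from typing import Optional


def resolve_mm2_rack(
    follower_fetching_enabled: bool,
    node_az: str,
    custom_rack_id: Optional[str] = None,
    external_cluster: bool = False,
) -> Optional[str]:
    """Return the rack value MirrorMaker 2 would use, or None if rack
    awareness is not applied (simplified, with assumptions noted below)."""
    if external_cluster:
        # Rack awareness is not applied to external Kafka clusters.
        return None
    if not follower_fetching_enabled:
        # Assumption: with follower fetching disabled, no rack is set.
        return None
    if custom_rack_id:
        # A custom rack_id overrides the AZ-based value.
        return custom_rack_id
    return node_az
```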
