You are doing complex calculations using a machine learning framework on records fetched from a Kafka topic. It takes more about 6 minutes to process a record batch, and the consumer enters rebalances even though it's still running. How can you improve this scenario?
Correct Answer:
A
Here, we need to change the setting max.poll.interval.ms (default 300000) to its double in order to tell Kafka a consumer should be considered dead if the consumer only if it hasn't called the .poll() method in 10 minutes instead of 5.
You are building a consumer application that processes events from a Kafka topic. What is the most important metric to monitor to ensure real-time processing?
Correct Answer:
B
This metric shows the current lag (number of messages behind the broker)
We would like to be in an at-most once consuming scenario. Which offset commit strategy would you recommend?
Correct Answer:
D
Here, we must commit the offsets right after receiving a batch from a call to .poll()
What client protocol is supported for the schema registry? (select two)
Correct Answer:
AB
clients can interact with the schema registry using the HTTP or HTTPS interface
Which actions will trigger partition rebalance for a consumer group? (select three)
Correct Answer:
ACD
Rebalance occurs when a new consumer is added, removed or consumer dies or paritions increased.