System Design: Distributed Message Queue (Kafka)
A distributed message queue decouples producers (services that generate events) from consumers (services that process them), enabling async communication, backpressure handling, and replay. Apache Kafka is the de facto standard — it handles trillions of messages per day at LinkedIn, Netflix, and Uber. This guide covers the design of a Kafka-like system from the ground up.
Requirements
Functional: Producers publish messages to named topics. Consumers subscribe to topics and receive messages. Messages are retained for a configurable duration (7 days by default). Messages can be replayed from any offset. At-least-once delivery guarantee.
Non-functional: 1 million messages/sec write throughput, sub-10ms publish latency, horizontal scalability, fault tolerance (survive broker failures), ordered messages within a partition.
Core Concepts
- Topic: Named stream of messages (e.g., “user.clicks”, “order.created”)
- Partition: A topic is divided into N ordered, immutable logs. Messages within a partition are totally ordered. Partitions enable parallelism.
- Offset: Sequential integer assigned to each message within a partition. Consumers track their position via offset.
- Producer: Writes messages to a partition (by key hash or round-robin). Receives ACK after message is durably written.
- Consumer Group: A group of consumers sharing a topic subscription. Each partition is assigned to exactly one consumer in the group. This gives parallel consumption without duplicate processing.
- Broker: A server that stores partitions and serves producers/consumers. Kafka cluster = multiple brokers.
Data Model
Topic: topic_name, num_partitions, retention_bytes, retention_ms
Partition: topic_name, partition_id, leader_broker_id, replica_broker_ids
Message: offset, key, value (bytes), timestamp, headers
ConsumerOffset: consumer_group, topic, partition_id, committed_offset
Write Path
Producer → Partition Selection → Leader Broker → Write to Segment File → Replicate to Follower Brokers → ACK to Producer (after N replicas confirm)
Partition selection: If the message has a key, partition = hash(key) % num_partitions, so the same key always lands on the same partition (an ordering guarantee for related messages, e.g., all events for user_id=123). If there is no key, the producer round-robins across partitions to maximize throughput.
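A minimal sketch of this selection logic. Kafka's default partitioner actually uses murmur2; the md5 digest here is just a deterministic stand-in (Python's built-in hash() is salted per process, so it would break the same-key guarantee across restarts):

```python
import hashlib
from itertools import count

_round_robin = count()  # process-local counter for unkeyed messages

def select_partition(key: bytes | None, num_partitions: int) -> int:
    if key is not None:
        # Keyed message: same key -> same partition (per-key ordering).
        digest = hashlib.md5(key).digest()
        return int.from_bytes(digest[:4], "big") % num_partitions
    # Unkeyed message: spread evenly across partitions for throughput.
    return next(_round_robin) % num_partitions

# Same key maps to the same partition, every time:
assert select_partition(b"user_id=123", 8) == select_partition(b"user_id=123", 8)
```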
Durability is controlled by the producer's acks config: acks=0 is fire-and-forget (fastest, can lose messages); acks=1 waits for the leader only (can lose data if the leader fails before replicating); acks=all waits for all in-sync replicas (no data loss, highest latency).
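For illustration, here is what the tradeoff looks like with the kafka-python client; the broker address, topic, and payload are assumptions:

```python
from kafka import KafkaProducer  # pip install kafka-python

producer = KafkaProducer(
    bootstrap_servers="localhost:9092",  # assumed local broker
    acks="all",   # 0 = fire-and-forget, 1 = leader only, "all" = full ISR
    retries=5,    # retry transient broker errors instead of dropping
)

future = producer.send("order.created", key=b"user_id=123", value=b'{"total": 42}')
metadata = future.get(timeout=10)  # blocks until the broker ACKs per the acks setting
print(metadata.topic, metadata.partition, metadata.offset)
```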
Storage: Each partition is a sequence of segment files (each ~1 GB). New messages are appended to the active segment. Sequential writes to disk are extremely fast (500 MB/s vs 100 MB/s random). Retention: old segments are deleted when size > retention_bytes or age > retention_ms.
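A toy sketch of size-based segment rollover, to make the append-only layout concrete. Offsets, record framing, and file naming are simplified; real Kafka segments also carry offset and time index files:

```python
import os
import struct
import time

class SegmentLog:
    """Toy append-only partition log with size-based segment rollover."""

    def __init__(self, directory: str, max_segment_bytes: int = 1_000_000):
        self.dir = directory
        self.max_segment_bytes = max_segment_bytes
        self.next_offset = 0
        os.makedirs(directory, exist_ok=True)
        self._roll()  # open the first active segment

    def _roll(self) -> None:
        # Like Kafka, name each segment after the first offset it contains.
        path = os.path.join(self.dir, f"{self.next_offset:020d}.log")
        self.active = open(path, "ab")

    def append(self, value: bytes) -> int:
        header = struct.pack(">QQI", self.next_offset, int(time.time() * 1000), len(value))
        record = header + value
        if self.active.tell() + len(record) > self.max_segment_bytes:
            self.active.close()
            self._roll()  # retire the full segment; retention deletes old ones later
        self.active.write(record)  # sequential append: the fast path
        offset = self.next_offset
        self.next_offset += 1
        return offset
```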
Read Path
Consumer → Poll(topic, partition, offset) → Broker → Return messages from offset → Process messages → Commit offset (mark progress)
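A minimal at-least-once consumer loop, again sketched with kafka-python; the broker address and group name are assumptions and process() is a placeholder handler:

```python
from kafka import KafkaConsumer  # pip install kafka-python

def process(payload: bytes) -> None:
    ...  # business logic placeholder

consumer = KafkaConsumer(
    "order.created",
    bootstrap_servers="localhost:9092",  # assumed local broker
    group_id="billing-service",          # assumed group name
    enable_auto_commit=False,            # commit manually for at-least-once
    auto_offset_reset="earliest",        # new groups start from the oldest retained offset
)

for message in consumer:     # poll loop under the hood
    process(message.value)
    consumer.commit()        # commit AFTER processing => at-least-once
                             # (batching commits would be more efficient)
```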
Zero-copy read: Kafka uses Linux sendfile() to move data from the page cache straight to the network socket, skipping the copy through application memory. This is roughly 4–6x faster than a conventional read-then-write loop.
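The kernel-level idea in a few lines, using Python's os.sendfile as a stand-in for the same syscall Kafka reaches through Java NIO's transferTo (Linux; the path and connection handling here are assumptions):

```python
import os
import socket

def serve_segment(conn: socket.socket, path: str, offset: int, count: int) -> int:
    """Send `count` bytes of a segment file to a client without copying
    them into user space: the kernel moves page-cache pages to the socket."""
    with open(path, "rb") as segment:
        return os.sendfile(conn.fileno(), segment.fileno(), offset, count)
```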
Consumer group rebalancing: When a consumer joins or leaves, the group coordinator (a special broker) triggers a rebalance — reassigns partitions among the active consumers. During rebalancing, consumption pauses briefly. New consumers get partitions at the last committed offset.
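A simplified, round-robin-style version of what the coordinator computes on each rebalance; real assignors (range, sticky, cooperative-sticky) are more involved:

```python
def assign_partitions(partitions: list[int], consumers: list[str]) -> dict[str, list[int]]:
    """Deal partitions out to consumers as evenly as possible."""
    assignment: dict[str, list[int]] = {c: [] for c in consumers}
    for i, p in enumerate(sorted(partitions)):
        assignment[consumers[i % len(consumers)]].append(p)
    return assignment

# 6 partitions, 2 consumers -> 3 each; a third consumer joining triggers
# a rebalance and everyone ends up with 2.
print(assign_partitions(list(range(6)), ["c1", "c2"]))
print(assign_partitions(list(range(6)), ["c1", "c2", "c3"]))
```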
Delivery Guarantees
| Guarantee | How | Tradeoff |
|---|---|---|
| At-most-once | Commit offset before processing | May miss messages on crash |
| At-least-once | Commit offset after successful processing | May reprocess on crash (duplicates possible) |
| Exactly-once | Idempotent producer + transactional consumer | Higher latency, more complex |
At-least-once is the industry default. Handle idempotency in the consumer: track processed message IDs in Redis or DB, skip duplicates on reprocessing.
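One common dedup pattern, sketched with redis-py; the key prefix, TTL, and process() handler are assumptions:

```python
import redis  # pip install redis; assumes a Redis instance on the default port

r = redis.Redis()

def process(payload: bytes) -> None:
    ...  # business logic placeholder

def handle(message_id: str, payload: bytes) -> None:
    # SET NX is atomic: only the first delivery of this ID claims the key.
    # The TTL (7 days, matching retention) bounds the dedup set's growth.
    if not r.set(f"processed:{message_id}", 1, nx=True, ex=7 * 24 * 3600):
        return  # duplicate from an at-least-once redelivery; skip it
    process(payload)
```

Note that claiming the ID before processing drifts toward at-most-once if the handler crashes mid-message; systems that cannot tolerate that record the ID transactionally together with the processing result.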
Replication and Fault Tolerance
Each partition has a leader and N-1 followers (replicas). Producers and consumers talk only to the leader. Followers pull from the leader continuously. In-Sync Replicas (ISR) = followers that are caught up within a threshold. If the leader fails, the controller (coordinated via ZooKeeper, or via Kafka's built-in Raft-based KRaft mode in newer versions) elects a new leader from the ISR. With replication factor 3 and min.insync.replicas=2, the cluster survives one broker failure without data loss.
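A toy sketch of the failover rule (promote any surviving ISR member); real Kafka tracks ISR membership by replica fetch lag and elects through the controller:

```python
def elect_new_leader(replicas: list[str], isr: set[str], failed_leader: str) -> str:
    """Pick the first surviving in-sync replica as the new leader."""
    candidates = [b for b in replicas if b in isr and b != failed_leader]
    if not candidates:
        # No caught-up replica left: choosing an out-of-sync one would be
        # an "unclean" election and could lose acknowledged messages.
        raise RuntimeError("no in-sync replica available")
    return candidates[0]

print(elect_new_leader(["b1", "b2", "b3"], isr={"b1", "b2"}, failed_leader="b1"))  # -> b2
```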
Scale Numbers
- LinkedIn runs Kafka at ~7 trillion messages/day (roughly 80M messages/sec averaged over the day)
- Single broker handles 1–5 GB/s throughput with SSDs
- 1M messages/sec with 1KB messages = 1 GB/s of producer ingress; with replication factor 3 that becomes ~3 GB/s of cluster-wide writes, which lands at roughly 10 SSD-backed brokers (see the arithmetic sketch after this list).
- Consumer throughput limited by processing speed, not Kafka (Kafka can replay as fast as disk allows)
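A quick back-of-the-envelope check of that sizing; the ~400 MB/s sustained per-broker write budget is an assumption chosen to leave headroom below peak hardware numbers:

```python
msgs_per_sec = 1_000_000
msg_bytes = 1_000
replication = 3

ingress = msgs_per_sec * msg_bytes            # 1 GB/s arriving from producers
cluster_writes = ingress * replication        # 3 GB/s once replica traffic is included
per_broker_budget = 400 * 1_000_000           # assumed sustained bytes/s per SSD broker

brokers = -(-cluster_writes // per_broker_budget)  # ceiling division -> 8; ~10 with headroom
print(f"ingress={ingress/1e9:.1f} GB/s, cluster={cluster_writes/1e9:.1f} GB/s, brokers >= {brokers}")
```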
Interview Tips
- Explain partitions as the unit of parallelism — more partitions = more consumer group parallelism.
- Mention that partition ordering is guaranteed; topic-level ordering is not (messages across partitions are interleaved).
- At-least-once vs exactly-once: at-least-once with consumer-side idempotency is the pragmatic choice.
- Contrast with a traditional queue (RabbitMQ): Kafka retains messages and allows replay; RabbitMQ deletes after consumption. Use Kafka when you need event sourcing, audit trails, or stream processing.
- Use cases: event streaming, log aggregation, change data capture (CDC), decoupling microservices.