System Design Interview: Design a Configuration Management System (etcd/Consul)

What Is a Configuration Management System?

A configuration management system stores key-value configuration data that services read at startup or at runtime, enabling feature flags, service discovery, and dynamic tuning without redeployment. Examples: etcd (Kubernetes backbone), Consul, AWS AppConfig, LaunchDarkly. Core challenges: strong consistency (every reader sees the same value), watch notifications (push updates to subscribers within milliseconds), and high availability despite distributed consensus overhead.

System Requirements

    Functional

    • Get/Put/Delete key-value pairs
    • Watch: subscribe to changes on a key or prefix
    • Transactions: compare-and-swap (CAS) for leader election and distributed locks
    • Leases: keys auto-expire if the holder does not renew (ephemeral keys)
    • RBAC: per-key and per-prefix access control
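    A minimal sketch of the client API implied by these requirements (hypothetical names and signatures, not a specific client library; RBAC would be enforced in front of each call):

    from typing import Callable, Optional

    class ConfigClient:
        """Hypothetical client interface covering the functional requirements above."""

        def get(self, key: str) -> Optional[bytes]: ...                          # read a single key
        def put(self, key: str, value: bytes, lease: Optional[int] = None): ...  # write, optionally bound to a lease
        def delete(self, key: str) -> None: ...                                  # remove a key
        def watch(self, prefix: str, on_event: Callable[[dict], None]): ...      # push notifications on key/prefix changes
        def txn(self, compare: list, success: list, failure: list) -> bool: ...  # compare-and-swap transaction
        def grant_lease(self, ttl_seconds: int) -> int: ...                      # ephemeral keys: returns a lease ID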

    Non-Functional

    • Strong consistency: linearizable reads (no stale reads)
    • Watch latency: config changes propagated to subscribers in <100ms
    • High availability: survive minority node failures (3-node or 5-node cluster)
    • 10K reads/second, 100 writes/second (config is read-heavy)

    Raft Consensus

    etcd uses the Raft consensus algorithm to replicate writes across nodes. A cluster of N nodes tolerates (N-1)/2 failures (rounded down): a 3-node cluster tolerates 1 failure; a 5-node cluster tolerates 2. Raft basics:

    1. One node is elected leader (via randomized election timeout)
    2. All writes go to the leader
    3. Leader appends the write to its log and replicates to followers
    4. Once a majority (quorum) acknowledge, the write is committed
    5. Committed writes are applied to the state machine (key-value store)
    6. Readers can read from the leader (linearizable) or from followers (possibly stale)

    # etcd client operations
    etcdctl put /services/auth/host "10.0.1.5"
    etcdctl get /services/auth/host
    etcdctl watch /services/auth/         # watch prefix, get notified on any change
    etcdctl lease grant 60                                 # prints a lease ID
    etcdctl put /locks/job1 "worker-3" --lease=<lease-id>  # key auto-expires when the 60s lease does
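    A small sketch of the quorum arithmetic behind steps 4 and 5, just to make the failure-tolerance rule concrete:

    def quorum(n_nodes: int) -> int:
        """Smallest majority: the number of acks needed to commit a write."""
        return n_nodes // 2 + 1

    def tolerated_failures(n_nodes: int) -> int:
        """Nodes that can fail while the cluster can still reach quorum."""
        return (n_nodes - 1) // 2

    def is_committed(n_nodes: int, acks: int) -> bool:
        """Step 4: the write commits once a majority (leader included) has acknowledged it."""
        return acks >= quorum(n_nodes)

    assert quorum(3) == 2 and tolerated_failures(3) == 1   # 3-node cluster survives 1 failure
    assert quorum(5) == 3 and tolerated_failures(5) == 2   # 5-node cluster survives 2 failures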
    

    Watch Mechanism

    Clients register watches on keys or prefixes. The server stores active watchers. When a write is committed, the server scans watchers matching the written key and pushes a WatchEvent to each subscriber over a gRPC stream. The event includes: key, new value, old value, revision number. Clients reconnect automatically on disconnect and resume from the last seen revision (no events missed).
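    A toy, in-memory sketch of that server-side bookkeeping (hypothetical names; real etcd pushes events over gRPC streams and tracks revisions so clients can resume):

    from collections import defaultdict
    from dataclasses import dataclass
    from typing import Callable, Dict, List

    @dataclass
    class WatchEvent:
        key: str
        new_value: bytes
        old_value: bytes
        revision: int

    class WatchRegistry:
        """Toy version of the server-side watcher table."""

        def __init__(self) -> None:
            # prefix -> callbacks; in a real server each callback writes to a gRPC stream
            self.watchers: Dict[str, List[Callable[[WatchEvent], None]]] = defaultdict(list)

        def register(self, prefix: str, push: Callable[[WatchEvent], None]) -> None:
            self.watchers[prefix].append(push)

        def notify(self, event: WatchEvent) -> None:
            # Called after a write is committed: fan the event out to every matching watcher.
            for prefix, subscribers in self.watchers.items():
                if event.key.startswith(prefix):
                    for push in subscribers:
                        push(event)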

    Feature Flags with Config System

    PUT /features/dark_mode {"enabled": true, "rollout_percent": 20}
    
    # Application code (pseudocode; the value is stored as JSON)
    config = json.loads(etcd.get("/features/dark_mode"))
    # Note: use a stable hash in production; Python's built-in hash() varies per process.
    if config["enabled"] and hash(user_id) % 100 < config["rollout_percent"]:
        show_dark_mode()
    

    Percentage rollout without redeployment. Watch for changes: when the config is updated, the app receives a WatchEvent and hot-reloads the feature flag within 100ms. No restart required.
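    A sketch of that hot-reload pattern in the same pseudocode style as the snippet above (the watch registration and event fields are assumptions, not a specific client's API):

    import hashlib
    import json

    flag_cache = {"enabled": False, "rollout_percent": 0}   # in-memory copy of the flag

    def stable_hash(user_id: str) -> int:
        # Stable across processes, so a user lands in the same rollout bucket everywhere.
        return int(hashlib.sha256(user_id.encode()).hexdigest(), 16)

    def on_flag_change(event):
        # Called by the watch stream when /features/dark_mode changes: hot reload, no restart.
        global flag_cache
        flag_cache = json.loads(event.new_value)

    # etcd.watch("/features/dark_mode", on_flag_change)     # pseudocode: register the callback

    def should_show_dark_mode(user_id: str) -> bool:
        return flag_cache["enabled"] and stable_hash(user_id) % 100 < flag_cache["rollout_percent"]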

    Service Discovery

    Services register themselves on startup with a lease:

    lease = etcd.grant_lease(ttl=30)
    etcd.put(f"/services/auth/{instance_id}", json.dumps({"host": "10.0.1.5", "port": 8080}), lease=lease)
    # Keepalive: renew lease every 10 seconds
    etcd.keepalive(lease)
    

    On crash: the keepalive stops, the lease expires within 30 seconds, and the key is auto-deleted. Other services watching /services/auth/ receive a delete event and remove the dead instance from their load balancer. Kubernetes relies on etcd in much the same way: the API server persists cluster state (pods, services, endpoints) in etcd, and controllers watch it to react to changes.
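    A consumer-side sketch in the same pseudocode style (event fields and the watch call are assumptions): the watcher keeps an in-memory view of live instances for the load balancer.

    import json

    live_instances = {}                          # instance_id -> {"host": ..., "port": ...}

    def on_auth_service_event(event):
        instance_id = event.key.rsplit("/", 1)[-1]
        if event.type == "DELETE":               # lease expired or instance deregistered
            live_instances.pop(instance_id, None)
        else:                                    # PUT: new or re-registered instance
            live_instances[instance_id] = json.loads(event.new_value)

    # etcd.watch_prefix("/services/auth/", on_auth_service_event)   # pseudocode: register the watch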

    Compare-and-Swap for Leader Election

    # Only succeeds if the key does not currently exist (version 0 means "no key")
    succeeded, _ = etcd.transaction(
        compare=[etcd.transactions.version("/election/leader") == 0],
        success=[etcd.transactions.put("/election/leader", node_id, lease=my_lease)],
        failure=[]
    )
    if succeeded:
        become_leader()  # this node won the election
    

    CAS atomicity: only one node succeeds in creating the key. The winning node is the leader. When the leader crashes, its lease expires and the key is deleted. Other nodes retry the CAS and a new leader is elected.
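    A sketch of the surrounding election loop in the same pseudocode style: losers wait for the key to disappear, then retry the CAS.

    import time

    def run_for_leader(etcd, node_id, my_lease):
        """Contend for leadership until this node wins the CAS (same pseudocode client as above)."""
        while True:
            succeeded, _ = etcd.transaction(
                compare=[etcd.transactions.version("/election/leader") == 0],
                success=[etcd.transactions.put("/election/leader", node_id, lease=my_lease)],
                failure=[],
            )
            if succeeded:
                return                           # leader for as long as my_lease keeps being renewed
            # Lost the race: wait for the current leader's key to disappear, then retry.
            # (A real implementation would block on a watch instead of polling.)
            value, _ = etcd.get("/election/leader")
            while value is not None:
                time.sleep(1)
                value, _ = etcd.get("/election/leader")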

    Scaling Reads

    Config systems are roughly 100:1 read-heavy, and Raft requires a quorum round only for writes, so reads have three options:

    • Linearizable reads: route to the leader, which confirms its term with a quorum heartbeat. Highest consistency, higher latency.
    • Serializable reads: read from any follower. May be slightly stale, but 3-5ms faster.
    • Client-side caching: cache config values in the application and invalidate on WatchEvent. Most apps cache config in memory and update on watch, which reduces etcd reads to near zero.
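    A sketch of that caching pattern (pseudocode client calls, not a specific library's exact API): reads are served from process memory, and the watch stream keeps the copy current.

    import json

    class ConfigCache:
        """In-memory mirror of every key under a prefix, kept fresh by a watch stream."""

        def __init__(self, etcd, prefix: str):
            self.values = {}
            for value, meta in etcd.get_prefix(prefix):      # pseudocode: one bulk read at startup
                self.values[meta.key.decode()] = json.loads(value)
            # etcd.watch_prefix(prefix, self._on_event)      # pseudocode: register the watch

        def _on_event(self, event):
            if event.type == "DELETE":
                self.values.pop(event.key, None)
            else:
                self.values[event.key] = json.loads(event.new_value)

        def get(self, key: str):
            return self.values.get(key)                      # served from memory, no network call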

    Interview Tips

    • Raft is the consensus algorithm — know the quorum rule: N nodes tolerate (N-1)/2 failures.
    • Leases enable ephemeral keys without explicit deletion — key for service discovery and leader election.
    • Watch + gRPC stream is the push mechanism — not polling.
    • Client-side caching with watch invalidation is the production read pattern.