System Design: Distributed Cache — Eviction Policies, Consistency, and Scaling (2025)

Why Distributed Caching?

A single application server's in-memory cache is lost on restart and is not shared across instances. A distributed cache (Redis, Memcached) provides a shared, optionally persistent, low-latency key-value store accessible by all service instances. Key metrics: cache hit rate (aim for > 95% for hot data), cache latency (< 1 ms p99 within the same region), and memory efficiency (how well the eviction policy uses the available memory). When to cache: caching pays off when reads far outnumber writes, the same data is read repeatedly, the computation or database query is expensive, and the data can tolerate some staleness rather than needing to be perfectly fresh.
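A rough way to sanity-check that decision is to model expected read latency and database offload as a function of hit rate; the latencies below are illustrative assumptions, not measurements:

# Back-of-the-envelope model for the "should we cache?" decision.
# The latency numbers are illustrative assumptions.
def expected_read_latency_ms(hit_rate: float,
                             cache_ms: float = 1.0,
                             db_ms: float = 20.0) -> float:
    # A miss pays for the failed cache lookup and then the DB query.
    return hit_rate * cache_ms + (1 - hit_rate) * (cache_ms + db_ms)

print(expected_read_latency_ms(0.95))  # 2.0 ms vs. 20 ms going straight to the DB
print(expected_read_latency_ms(0.50))  # 11.0 ms -- far less benefit at a low hit rate
# DB read load also drops by roughly the hit rate: at 95% hits, the DB serves ~5% of reads.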

Eviction Policies

LRU (Least Recently Used): evict the item that was accessed least recently. Works well when recent access predicts future access (temporal locality). Implemented with a doubly linked list plus a hashmap, giving O(1) get and put. Redis approximates LRU rather than tracking exact order (which would be too expensive): it samples N random keys and evicts the least recently used key in the sample.

LFU (Least Frequently Used): evict the item accessed least often. Better for skewed access patterns, since popular items stay cached even if they have not been accessed recently. Harder to implement efficiently; Redis 4.0+ ships an approximate LFU mode.

TTL-based eviction: every key has a time-to-live and is evicted on expiry. Usually combined with LRU/LFU, which handle eviction when memory fills up before keys expire.

Write-through vs. write-behind: write-through updates the cache and the database synchronously on every write (strong consistency, higher write latency). Write-behind (write-back) updates the cache immediately and the database asynchronously (lower write latency, but data is lost if the cache crashes before the database write).
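A minimal LRU sketch of the hashmap-plus-doubly-linked-list idea above; Python's OrderedDict bundles both structures, so get and put stay O(1) (class and method names here are just for illustration):

from collections import OrderedDict

# Minimal LRU cache: OrderedDict is a hashmap backed by a doubly linked list,
# so move-to-end (mark as recently used) and pop-oldest (evict) are both O(1).
class LRUCache:
    def __init__(self, capacity: int):
        self.capacity = capacity
        self.items: OrderedDict = OrderedDict()

    def get(self, key):
        if key not in self.items:
            return None
        self.items.move_to_end(key)         # most recently used moves to the tail
        return self.items[key]

    def put(self, key, value):
        if key in self.items:
            self.items.move_to_end(key)
        self.items[key] = value
        if len(self.items) > self.capacity:
            self.items.popitem(last=False)  # evict from the head: least recently used

Redis's sampled approximation trades the exactness of this ordering for far less bookkeeping per key.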

Cache Aside (Lazy Loading) Pattern

class CacheAsideService:
    def __init__(self, redis_client, db):
        self.redis = redis_client  # e.g. a redis.Redis instance
        self.db = db               # application database access layer

    def get_user(self, user_id: int) -> User | None:
        # 1. Check cache
        cached = self.redis.get(f"user:{user_id}")
        if cached:
            return User.from_json(cached)

        # 2. Cache miss: load from DB
        user = self.db.query("SELECT * FROM users WHERE id = %s", user_id)
        if not user:
            return None

        # 3. Populate cache with TTL
        self.redis.setex(
            f"user:{user_id}",
            3600,  # TTL: 1 hour
            user.to_json()
        )
        return user

    def update_user(self, user_id: int, data: dict) -> User:
        # Update DB first (source of truth)
        user = self.db.execute(
            "UPDATE users SET ... WHERE id = %s", user_id, data
        )
        # Invalidate cache (delete, not update -- avoid race conditions)
        self.redis.delete(f"user:{user_id}")
        return user

Why delete on update rather than update the cache? Race condition: if two threads simultaneously update the user, one may write a stale value into the cache after the other has already written the fresh value. Delete + lazy reload (on next read) avoids this race. This is the “cache-aside with invalidation” pattern.

Consistent Hashing for Cache Sharding

With N cache nodes, a naive modulo scheme (hash(key) % N) remaps nearly all keys when a node is added or removed. Consistent hashing instead maps both keys and nodes onto a ring of positions (0 to 2^32 - 1). Each cache node owns one or more segments of the ring: a key belongs to the first node clockwise from its hash. On node addition or removal, only the keys in the affected segments are remapped (roughly 1/N of all keys). Virtual nodes (vnodes): each physical node is represented by K points spread around the ring, which evens out the load (without vnodes, node placement can be very uneven). A common choice is on the order of 100 to 300 virtual nodes per physical node (Cassandra, for example, historically defaulted to 256 tokens per node). Implementation: a sorted array of (hash, node) pairs plus binary search to find the node responsible for a key, as sketched below.
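A sketch of that sorted-array-plus-binary-search implementation with virtual nodes; the vnode count and the MD5-based ring hash are assumptions for illustration, not any particular library's API:

import bisect
import hashlib

# Consistent hash ring: each physical node contributes `vnodes` points on the
# ring; a key belongs to the first node point clockwise from the key's hash.
class HashRing:
    def __init__(self, nodes: list[str], vnodes: int = 150):
        self.ring: list[tuple[int, str]] = []
        for node in nodes:
            for i in range(vnodes):
                self.ring.append((self._position(f"{node}#{i}"), node))
        self.ring.sort()

    @staticmethod
    def _position(key: str) -> int:
        # Stable 32-bit ring position derived from MD5 (any stable hash works).
        return int(hashlib.md5(key.encode()).hexdigest(), 16) % (2 ** 32)

    def get_node(self, key: str) -> str:
        h = self._position(key)
        # First vnode at or clockwise of the key; wrap past the end of the array.
        idx = bisect.bisect_left(self.ring, (h, "")) % len(self.ring)
        return self.ring[idx][1]

ring = HashRing(["cache-a", "cache-b", "cache-c"])
print(ring.get_node("user:42"))  # stable assignment until the node set changes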

Cache Stampede and Thundering Herd Prevention

Cache stampede: a popular key expires, and hundreds of requests simultaneously see a cache miss and all query the database, overwhelming it. Solutions: Probabilistic early expiration: before a key expires, individual requests recompute it early with a probability that grows as expiry approaches. One common form recomputes when current_time - delta * beta * ln(rand()) >= expiry_time, where delta is the time the recomputation takes, beta is a tuning constant (higher means earlier refresh), and rand() is uniform on (0, 1]; this spreads the recomputation over time rather than firing it all at once. Mutex/lock on miss: the first request to detect a cache miss acquires a distributed lock (Redis SET with NX and a TTL). Only the lock holder queries the DB and repopulates the cache; other requests wait briefly or serve stale data if available. Background refresh: a background job refreshes cache entries before they expire (based on last access time), so popular items are continuously refreshed while they receive traffic and effectively never expire. Sketches of the first two approaches follow below.
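A sketch of the first two mitigations using redis-py: a SET NX EX lock on miss, plus an early-recompute check of the form above. The recompute_value callback, the lock TTL, and BETA are assumptions for illustration:

import math
import random
import time

LOCK_TTL = 10   # seconds; bounds how long a crashed lock holder blocks others
BETA = 1.0      # > 1 shifts recomputation earlier before expiry

def get_with_lock_on_miss(redis, key, ttl, recompute_value):
    # Cache-aside read where only one request rebuilds a missing value.
    cached = redis.get(key)
    if cached is not None:
        return cached

    # SET with NX and EX is an atomic "acquire lock with TTL" in Redis.
    if redis.set(f"lock:{key}", "1", nx=True, ex=LOCK_TTL):
        try:
            value = recompute_value()        # caller-supplied rebuild from the DB
            redis.setex(key, ttl, value)
            return value
        finally:
            redis.delete(f"lock:{key}")

    # Lost the race: wait briefly for the winner to repopulate, then retry once.
    time.sleep(0.05)
    return redis.get(key)  # may still be None; callers can fall back to the DB

def should_recompute_early(expiry_ts: float, recompute_seconds: float) -> bool:
    # Probabilistic early expiration: the closer to expiry (and the costlier the
    # recompute), the more likely this returns True for any given request.
    rand = random.random() or 1e-12          # avoid log(0)
    return time.time() - recompute_seconds * BETA * math.log(rand) >= expiry_ts

A production-grade lock would store a unique token and release it only if the token still matches (for example via a small Lua script), so a holder whose TTL has expired cannot delete someone else's lock.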

