System Design: Distributed Counters, Leaderboards, and Real-Time Analytics

The Problem: Counting at Scale

Incrementing a counter sounds trivial, until you need to count 10 million page views per second across 100 servers without a central bottleneck. The naive approach is to have every event run UPDATE counters SET count = count + 1 WHERE id = X. At high throughput, row-level contention on that single row becomes the bottleneck. At Twitter scale (500M tweets/day, roughly 5,800/sec), issuing one counter update per like would saturate a single hot database row for any popular tweet.

Approach 1: Redis INCR

Redis INCR is an atomic O(1) operation. For write-heavy metrics (like view counts), Redis handles hundreds of thousands to millions of increments per second. Persist to the database asynchronously, every N seconds or every N events.

import redis
r = redis.Redis()

def increment_view(post_id: str, user_id: str):
    # Deduplicate: SADD returns 1 only when this user has not viewed the post before
    is_first_view = r.sadd(f"views:{post_id}:users", user_id)
    if is_first_view:
        r.incr(f"views:{post_id}:count")

def get_view_count(post_id: str) -> int:
    return int(r.get(f"views:{post_id}:count") or 0)

# Periodic flush to the database (run every 60 seconds); `db` is assumed to be an
# application-provided handle with an execute(sql, params) method
def flush_counters():
    for key in r.scan_iter("views:*:count"):
        post_id = key.decode().split(':')[1]
        # GETSET reads the count and resets it to 0 atomically, so increments
        # arriving during the flush are not lost
        count = int(r.getset(key, 0) or 0)
        if count > 0:
            db.execute("UPDATE posts SET view_count = view_count + %s WHERE id = %s",
                       [count, post_id])

Approach 2: Sharded Counters

For ultra-high-traffic counters (YouTube view count, Twitter like count), shard the counter into N shards. Each increment goes to a random shard. Read requires summing all shards.

import random

NUM_SHARDS = 16

def increment(counter_name: str):
    # Spread writes: each increment lands on a random one of NUM_SHARDS keys
    shard = random.randint(0, NUM_SHARDS - 1)
    r.incr(f"counter:{counter_name}:shard:{shard}")

def get_count(counter_name: str) -> int:
    # Read all shards in a single MGET round-trip and sum them
    keys = [f"counter:{counter_name}:shard:{i}" for i in range(NUM_SHARDS)]
    values = r.mget(keys)
    return sum(int(v or 0) for v in values)

Trade-off: reads are O(NUM_SHARDS). With 16 shards, writes spread across 16 keys, removing the single hot-key contention point; reads need 16 operations, but MGET batches them into a single network round-trip.

Approach 3: Lambda Architecture

For analytics that need both real-time and historical accuracy:

  • Speed layer: Kafka + Flink/Spark Streaming. Process events in near-real-time windows (1-minute, 5-minute). Store in Redis. Low latency, approximate accuracy.
  • Batch layer: Kafka + Spark batch jobs process complete historical data daily. Store in data warehouse (BigQuery, Snowflake, Redshift). High accuracy, high latency.
  • Serving layer: merge batch + speed layer results. Batch provides accurate historical; speed provides recent events not yet in batch.
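
A minimal sketch of the serving-layer merge, assuming the batch layer exposes a precomputed total that is complete through yesterday and the speed layer keeps per-day counts in Redis (the key names and the query_batch_total helper are illustrative, not tied to any specific framework):

import time

def query_batch_total(metric: str, up_to_day: str) -> int:
    # Placeholder for a warehouse query, e.g.
    # SELECT SUM(count) FROM daily_counts WHERE metric = %s AND day <= %s
    return 0

def get_total(metric: str) -> int:
    # Merge the accurate-but-stale batch view with the fresh-but-approximate speed view
    today = time.strftime("%Y-%m-%d", time.gmtime())
    yesterday = time.strftime("%Y-%m-%d", time.gmtime(time.time() - 86400))
    batch_total = query_batch_total(metric, up_to_day=yesterday)  # complete through yesterday
    speed_total = int(r.get(f"speed:{metric}:{today}") or 0)      # today's events from the speed layer
    return batch_total + speed_total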

Approach 4: Time-Series Databases

For metrics with a time dimension (requests/minute, error rate over time), use a time-series database:

  • InfluxDB: designed for time-series, supports downsampling (retain 1-minute resolution for 7 days, 1-hour resolution for 1 year), native aggregations.
  • Prometheus: pull-based metrics collection. Scraped from services every 15 seconds. PromQL for querying. Grafana for visualization.
  • TimescaleDB: PostgreSQL extension for time-series. Full SQL support, automatic partitioning by time.
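
If a full time-series database is overkill, the bucketing idea can be approximated with the Redis counters used above: one counter key per 1-minute bucket, expired after the retention window. A minimal sketch (key names and retention values are illustrative):

import time

BUCKET_SECONDS = 60
RETENTION_SECONDS = 7 * 24 * 3600  # keep 1-minute resolution for 7 days

def record_request(metric: str):
    # Round the current time down to the start of its 1-minute bucket
    bucket = int(time.time()) // BUCKET_SECONDS * BUCKET_SECONDS
    key = f"ts:{metric}:{bucket}"
    r.incr(key)
    r.expire(key, RETENTION_SECONDS)

def requests_last_hour(metric: str) -> int:
    # Sum the 60 most recent 1-minute buckets in one MGET
    now = int(time.time()) // BUCKET_SECONDS * BUCKET_SECONDS
    keys = [f"ts:{metric}:{now - i * BUCKET_SECONDS}" for i in range(60)]
    return sum(int(v or 0) for v in r.mget(keys))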

Real-Time Leaderboards

# Redis Sorted Set: O(log n) insert, O(log n) rank query
def add_score(leaderboard: str, user_id: str, score: float):
    r.zadd(leaderboard, {user_id: score})

def increment_score(leaderboard: str, user_id: str, delta: float):
    r.zincrby(leaderboard, delta, user_id)

def get_top_k(leaderboard: str, k: int = 10):
    # ZREVRANGE: highest scores first, returned with their scores
    return r.zrevrange(leaderboard, 0, k - 1, withscores=True)

def get_rank(leaderboard: str, user_id: str) -> int | None:
    rank = r.zrevrank(leaderboard, user_id)  # 0-indexed, None if not found
    return (rank + 1) if rank is not None else None
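
A brief usage sketch, assuming a per-day leaderboard key (the key name and values are illustrative):

add_score("leaderboard:2024-06-01", "alice", 4200)
increment_score("leaderboard:2024-06-01", "alice", 150)

top10 = get_top_k("leaderboard:2024-06-01")               # [(b"alice", 4350.0), ...]
alice_rank = get_rank("leaderboard:2024-06-01", "alice")  # 1-indexed rank, or None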

Sliding Window Rate Counter

import time
import uuid

def is_rate_limited(user_id: str, window_seconds: int = 60, max_requests: int = 100) -> bool:
    now = time.time()
    key = f"rate:{user_id}"
    # Remove events older than the window
    r.zremrangebyscore(key, '-inf', now - window_seconds)
    # Count remaining events in the window
    count = r.zcard(key)
    if count >= max_requests:
        return True  # rate limited
    # Record this event; a unique member avoids collisions when two requests
    # share the same timestamp. Note: check-then-add is not atomic; wrap it in
    # a Lua script or MULTI/EXEC if strict limits are required.
    r.zadd(key, {f"{now}:{uuid.uuid4().hex}": now})
    r.expire(key, window_seconds)
    return False

Exactly-Once Event Counting

At-least-once delivery (Kafka default) means events may be counted twice on reprocessing. For exact counts:

  • Use an event ID (UUID) per event. Before counting, check if event_id is already in a “processed” set (Bloom filter for memory efficiency, Redis SET for accuracy). Only count if not seen before.
  • For session-based deduplication (count unique views, not total views), use a per-user per-item set: SADD views:{post_id}:unique {user_id} returns 0 if already in set.
  • HyperLogLog: Redis PFADD/PFCOUNT for approximate unique counts with 0.81% error at O(1) memory per counter — perfect for “unique visitors” type metrics.
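
A minimal sketch of ID-based deduplication and HyperLogLog unique counts, reusing the Redis client from above (key names are illustrative):

def count_event_once(event_id: str, counter_key: str, seen_key: str = "events:processed"):
    # SADD returns 1 only the first time this event_id is seen
    if r.sadd(seen_key, event_id):
        r.incr(counter_key)

def record_unique_visitor(page_id: str, user_id: str):
    # HyperLogLog: fixed ~12 KB per key, ~0.81% standard error on the count
    r.pfadd(f"uniques:{page_id}", user_id)

def unique_visitors(page_id: str) -> int:
    return r.pfcount(f"uniques:{page_id}")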

Interview Questions

Q: How would you design YouTube’s view count system?

Views are write-heavy (each play increments the counter). Deduplicate within a session window (same user within 24 hours counts once). Architecture: (1) Player sends view event to a Kafka topic. (2) A Flink job deduplicates by {video_id, user_id, day} and increments a sharded Redis counter (16 shards per video). (3) Every 5 minutes, a flush job reads Redis shards, sums them, and applies the delta to the database with an atomic ADD. (4) View counts are read from Redis for real-time display; the database is the authoritative store for historical analytics.
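
A hedged sketch of steps (2) and (3), reusing the Redis client and sharding pattern from above; key names are illustrative, `db` is an assumed database handle, and a production pipeline would typically hold the dedup state in Flink rather than Redis:

import random
import time

NUM_VIDEO_SHARDS = 16

def process_view_event(video_id: str, user_id: str):
    day = time.strftime("%Y-%m-%d", time.gmtime())
    # Step 2: deduplicate by {video_id, user_id, day}; SADD returns 1 for a first view
    if r.sadd(f"viewed:{video_id}:{day}", user_id):
        shard = random.randint(0, NUM_VIDEO_SHARDS - 1)
        r.incr(f"views:{video_id}:shard:{shard}")

def flush_video(video_id: str):
    # Step 3: drain the shards and apply the summed delta to the authoritative store
    keys = [f"views:{video_id}:shard:{i}" for i in range(NUM_VIDEO_SHARDS)]
    delta = sum(int(r.getset(k, 0) or 0) for k in keys)
    if delta > 0:
        db.execute("UPDATE videos SET view_count = view_count + %s WHERE id = %s",
                   [delta, video_id])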

Asked at: Twitter/X, Netflix, LinkedIn, Databricks, Snap
