Why Sharding?
A single database server has limits: storage capacity (~tens of terabytes), write throughput (~tens of thousands of writes/second), and connection limits. When a single server cannot handle your data volume or write load, sharding (horizontal partitioning) splits the data across multiple database servers. Each server (shard) holds a subset of the rows. Combined, all shards hold the full dataset.
Contrast this with vertical scaling (a bigger server) and read replicas (which help with reads, not writes). Sharding is the answer for write-heavy workloads that exceed a single node's capacity.
Shard Key Selection
The shard key determines which shard a row belongs to. This is the most critical design decision. Good shard key properties:
- High cardinality: enough distinct values to distribute data across shards evenly. A boolean column is a terrible shard key.
- Even distribution: data should be spread roughly equally across shards. If 90% of users are in the US and you shard by country, the US shard is a hotspot.
- Query locality: queries that go to a single shard are fast. Cross-shard queries (scatter-gather) are expensive. Choose a shard key that co-locates related data.
Common choices: user_id (user-scoped queries stay on one shard), tenant_id (multi-tenant SaaS), geographic region.
Sharding Strategies
Hash-based sharding: shard = hash(shard_key) % num_shards. Pros: approximately uniform distribution for high-cardinality keys. Cons: range queries must scan all shards, and adding shards requires rehashing nearly all data.
Range-based sharding: shard by ranges of the key (shard 1: user_id 1-1,000,000; shard 2: 1,000,001-2,000,000). Pros: range queries stay on one shard. Cons: monotonically increasing keys (auto-increment IDs) send all writes to the last shard, creating a hotspot.
Directory-based sharding: a lookup table maps each shard key to a shard. Pros: flexible; rebalancing is just an update to the lookup table. Cons: the directory itself is a single point of failure and must be cached aggressively.
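The first two strategies can be sketched in a few lines. This is illustrative only: the shard count, range size, and use of md5 as the routing hash are assumptions, not any particular system's design. A stable hash like md5 is used because Python's built-in hash() is randomized per process and unsuitable for routing.

```python
import hashlib

NUM_SHARDS = 4  # assumed shard count for illustration


def shard_for(shard_key: str) -> int:
    """Hash-based routing: a stable hash of the key, modulo the shard count."""
    digest = hashlib.md5(shard_key.encode()).hexdigest()
    return int(digest, 16) % NUM_SHARDS


def shard_for_range(user_id: int, range_size: int = 1_000_000) -> int:
    """Range-based routing: contiguous blocks of user_id per shard
    (shard index 0 holds user_id 1..1,000,000, index 1 the next block)."""
    return (user_id - 1) // range_size


print(shard_for("user_42"))        # some shard in 0..3, stable across runs
print(shard_for_range(1_500_000))  # 1
```

Note how shard_for bakes NUM_SHARDS into every routing decision: changing the shard count remaps nearly every key, which is exactly the weakness consistent hashing addresses below.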
Cross-Shard Queries
Queries that need data from multiple shards require scatter-gather: send the query to all relevant shards and merge the results. Problems:
- Latency: queries fan out to N shards in parallel, and you wait for the slowest.
- JOINs across shards: not possible at the database level; they must be done in application code.
- Aggregations (COUNT, SUM): compute per-shard, then aggregate in the application.
Design to minimize cross-shard queries: choose the shard key that makes common queries shard-local. For a social app, shard by user_id so that user profile, posts, and followers stay on one shard. Global queries (trending posts) will always require cross-shard scatter-gather; serve them from a separate aggregation layer (pre-computed summaries in Redis).
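A minimal sketch of scatter-gather aggregation, with in-memory lists standing in for the per-shard databases (the data and shard layout are invented for illustration):

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical per-shard stores: in reality each entry is a separate database.
SHARDS = [
    [{"user_id": 1, "likes": 10}, {"user_id": 5, "likes": 3}],
    [{"user_id": 2, "likes": 7}],
    [{"user_id": 3, "likes": 12}, {"user_id": 7, "likes": 1}],
]


def count_on_shard(rows):
    # Per-shard partial aggregate; in SQL this would be SELECT COUNT(*).
    return len(rows)


def global_count():
    # Scatter: query every shard in parallel. Gather: sum the partials.
    # Total latency is bounded by the slowest shard, not the sum.
    with ThreadPoolExecutor(max_workers=len(SHARDS)) as pool:
        partials = list(pool.map(count_on_shard, SHARDS))
    return sum(partials)


print(global_count())  # 5
```

The same scatter/merge shape applies to SUM, MAX, or top-K queries; only the per-shard function and the merge step change.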
Hotspots
A hotspot is a shard receiving disproportionate traffic. Causes: (1) Popular user (a celebrity’s account receives 1000x more requests than average). (2) Sequential inserts (all new rows go to the last range-based shard). (3) Time-based queries concentrated on the current period. Mitigation: add a random suffix to the shard key for popular entities (user_123_0, user_123_1, … user_123_9 — spread across 10 shards). Scatter reads across all suffixed shards and merge. This adds complexity but eliminates the hotspot.
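The suffix trick above amounts to a pair of helper functions: writes pick a random suffix, and reads enumerate every suffix. The key format and suffix count here are assumptions for illustration.

```python
import random

NUM_SUFFIXES = 10  # spread one hot entity across 10 logical keys


def write_key(user_id: int) -> str:
    # Each write picks a random suffix, so writes for a hot user are
    # spread across NUM_SUFFIXES shard keys instead of one.
    return f"user_{user_id}_{random.randrange(NUM_SUFFIXES)}"


def read_keys(user_id: int) -> list[str]:
    # Reads must scatter across every suffixed key and merge the results.
    return [f"user_{user_id}_{i}" for i in range(NUM_SUFFIXES)]


print(write_key(123))   # e.g. "user_123_7"
print(read_keys(123))   # ["user_123_0", ..., "user_123_9"]
```

The trade-off is explicit in the code: one cheap write path in exchange for a 10-way scatter on every read, so apply it only to entities that are actually hot.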
Resharding
When a shard fills up or receives too much traffic, split it into two shards. Strategies:
- Consistent hashing: nodes are placed on a ring. Adding a node moves only a fraction of keys (roughly 1/N of the total), so data movement is minimal.
- Pre-sharding: create more shards than you need initially (e.g., 256 logical shards on 4 physical servers, 64 logical shards per server). To scale, move whole logical shards to new servers; no keys are remapped.
- Online resharding: copy data to the new shard, verify, switch traffic, then delete from the old shard. Use dual writes (write to both old and new shards during the migration) to avoid data loss.
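A minimal consistent-hash ring with virtual nodes, to make the "only 1/N of keys move" claim concrete. This is a sketch, not production code: node names, the virtual-node count, and md5 as the ring hash are all illustrative choices.

```python
import bisect
import hashlib


def _hash(value: str) -> int:
    return int(hashlib.md5(value.encode()).hexdigest(), 16)


class ConsistentHashRing:
    def __init__(self, nodes=(), vnodes=100):
        self.vnodes = vnodes
        self._keys = []    # sorted virtual-node positions on the ring
        self._nodes = {}   # virtual-node position -> physical node
        for node in nodes:
            self.add_node(node)

    def add_node(self, node: str):
        # Each physical node occupies `vnodes` positions on the ring,
        # which evens out the share of keyspace each node owns.
        for i in range(self.vnodes):
            h = _hash(f"{node}#{i}")
            bisect.insort(self._keys, h)
            self._nodes[h] = node

    def node_for(self, key: str) -> str:
        # First virtual node clockwise from the key's hash (wrapping around).
        idx = bisect.bisect(self._keys, _hash(key)) % len(self._keys)
        return self._nodes[self._keys[idx]]


ring = ConsistentHashRing(["db1", "db2", "db3"])
before = {f"user_{i}": ring.node_for(f"user_{i}") for i in range(1000)}
ring.add_node("db4")
moved = sum(ring.node_for(k) != before[k] for k in before)
print(f"{moved} of 1000 keys moved")  # roughly a quarter, not almost all
```

Contrast with hash % N routing, where going from 3 to 4 shards would remap about three quarters of all keys.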
Interview Tips
- Do not shard prematurely. Start with a single well-tuned database + read replicas. Sharding adds enormous complexity (no cross-shard transactions, no cross-shard JOINs, complex resharding).
- Consistent hashing is the standard answer for “how do you handle resharding?” It minimizes data movement when nodes are added or removed.
- Global unique IDs across shards: auto-increment breaks with multiple shards. Use Snowflake IDs (timestamp + machine ID + sequence) or UUID instead.
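A sketch of a Snowflake-style generator following the layout above (41-bit timestamp, 10-bit machine ID, 12-bit sequence). The epoch constant and class shape are illustrative; a real implementation also needs to handle clock skew, which is ignored here.

```python
import threading
import time

EPOCH_MS = 1_288_834_974_657  # a fixed custom epoch; any recent one works


class SnowflakeGenerator:
    """Unique, roughly time-ordered 64-bit IDs:
    (timestamp << 22) | (machine_id << 12) | sequence."""

    def __init__(self, machine_id: int):
        assert 0 <= machine_id < 1024  # must fit in 10 bits
        self.machine_id = machine_id
        self.sequence = 0
        self.last_ms = -1
        self.lock = threading.Lock()

    def next_id(self) -> int:
        with self.lock:
            now = int(time.time() * 1000)
            if now == self.last_ms:
                self.sequence = (self.sequence + 1) & 0xFFF  # 12 bits
                if self.sequence == 0:
                    # 4096 IDs issued this millisecond: wait for the next one.
                    while now <= self.last_ms:
                        now = int(time.time() * 1000)
            else:
                self.sequence = 0
            self.last_ms = now
            return ((now - EPOCH_MS) << 22) | (self.machine_id << 12) | self.sequence


gen = SnowflakeGenerator(machine_id=7)
a, b = gen.next_id(), gen.next_id()
print(a < b)  # True: IDs from one generator are strictly increasing
```

Because the timestamp occupies the high bits, these IDs also sort by creation time, which auto-increment gives you but random UUIDs do not.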
Frequently Asked Questions

How do you choose the right shard key for a social media application?

For a social media app, most queries are user-scoped: get my posts, my followers, my feed. The natural shard key is user_id. This co-locates all data for a user on one shard: profile, posts, follows, likes. Queries for a user's own data never cross shards. Trade-off: queries across users (trending posts, mutual friend suggestions) require scatter-gather. Hot user problem: a celebrity with 100M followers receives far more traffic than average. Mitigation: split hot users across shards by appending a random suffix (celebrity_123_shard_0 through _9), spreading their data across 10 shards; reads scatter across all 10 and merge. Follow relationships between different users do go cross-shard, but this is acceptable if kept in a separate graph store (Neo4j, or a dedicated follows table sharded differently).

How does consistent hashing minimize data movement when adding shards?

Naive hash sharding (hash % N): adding one shard changes N, remapping almost all keys to different shards. Consistent hashing: place shard servers on a ring of hash values (0 to 2^32 - 1). Each key is hashed to a position on the ring and assigned to the next server clockwise. Adding a server remaps only the keys between the new server and its predecessor: roughly 1/N of all keys, not all of them. Virtual nodes: each physical server occupies multiple positions on the ring (100-200 virtual nodes per server), which distributes load more evenly and reduces variance in key distribution. Used by Cassandra and DynamoDB. The key insight: consistent hashing limits data migration to O(K/N) keys when adding one of N nodes, where K is the total number of keys.

How do you handle cross-shard transactions?

Cross-shard transactions (e.g., transferring money between two users on different shards) cannot use a single ACID database transaction. Options: (1) Two-Phase Commit (2PC): a coordinator asks all shards to prepare, then to commit. It provides ACID guarantees but is slow (two round trips), and a coordinator crash can leave shards locked indefinitely. (2) Saga pattern: decompose the transaction into a sequence of local transactions, each with a compensating action. If step 2 fails, run the compensating action for step 1 to undo it. This is eventually consistent: there is a window where the system is in a partially applied state. (3) Avoid cross-shard transactions by design: place related data on the same shard. For money transfers between users on different shards, use an intermediate ledger service that handles the cross-shard coordination.

What is the difference between sharding and partitioning?

Partitioning is a broad term: dividing data into subsets based on some criterion. Sharding is a specific type of horizontal partitioning where each partition (shard) lives on a separate database server (or cluster). Vertical partitioning splits columns across tables (a user_profile table plus a user_activity table for the same user, separating frequently accessed columns from infrequently accessed ones). Horizontal partitioning (sharding) splits rows across servers based on a shard key. Within a single database, table partitioning (PostgreSQL, MySQL) splits a table into physical partitions on the same server for query performance (partition pruning), but does not scale writes across servers. Sharding refers specifically to the multi-server case.

How do you rebalance data when a shard becomes overloaded?

When one shard is a hotspot (too much data or traffic), split it. Steps: (1) Choose a split point within the shard's key range. (2) Create a new empty shard. (3) Copy the data for the migrating key range from the hot shard to the new shard. (4) Use dual writes: write new data to both the old and new shards during migration. (5) Verify data consistency (checksums or row counts). (6) Cut over: update the routing table to point the migrated key range at the new shard. (7) Stop dual writes. (8) Clean up the migrated data from the old shard. During migration, reads are served from both shards (read new, fall back to old). The entire process can be done online with zero downtime. Vitess (for MySQL) and Citus (for PostgreSQL) automate this process.
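The saga pattern described in the cross-shard-transactions answer above reduces to a small control loop: run local transactions in order, and on failure run the compensations for the completed steps in reverse. This sketch uses an in-memory log in place of real per-shard transactions; all names are invented for illustration.

```python
def run_saga(steps):
    """steps: list of (action, compensation) pairs. Runs actions in order;
    on failure, runs compensations for completed steps in reverse order."""
    done = []
    for action, compensate in steps:
        try:
            action()
            done.append(compensate)
        except Exception:
            for undo in reversed(done):
                undo()  # roll back the local transactions that committed
            return False
    return True


log = []


def debit_a():
    log.append("debit A")            # local transaction on A's shard


def undo_debit_a():
    log.append("undo debit A")       # compensating action: refund A


def credit_b():
    raise RuntimeError("shard for user B is down")  # simulated failure


def undo_credit_b():
    log.append("undo credit B")


ok = run_saga([(debit_a, undo_debit_a), (credit_b, undo_credit_b)])
print(ok, log)  # False ['debit A', 'undo debit A']
```

Between the debit committing and the compensation running, the system is visibly in a partially applied state; that window is the price of avoiding 2PC's locks.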