System Design: News Feed (Facebook / Instagram / Twitter)
A social media news feed aggregates posts from a user’s network and ranks them for relevance. This is one of the most common system design interview problems — it tests your knowledge of fanout, ranking, caching, and the tradeoffs between write-time and read-time computation.
Requirements
Functional: Users follow other users. When a user creates a post, it appears in followers’ feeds. Feeds are ranked (not purely chronological). Support pagination. Handle celebrities (users with millions of followers).
Non-functional: 500M DAU, feed loads in <500ms, 1 billion feed reads per day, eventual consistency acceptable.
Core Models
User: user_id, username, follower_count
Post: post_id, author_id, content, media_url, created_at, like_count
Follow: follower_id, followee_id, created_at
FeedItem: user_id, post_id, score, created_at (pre-built feed entries)
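A minimal sketch of these models as Python dataclasses. The field types and defaults are assumptions for illustration; production schemas vary by datastore:

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

@dataclass
class User:
    user_id: int
    username: str
    follower_count: int = 0

@dataclass
class Post:
    post_id: int
    author_id: int
    content: str
    created_at: datetime
    media_url: Optional[str] = None  # CDN URL; media itself lives in object storage
    like_count: int = 0

@dataclass
class Follow:
    follower_id: int
    followee_id: int
    created_at: datetime

@dataclass
class FeedItem:
    # A pre-built feed entry: one row per (user, post) produced by fanout-on-write.
    user_id: int
    post_id: int
    score: float
    created_at: datetime
```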
Fanout Strategies
When a user posts, how do followers see it in their feed? Two approaches:
Fanout on Write (Push Model)
On post creation: asynchronously write a FeedItem row to every follower’s feed cache in Redis. When the user opens their feed, it reads directly from their pre-built feed — O(1) lookup.
def on_post_created(post_id, author_id):
    followers = get_all_followers(author_id)  # from Follow DB
    for follower_id in followers:
        redis.lpush(f"feed:{follower_id}", post_id)
        redis.ltrim(f"feed:{follower_id}", 0, 999)  # keep last 1000 posts
    kafka.publish("post.created", {"post_id": post_id, "author_id": author_id})
Pros: Fast reads — feed is pre-built. Cons: Write amplification — a celebrity with 10M followers causes 10M Redis writes per post. Unacceptable for accounts like @BarackObama (130M followers).
Fanout on Read (Pull Model)
On feed load: query all followees of the user, fetch their recent posts, merge and rank. No pre-building.
def get_feed(user_id, page=0):
    followees = get_followees(user_id)  # up to 5,000 follows
    posts = []
    for followee_id in followees:
        posts += get_recent_posts(followee_id, limit=10)
    ranked = rank_posts(posts, user_id)
    return ranked[page*20 : (page+1)*20]
Pros: No write amplification. Cons: Read is slow — N followees = N DB queries. Unacceptable for users who follow thousands of accounts.
Hybrid Model (Industry Standard)
The actual approach used by Facebook, Instagram, and Twitter:
- Regular users (<10K followers): Fanout on write. Pre-build feed in Redis.
- Celebrities (>10K followers): Fanout on read. Their posts are NOT pushed to followers’ feeds. Instead, at read time, the feed service fetches the top celebrity posts and merges them with the pre-built feed from non-celebrity follows.
def get_feed(user_id):
    # 1. Pre-built feed (regular follows, fanout-on-write)
    regular_posts = redis.lrange(f"feed:{user_id}", 0, 99)
    # 2. Celebrity posts (fanout-on-read)
    celebrity_followees = get_celebrity_follows(user_id)
    celebrity_posts = []
    for celeb_id in celebrity_followees:
        celebrity_posts += get_recent_posts(celeb_id, limit=5)
    # 3. Merge and rank
    all_posts = fetch_posts(regular_posts + celebrity_posts)
    return rank_posts(all_posts, user_id)[:20]
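The merge in step 3 can be sketched with heapq.merge, which lazily combines sorted streams without concatenating and re-sorting them. This assumes each source is already sorted newest-first and that posts are (created_at, post_id) tuples; the names are illustrative:

```python
import heapq
from itertools import islice

def merge_feeds(regular, celebrity, limit=20):
    """Merge two newest-first lists of (created_at, post_id) tuples.

    heapq.merge yields in ascending order, so timestamps are negated
    to produce a descending (newest-first) merge lazily.
    """
    streams = (((-ts, pid) for ts, pid in src) for src in (regular, celebrity))
    merged = heapq.merge(*streams)
    return [(-neg_ts, pid) for neg_ts, pid in islice(merged, limit)]
```

Lazy merging matters here because the celebrity stream may be fetched on demand; only enough items to fill one page are ever pulled.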
Feed Ranking
Chronological ordering gave way to ML ranking years ago: Facebook moved to ranked feeds (EdgeRank) around 2009, and Twitter introduced its algorithmic timeline in 2016. Ranking signals:
- Affinity: How closely does the user interact with the author? (replies, likes, DMs)
- Content weight: Video > images > links > text (based on engagement data)
- Recency: Decay function — newer posts score higher, but not purely chronological
- Predicted engagement: ML model estimates probability of like/share for this user-post pair
In an interview, a weighted formula suffices: score = affinity * 0.4 + content_weight * 0.2 + recency_decay * 0.4. A full ML model is not expected, but mention that real systems use gradient-boosted trees or neural ranking.
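The interview formula above can be sketched directly. The content weights and the one-hour half-life are assumptions for illustration, not values from any production system:

```python
# Illustrative only: weights and half-life are assumed, not production values.
CONTENT_WEIGHTS = {"video": 1.0, "image": 0.8, "link": 0.5, "text": 0.3}

def recency_decay(age_seconds, half_life_seconds=3600.0):
    # Exponential decay: a post loses half its recency score per half-life.
    return 0.5 ** (age_seconds / half_life_seconds)

def score(affinity, content_type, age_seconds):
    # score = affinity*0.4 + content_weight*0.2 + recency_decay*0.4
    return (0.4 * affinity
            + 0.2 * CONTENT_WEIGHTS.get(content_type, 0.3)
            + 0.4 * recency_decay(age_seconds))
```

Note that with exponential decay an old post from a high-affinity author can still outrank a brand-new post from a low-affinity one, which is the intended non-chronological behavior.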
Pagination
Avoid offset-based pagination (LIMIT 20 OFFSET 100): the database scans 120 rows to return 20, and the cost grows with page depth. Use cursor-based pagination: the client sends the created_at (plus post_id, to break ties) of the last seen post; the server fetches posts with created_at < that timestamp via an index seek, so deep pages cost the same as page one.
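The cursor scheme can be sketched over an in-memory list (a real implementation would do an index seek in the database, not a filter). The (created_at, post_id) compound cursor keeps pagination stable when two posts share a timestamp:

```python
def feed_page(posts, cursor=None, limit=20):
    """Cursor-based pagination over posts sorted newest-first.

    posts: dicts with 'created_at' and 'post_id', pre-sorted by
    (created_at DESC, post_id DESC).
    cursor: (created_at, post_id) of the last item of the previous
    page, or None for the first page.
    """
    if cursor is not None:
        # Keep only items strictly older than the cursor; ties on
        # created_at fall back to post_id for a stable ordering.
        posts = [p for p in posts
                 if (p["created_at"], p["post_id"]) < cursor]
    page = posts[:limit]
    next_cursor = ((page[-1]["created_at"], page[-1]["post_id"])
                   if page else None)
    return page, next_cursor
```

The client treats the cursor as opaque and echoes it back; page 500 and page 1 then cost the same on the server.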
Storage
- Posts: Cassandra — append-only writes, partition by author_id, sort by created_at DESC. Efficiently fetch all posts by an author ordered by time.
- Follow graph: Graph DB (Neo4j) or Cassandra (two tables: followers_of, following_of). For read performance, denormalize.
- Feed cache: Redis lists per user (max 1000 items). TTL = 7 days (inactive users’ caches expire).
- Post content: Images/videos → CDN (S3 + CloudFront). Store only the CDN URL in the Post record.
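The feed-cache behavior (LPUSH newest-first, LTRIM to 1000 entries) can be mimicked in memory with a bounded deque; this is a stand-in sketch for testing fanout logic without a Redis server, not the production cache:

```python
from collections import deque

class FeedCache:
    """In-memory stand-in for the per-user Redis feed lists.

    deque(maxlen=N) evicts the oldest entry automatically on append,
    mimicking LTRIM feed:{user_id} 0 N-1 after every LPUSH.
    """
    def __init__(self, max_items=1000):
        self.max_items = max_items
        self._feeds = {}

    def push(self, user_id, post_id):
        feed = self._feeds.setdefault(user_id, deque(maxlen=self.max_items))
        feed.appendleft(post_id)  # newest first, like LPUSH

    def range(self, user_id, start, stop):
        feed = self._feeds.get(user_id, deque())
        return list(feed)[start:stop + 1]  # inclusive stop, like LRANGE
```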
Scale Numbers
- 1B feed reads/day ≈ 11,600 reads/sec on average (well within a single Redis node's capacity; peak traffic is higher)
- Average user follows 300 people → pre-built feed write: 300 Redis writes/post
- Celebrity with 100M followers: 100M Redis writes/post → use fanout-on-read instead
- Celebrity threshold at Twitter: ~10K followers triggers pull model
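The arithmetic behind these numbers is worth being able to reproduce on a whiteboard; a quick sanity check:

```python
# Back-of-envelope check of the scale numbers above.
SECONDS_PER_DAY = 24 * 60 * 60          # 86,400

avg_reads_per_sec = 1_000_000_000 / SECONDS_PER_DAY   # ~11,574, i.e. ~11.6K
writes_per_regular_post = 300            # avg followee count -> Redis writes
writes_per_celebrity_post = 100_000_000  # why push fanout breaks for celebrities
amplification = writes_per_celebrity_post / writes_per_regular_post
```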
Interview Tips
- Start with the hybrid model — it demonstrates depth. Don’t just pick one fanout strategy.
- Mention the celebrity problem explicitly — it’s the key tradeoff interviewers want to hear.
- Feed ranking: even a simple formula shows you know real feeds aren’t chronological.
- Cursor-based pagination over offset: always mention this for feed-type problems.