Question 1

What is Operational Transformation and how does it resolve concurrent edits?

Accepted Answer

Operational Transformation (OT) is the family of algorithms behind Google Docs-style concurrent editing. Each edit is expressed as an operation: INSERT(position, text) or DELETE(position, length). When two users edit concurrently, both operations are generated against the same document state. Whichever operation reaches the server second can no longer be applied as written, because the first one has already shifted positions in the document. The server therefore transforms the incoming operation against the already-applied one to account for that shift.

Example: the document is "AC". User A inserts "B" at position 1: INSERT(1,"B"). User B concurrently inserts "X" at position 1: INSERT(1,"X"). The server receives A's operation first and applies it → "ABC". B's operation then arrives and is transformed against A's INSERT(1,"B"). Both inserts target position 1, so a tie-breaking rule is needed: by convention A wins and B shifts right, giving INSERT(2,"X"). Applying it → "ABXC". Meanwhile B's client, which already shows "AXC" from its local edit, receives A's operation and transforms it against its own operation using the same tie-breaking rule: A's insert stays at position 1, ahead of "X", which yields "ABXC". Both replicas converge.
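The worked example can be sketched in a few lines of Python. This is a minimal illustration for insert-only operations, not a production OT implementation: the `Insert` class, the `transform` function, and the use of a `site` id as the tie-breaker are assumptions made for the sketch (the answer above only says that A wins by convention).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insert:
    pos: int
    text: str
    site: str  # hypothetical client id, used only to break position ties


def apply(doc: str, op: Insert) -> str:
    """Apply an insert to a plain-text document."""
    return doc[:op.pos] + op.text + doc[op.pos:]


def transform(op: Insert, against: Insert) -> Insert:
    """Adjust `op` so it can be applied after `against` has been applied.

    `op` shifts right when `against` inserted earlier in the document, or at
    the same position while winning the tie-break (assumed: lower site id wins).
    """
    wins_tie = against.pos == op.pos and against.site < op.site
    if against.pos < op.pos or wins_tie:
        return Insert(op.pos + len(against.text), op.text, op.site)
    return op


a = Insert(1, "B", site="A")
b = Insert(1, "X", site="B")

# Server order: A's op first, then B's op transformed against A's.
server = apply(apply("AC", a), transform(b, a))

# Client B order: its own op first, then A's op transformed against B's.
client_b = apply(apply("AC", b), transform(a, b))

print(server, client_b)  # ABXC ABXC
```

Both replicas apply the two operations in different orders yet reach the same text, which is the convergence property the example describes. Handling DELETE and overlapping ranges needs additional transform cases that this sketch leaves out.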

Question 2

What is the difference between OT and CRDTs for collaborative editing?

Accepted Answer

Operational Transformation (OT): in its common deployments, relies on a central server to serialize concurrent operations and transform them. All clients connect to the server, and the server is the arbiter of operation order. Benefits: well understood for text editing, used by Google Docs. Drawbacks: requires server coordination; transformation functions become complex and error-prone for rich text (anything beyond plain text).

CRDTs (Conflict-free Replicated Data Types): each character gets a globally unique identifier at creation time. The identifiers encode ordering, so no transformation is needed: concurrent operations commute, and replicas that have applied the same set of operations, in any order, converge to the same result. No central coordinator is required; clients can exchange operations peer-to-peer. Used by: Automerge (a CRDT library), and often cited for Figma and Linear, although Figma describes its design as CRDT-inspired with a central server. Drawback: deleted characters are kept as tombstones, so the document's metadata grows over time and requires periodic compaction (garbage collection of deletions that every replica has acknowledged).

For a software engineering interview: OT is the right answer when a central server is acceptable; CRDTs are the right answer when peer-to-peer or offline-first operation is required.
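To make the "unique identifier per character" idea concrete, here is a deliberately simplified Python sketch. It assumes a fractional-position identifier of the form (position, site id) and a tombstone set. The class `SequenceCRDT` and the helper `between` are hypothetical names, and real sequence CRDTs (such as those in Automerge) use more elaborate identifier schemes.

```python
from fractions import Fraction
from itertools import permutations


class SequenceCRDT:
    """Toy sequence CRDT: characters are ordered by their unique ids."""

    def __init__(self):
        self.chars = {}          # id -> character
        self.tombstones = set()  # ids of deleted characters, kept as metadata

    def apply(self, op):
        kind, ident, char = op
        if kind == "ins":
            self.chars[ident] = char
        elif kind == "del":
            # Recorded even if the matching insert has not arrived yet.
            self.tombstones.add(ident)

    def text(self):
        ordered = sorted(self.chars.items())
        return "".join(ch for ident, ch in ordered if ident not in self.tombstones)


def between(left, right, site):
    """Hypothetical id generator: midpoint position, site id breaks ties."""
    return ((left[0] + right[0]) / 2, site)


a_id = (Fraction(1, 3), "init")
c_id = (Fraction(2, 3), "init")
ops = [
    ("ins", a_id, "A"),
    ("ins", c_id, "C"),
    ("ins", between(a_id, c_id, "siteA"), "B"),  # user A inserts between A and C
    ("ins", between(a_id, c_id, "siteB"), "X"),  # user B inserts at the same spot
    ("del", c_id, None),                         # someone deletes "C"
]

# Apply the same operations in every possible order on fresh replicas.
results = set()
for order in permutations(ops):
    replica = SequenceCRDT()
    for op in order:
        replica.apply(op)
    results.add(replica.text())

print(results)  # {'ABX'}
```

Every delivery order produces the same text without any transformation step, at the cost of carrying an identifier per character and keeping the tombstone for "C" around until it can be compacted.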

Question 3

How does version history work in a collaborative document editor?

Accepted Answer

Version history is a consequence of the append-only operation log architecture. Every edit is an immutable operation appended to a log, so the log is the complete history. To view the document at any past revision, replay operations from the beginning up to that revision. To avoid O(N) replay time for documents with long histories, take periodic snapshots: serialize the full document state at a fixed interval, for example every 1000 operations. On load, find the most recent snapshot at or before the target revision, then apply only the operations between that snapshot and the target.

The operation log also enables: (1) undo/redo: undo is a new inverse operation appended to the log (not a log rollback), which keeps a clean audit trail; (2) named versions ("Version before review"): tag specific revision numbers; (3) blame/diff: compute the diff between any two revisions from the operations recorded between them.

Storage: keep snapshots in blob storage (S3) and the operation log in an append-optimized database (Cassandra with time-series partitioning, or PostgreSQL with range partitioning by revision number).
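The snapshot-plus-replay scheme and undo-as-inverse-operation can be sketched as follows. This is an in-memory illustration under stated assumptions: the `DocumentHistory` class and its methods are hypothetical, the snapshot interval is shrunk to 3 so the demo stays short, and delete operations store the removed text (rather than only a length) so that they can be inverted.

```python
SNAPSHOT_EVERY = 3  # demo value; the interval discussed above is about 1000


def apply_op(doc, op):
    kind, pos, text = op
    if kind == "ins":
        return doc[:pos] + text + doc[pos:]
    return doc[:pos] + doc[pos + len(text):]  # "del" removes len(text) chars


def invert(op):
    kind, pos, text = op
    return ("del" if kind == "ins" else "ins", pos, text)


class DocumentHistory:
    def __init__(self):
        self.log = []             # log[i] produces revision i + 1
        self.snapshots = {0: ""}  # revision -> full document text

    def append(self, op):
        self.log.append(op)
        rev = len(self.log)
        if rev % SNAPSHOT_EVERY == 0:
            self.snapshots[rev] = self.at(rev)
        return rev

    def at(self, rev):
        """Rebuild revision `rev` from the nearest snapshot at or before it."""
        base = max(r for r in self.snapshots if r <= rev)
        doc = self.snapshots[base]
        for op in self.log[base:rev]:
            doc = apply_op(doc, op)
        return doc

    def undo_last(self):
        """Undo by appending the inverse of the latest op; nothing is erased."""
        return self.append(invert(self.log[-1]))


h = DocumentHistory()
h.append(("ins", 0, "AC"))  # rev 1: "AC"
h.append(("ins", 1, "B"))   # rev 2: "ABC"
h.append(("ins", 3, "D"))   # rev 3: "ABCD" (snapshot taken)
h.append(("del", 0, "A"))   # rev 4: "BCD"
h.undo_last()               # rev 5: "ABCD"

print(h.at(2), h.at(4), h.at(5))  # ABC BCD ABCD
```

Reading revision 4 replays a single operation on top of the revision 3 snapshot instead of the whole log. Undoing an operation other than the latest one would additionally require transforming the inverse against the later operations, which this sketch does not attempt.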

System Design Interview: Design Google Docs / Collaborative Editing

What Is Collaborative Document Editing?

The Concurrency Problem

Operational Transformation (OT)

CRDTs (Conflict-free Replicated Data Types)

System Architecture (OT-based, Google Docs style)

Offline Editing and Sync

Persistent Storage

Interview Tips