Q: What are CRDTs and how do they differ from Operational Transformation?

CRDTs (Conflict-free Replicated Data Types) are data structures where concurrent operations always commute: applying them in any order produces the same result. For text, RGA (Replicated Growable Array) gives each character a globally unique ID (site_id, logical_timestamp). Operations reference characters by ID, not position. Since IDs are stable, Insert("a" after char_ID_42) and Delete(char_ID_17) can be applied in any order and produce the same document. Key differences from OT: (1) CRDTs require no central server for conflict resolution - they enable true peer-to-peer sync; (2) OT is simpler but requires a server to serialize operations; (3) CRDTs work offline - operations accumulate locally and sync when reconnected; (4) CRDTs use more memory (tombstoned deletes must be retained). Used by Figma, Notion, Linear. OT used by Google Docs.

Q: How do you implement cursor presence in a real-time collaborative editor?

Each client broadcasts cursor position every 500ms via WebSocket: {user_id, document_id, cursor_position, user_name, color}. Cursor position is an offset in the document (or character ID in CRDT systems). The server receives these, stores them in Redis with a 10-second TTL (key: presence:doc:{doc_id}:user:{user_id}), and broadcasts to all other clients in the same document session. When a remote edit shifts text, the client must adjust displayed cursor positions: if an insert happens before a cursor, shift the cursor right; if a delete happens before a cursor, shift left. This is the same transform logic as OT applied to cursor positions. For large documents with 100 concurrent editors, cursor presence generates 100 * 2 = 200 messages/second per document, well within WebSocket capacity.

Question 1

How does Operational Transformation resolve concurrent edits in Google Docs?

Accepted Answer

When two users edit simultaneously, their operations have conflicting positions. OT transforms each operation against concurrent operations to adjust positions. Example: "Hello World". User A inserts comma at position 5: Insert(5, ","). User B deletes "o" at position 4: Delete(4). Applied naively in different orders, these produce different results. OT transform(Delete(4), Insert(5,",")): since Delete(4) < Insert(5), shift Insert right: Insert(6, ","). Now applying both in either order produces the same result: "Hell, World". The server is the authority: it receives all operations, transforms each against the current document revision, applies them in order, and broadcasts transformed operations to other clients. Clients send their operation plus the revision they last saw, allowing the server to transform against any intervening operations.

Question 2

What are CRDTs and how do they differ from Operational Transformation?

Accepted Answer

CRDTs (Conflict-free Replicated Data Types) are data structures where concurrent operations always commute: applying them in any order produces the same result. For text, RGA (Replicated Growable Array) gives each character a globally unique ID (site_id, logical_timestamp). Operations reference characters by ID, not position. Since IDs are stable, Insert("a" after char_ID_42) and Delete(char_ID_17) can be applied in any order and produce the same document. Key differences from OT: (1) CRDTs require no central server for conflict resolution - they enable true peer-to-peer sync; (2) OT is simpler but requires a server to serialize operations; (3) CRDTs work offline - operations accumulate locally and sync when reconnected; (4) CRDTs use more memory (tombstoned deletes must be retained). Used by Figma, Notion, Linear. OT used by Google Docs.

Question 3

How do you implement cursor presence in a real-time collaborative editor?

Accepted Answer

Each client broadcasts cursor position every 500ms via WebSocket: {user_id, document_id, cursor_position, user_name, color}. Cursor position is an offset in the document (or character ID in CRDT systems). The server receives these, stores them in Redis with a 10-second TTL (key: presence:doc:{doc_id}:user:{user_id}), and broadcasts to all other clients in the same document session. When a remote edit shifts text, the client must adjust displayed cursor positions: if an insert happens before a cursor, shift the cursor right; if a delete happens before a cursor, shift left. This is the same transform logic as OT applied to cursor positions. For large documents with 100 concurrent editors, cursor presence generates 100 * 2 = 200 messages/second per document, well within WebSocket capacity.

System Design Interview: Design a Real-Time Collaborative Editor (Google Docs)

System Design Interview: Design a Real-Time Collaborative Editor (Google Docs)

Requirements

The Core Problem: Concurrent Edits

Operational Transformation (OT)

OT Server Architecture

CRDTs (Conflict-free Replicated Data Types)

Presence and Cursor Awareness

Document Storage

Interview Tips