Question 1

How does content versioning work in a CMS?

Accepted Answer

Content versioning stores every save as an immutable snapshot. Schema: content table (the current metadata) + content_versions table (field_values JSON per version). On each save: INSERT a new content_version row with the current field values and increment the version_number. Never update or delete existing versions -- they are the immutable audit trail. The current published version is referenced by content.published_version_id. Restoring an old version: create a new version with the same field_values as the target old version (new row, new version_number). This preserves the restore event in the history. Auto-save: save a draft version every 60 seconds while the editor is typing, but only show named (user-triggered) saves in the version history UI to avoid cluttering it with hundreds of auto-save entries.

Question 2

How do you implement a draft-to-publish workflow with approvals?

Accepted Answer

Workflow states: DRAFT -> IN_REVIEW -> APPROVED -> PUBLISHED (or REJECTED -> DRAFT). Role actions: AUTHOR creates a draft, submits for review (transition to IN_REVIEW, assigns to an EDITOR). EDITOR reviews: can approve (transition to APPROVED) or reject with comments (transition back to DRAFT). ADMIN or EDITOR publishes the approved content. Notifications: on state transition, notify the relevant user (EDITOR notified when review is requested; AUTHOR notified when approved or rejected). Workflow history: store each transition (from_state, to_state, actor_id, comment, timestamp) for audit purposes. Multi-step workflows: some enterprises require two approvals (junior editor + senior editor). Model as a workflow_steps table with ordered steps and separate approval tracking per step.

Question 3

How do you store and render rich text content safely?

Accepted Answer

Never store user-provided HTML directly -- XSS risk if the HTML is rendered without sanitization. Instead: (1) Store content as structured JSON (ProseMirror, Slate, or Lexical document model). The JSON represents the document tree (nodes: paragraph, heading, bold text, links). (2) On rendering: convert the JSON to HTML using a trusted renderer (server-side or client-side library). The renderer never allows arbitrary HTML injection -- it only renders the node types defined in the schema. (3) If raw HTML must be accepted (legacy data): sanitize with a library (DOMPurify for client-side, html-sanitizer for server-side) before storage. Allowlist: permit only safe tags and attributes (p, h1-h6, strong, em, a[href], img[src, alt]). Strip: script, iframe, on* event handlers, javascript: URLs. Storing structured JSON gives the best security and extensibility.

Question 4

How do you implement scheduled content publishing?

Accepted Answer

Scheduled publishing sets a future publish time. On schedule: set content.status = SCHEDULED, content.scheduled_at = future_datetime. A scheduler job (cron or Airflow DAG) runs every minute: SELECT content WHERE status='SCHEDULED' AND scheduled_at

Question 5

How do you handle multi-site or multi-language publishing?

Accepted Answer

Multi-site: one CMS instance serves multiple sites (e.g., US, UK, EU brands). Content may be shared or site-specific. Model: site table + content_site_mapping (content_id, site_id, site_specific_overrides JSON). On publish: specify which sites to publish to. Each site gets its own CDN path (/us/, /uk/). Shared content: base content in the main content record. Site-specific overrides stored in content_site_mapping. Multi-language (i18n): content has a default_locale and translations. Translation table: (content_id, locale, field_values JSON). Example: content_id=42 has an English version (field_values) and a French translation (field_values with French text). Translation workflow: content is created in the default locale, sent to translators (via a translation management system integration or manual assignment), approved translations are published per locale. URL structure: /en/article-slug, /fr/article-slug.

Low-Level Design: Content Management System — Drafts, Versioning, Roles, and Publishing Workflow

Core Entities

Content Versioning

Role-Based Permissions

Rich Text Storage

Publishing Workflow and Scheduling

Media Management