What Is a Multi-Tenant Architecture?
Multi-tenancy means a single instance of a software application serves multiple customers (tenants), with each tenant’s data isolated from others. All major SaaS products — Salesforce, Slack, GitHub, Datadog — are multi-tenant. The central design challenge is balancing isolation (tenant A cannot see tenant B’s data), operational efficiency (shared infrastructure reduces cost), and customization (enterprise tenants want their own configurations, SLAs, and sometimes their own databases).
Isolation Models
Silo (Database per Tenant)
Each tenant gets a dedicated database instance. Total data isolation — no risk of cross-tenant data leakage from a query bug. Easier compliance (PCI, HIPAA: data never co-located with other customers’ data). Enables tenant-specific database configurations (indexes, extensions). Downsides: high infrastructure cost (1,000 tenants = 1,000 database instances), complex schema migrations (must run against every tenant database), and long onboarding time (spinning up a database takes minutes).
When to use: large enterprise customers paying $100K+/year who require data isolation contractually; regulated industries with strict data residency requirements.
Pool (Shared Database, Shared Schema)
All tenants share one database with a tenant_id column in every table. Every query must include WHERE tenant_id = :current_tenant — enforced at the application layer or via Row-Level Security (RLS) in PostgreSQL. Maximum resource efficiency — all tenants share indexes, connections, and compute. Schema migrations run once. Noisy neighbor risk: a large tenant running heavy queries degrades performance for others.
When to use: SMB/self-serve customers; early-stage SaaS where cost matters; when tenant datasets are small and similar in size.
Bridge (Shared Database, Schema per Tenant)
All tenants share a database instance, but each gets their own schema (tenant_abc.users, tenant_xyz.users). Better isolation than the pool model (no tenant_id column needed; a misconfigured query cannot read another tenant’s data). Schema migrations must be applied to every tenant schema, but tools like Flyway can manage this. PostgreSQL supports thousands of schemas per database.
When to use: mid-market customers requiring schema isolation without the cost of dedicated databases.
Row-Level Security (RLS) in PostgreSQL
RLS enforces tenant isolation at the database level, not the application layer. Even if a bug in application code omits the WHERE tenant_id filter, the database enforces it.
-- Enable RLS on a table
ALTER TABLE orders ENABLE ROW LEVEL SECURITY;
-- Policy: users can only see rows where tenant_id matches their session variable
CREATE POLICY tenant_isolation ON orders
USING (tenant_id = current_setting('app.current_tenant_id')::uuid);
-- Application sets the tenant context before each query:
SET app.current_tenant_id = 'tenant-uuid-here';
SELECT * FROM orders; -- automatically filtered to current tenant
RLS has a small performance cost (policy evaluation on each row — keep tenant_id indexed) and requires setting the tenant variable on each connection or transaction. With a connection pooler such as PgBouncer in transaction pooling mode, use SET LOCAL (or set_config(..., true)) inside the transaction: a plain session-level SET can leak to the next client that reuses the pooled connection. Note also that table owners and superusers bypass RLS by default; use ALTER TABLE ... FORCE ROW LEVEL SECURITY if the application connects as the table owner.
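The transaction-scoped pattern can be sketched as a small helper — a minimal sketch assuming a psycopg-style cursor API; the stub classes exist only so the pattern can be exercised without a live database:

```python
from contextlib import contextmanager

@contextmanager
def tenant_transaction(conn, tenant_id):
    """Run statements in a transaction scoped to one tenant.

    set_config(..., true) is the parameterizable equivalent of SET LOCAL:
    it reverts at COMMIT/ROLLBACK, so a pooled connection (PgBouncer in
    transaction mode) cannot leak the tenant id to the next client.
    """
    cur = conn.cursor()
    try:
        cur.execute("BEGIN")
        cur.execute(
            "SELECT set_config('app.current_tenant_id', %s, true)",
            (tenant_id,),
        )
        yield cur
        cur.execute("COMMIT")
    except Exception:
        cur.execute("ROLLBACK")
        raise

# Stub cursor/connection so the pattern can be demonstrated offline;
# in production, conn would be a real psycopg connection.
class RecordingCursor:
    def __init__(self):
        self.statements = []
    def execute(self, sql, params=None):
        self.statements.append((sql, params))

class StubConn:
    def __init__(self):
        self._cur = RecordingCursor()
    def cursor(self):
        return self._cur
```

Every query issued inside the `with tenant_transaction(...)` block is then filtered by the RLS policy to the tenant that was set.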
Tenant Onboarding
A new tenant should be active within seconds (for self-serve) or hours (for enterprise). Self-serve flow:
- User submits email + plan — a Tenant Creation Service writes a tenant record to the tenants table (id, slug, plan, created_at, status=provisioning)
- For pool model: no additional database work needed — schema already exists
- For bridge model: CREATE SCHEMA tenant_{id}; run migration scripts against new schema
- For silo model: provision a new RDS/CloudSQL instance asynchronously; store connection string in tenant config; mark status=active when ready
- Send welcome email; redirect to onboarding flow
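The flow above can be sketched as a dispatcher keyed on the isolation model — a hedged sketch in which the tenant directory is an in-memory dict and `run_sql` / `enqueue_provisioning_job` are hypothetical placeholders for a database call and a job queue:

```python
import uuid
from datetime import datetime, timezone

# Hypothetical in-memory tenant directory; a real service would insert
# into the tenants table and enqueue async provisioning jobs.
TENANTS = {}

def run_sql(sql):
    pass  # placeholder: execute DDL against the shared database

def enqueue_provisioning_job(tenant_id):
    pass  # placeholder: async worker provisions an RDS/CloudSQL instance

def create_tenant(slug, plan, model):
    """Self-serve onboarding: record the tenant, then provision per model."""
    tenant = {
        "id": str(uuid.uuid4()),
        "slug": slug,
        "plan": plan,
        "status": "provisioning",
        "created_at": datetime.now(timezone.utc).isoformat(),
    }
    TENANTS[tenant["id"]] = tenant

    if model == "pool":
        tenant["status"] = "active"  # shared schema already exists
    elif model == "bridge":
        run_sql(f'CREATE SCHEMA "tenant_{tenant["id"]}"')  # then run migrations
        tenant["status"] = "active"
    elif model == "silo":
        enqueue_provisioning_job(tenant["id"])  # worker flips status later
    return tenant
```

Pool and bridge tenants become active synchronously; silo tenants stay in status=provisioning until the async worker finishes.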
The Noisy Neighbor Problem
In the pool model, one tenant running a large report query or bulk import can consume CPU and I/O that degrades performance for thousands of other tenants. Solutions:
- Query timeouts: per-tenant statement timeouts (SET statement_timeout = '5s') kill runaway queries
- Connection limits: allocate a maximum number of database connections per tenant; excess requests queue or are rejected with a 429
- Rate limiting at the API layer: limit requests per tenant per second; large tenants get higher limits based on their plan
- Read replicas for reporting: route heavy analytical queries to a read replica so OLAP load does not compete with OLTP
- Tenant tiering: automatically detect high-usage tenants and migrate them to dedicated resources (silo tier) — often triggered by SLA thresholds
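The rate-limiting layer can be illustrated with a per-tenant token bucket — an in-memory sketch (production systems typically keep the bucket in Redis so all API nodes share state); the plan limits here are illustrative:

```python
import time

class TenantRateLimiter:
    """Per-tenant token bucket: rate = tokens refilled per second,
    burst = bucket capacity. Plan-based limits give paying tenants
    more headroom."""

    PLAN_LIMITS = {"free": (10, 20), "pro": (100, 200), "enterprise": (1000, 2000)}

    def __init__(self, now=time.monotonic):
        self.now = now          # injectable clock for testing
        self.buckets = {}       # tenant_id -> (tokens, last_refill)

    def allow(self, tenant_id, plan="free"):
        rate, burst = self.PLAN_LIMITS[plan]
        tokens, last = self.buckets.get(tenant_id, (burst, self.now()))
        t = self.now()
        tokens = min(burst, tokens + (t - last) * rate)  # refill since last call
        if tokens >= 1:
            self.buckets[tenant_id] = (tokens - 1, t)
            return True         # proceed with the request
        self.buckets[tenant_id] = (tokens, t)
        return False            # respond 429 Too Many Requests
```

Each tenant gets an independent bucket, so one tenant exhausting its quota has no effect on others — exactly the noisy-neighbor containment the API layer is responsible for.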
Data Model: Tenant-Aware Schema Design
-- Tenant directory (global)
CREATE TABLE tenants (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
slug TEXT UNIQUE NOT NULL,
plan TEXT NOT NULL, -- free/pro/enterprise
db_shard TEXT, -- shard id for pool; connection string for silo
created_at TIMESTAMPTZ DEFAULT now()
);
-- Pool model: all tenant data tables include tenant_id
CREATE TABLE users (
id UUID DEFAULT gen_random_uuid(),
tenant_id UUID NOT NULL REFERENCES tenants(id),
email TEXT NOT NULL,
created_at TIMESTAMPTZ DEFAULT now(),
PRIMARY KEY (tenant_id, id) -- tenant_id first for index locality
);
-- Composite primary key with tenant_id as prefix ensures all rows for a
-- tenant are co-located on disk (PostgreSQL heap) and in B-tree pages.
Multi-Region Tenants
Enterprise customers in the EU may require data to stay in the EU (GDPR). Architecture: a global control plane (tenant directory, authentication) runs in a primary region. Each geographic region has its own data plane (application servers + databases) that hosts tenant data. The routing layer (a GeoDNS-aware API gateway or the control plane) directs each tenant’s requests to their designated region. Tenant metadata in the control plane stores: tenant_id → region. Cross-region reads are never needed for tenant data — each region is self-contained. The control plane is replicated globally but contains no PII.
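The routing layer reduces to a control-plane lookup — a minimal sketch in which the tenant directory and regional endpoints are illustrative dicts (names and URLs are hypothetical):

```python
# Regional data-plane endpoints (illustrative).
REGION_ENDPOINTS = {
    "eu-west-1": "https://eu.api.example.com",
    "us-east-1": "https://us.api.example.com",
}

# Stored in the globally replicated control plane: tenant_id -> home region.
# Contains no PII, so it is safe to replicate everywhere.
TENANT_REGION = {
    "tenant-acme": "eu-west-1",
    "tenant-globex": "us-east-1",
}

def route_request(tenant_id):
    """Return the data-plane endpoint for a tenant's home region."""
    region = TENANT_REGION.get(tenant_id)
    if region is None:
        raise LookupError(f"unknown tenant: {tenant_id}")
    return REGION_ENDPOINTS[region]
```

Because every tenant maps to exactly one region, the gateway never fans out across regions; data residency is enforced by the mapping itself.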
Schema Migrations Across All Tenants
For silo and bridge models, each schema migration must run against every tenant. At 10,000 tenants, running ALTER TABLE against each schema serially would take hours. Strategies:
- Expand-contract pattern: add a nullable column (a metadata-only change, so non-blocking), backfill data asynchronously in batches, then add the NOT NULL constraint once every row is populated (PostgreSQL still scans the table to validate, so schedule it off-peak or use a NOT VALID check constraint validated separately). Never block production with a long-running ALTER.
- Parallel migration runner: a migration job fetches the list of tenant schemas, distributes them across a worker pool, and applies migrations in parallel. Track completion per tenant — retry failed schemas.
- Blue-green schema: for large changes, deploy the new schema alongside the old; route new tenant onboardings to the new schema; migrate existing tenants in waves.
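The parallel migration runner can be sketched with a thread pool — a hedged sketch that assumes `apply_migration(schema)` is idempotent (a retried schema cannot be double-applied); in production, completion tracking would live in a migrations table rather than return values:

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def migrate_all_tenants(schemas, apply_migration, workers=20, retries=2):
    """Apply one migration to every tenant schema in parallel.

    Returns (done, failed) so a follow-up run can target only the
    failures instead of re-running the whole fleet.
    """
    done, failed = [], []

    def run(schema):
        for attempt in range(retries + 1):
            try:
                apply_migration(schema)
                return
            except Exception:
                if attempt == retries:
                    raise  # exhausted retries; surface the failure

    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {pool.submit(run, s): s for s in schemas}
        for fut in as_completed(futures):
            schema = futures[fut]
            try:
                fut.result()
                done.append(schema)
            except Exception:
                failed.append(schema)
    return done, failed
```

With 20 workers, 10,000 schemas that take ~1s of DDL each finish in minutes instead of hours, and the failed list gives the retry set.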
Key Interview Points
- Match isolation model to tenant size: pool for SMB, bridge for mid-market, silo for enterprise
- Enforce isolation at the database layer (RLS) as a safety net — not only at the application layer
- Noisy neighbor: apply per-tenant rate limiting, connection limits, and query timeouts
- Onboarding must be automated and fast; enterprise silo provisioning is async
- Use expand-contract for schema migrations to avoid downtime at any tenant count
- Multi-region: route tenant data to a fixed region for data residency compliance
Frequently Asked Questions
What is the noisy neighbor problem in multi-tenant systems and how do you solve it?
The noisy neighbor problem occurs when one tenant's workload consumes a disproportionate share of shared resources (CPU, database connections, I/O bandwidth), degrading performance for other tenants on the same infrastructure. Common triggers: a large tenant running a bulk data export, an unoptimized report query that scans millions of rows, or a spike in their user traffic. Solutions at multiple layers: (1) API rate limiting: enforce per-tenant request quotas (e.g., 1000 req/min for free plan, 10000 for pro) using a Redis token bucket or sliding window counter. Return 429 Too Many Requests when exceeded. (2) Database query timeouts: set statement_timeout = '5s' per tenant connection to kill runaway queries automatically. (3) Connection pool limits: allocate a maximum number of database connections per tenant via PgBouncer pool_size per database. (4) Compute quotas: in Kubernetes, set resource requests/limits per tenant namespace or pod. (5) Read replica offloading: route analytical and report queries to a read replica so OLAP load does not compete with OLTP on the primary. (6) Tenant tiering: proactively detect high-usage tenants and migrate them to dedicated infrastructure.
How do you implement tenant isolation at the database level?
Application-layer isolation (always include WHERE tenant_id = :id in queries) is necessary but insufficient — a single missing WHERE clause can expose all tenants' data. Database-level enforcement provides a safety net. PostgreSQL Row-Level Security (RLS): ALTER TABLE orders ENABLE ROW LEVEL SECURITY; followed by CREATE POLICY tenant_isolation ON orders USING (tenant_id = current_setting('app.current_tenant_id')::uuid). Before each query, the application sets app.current_tenant_id for the session or transaction. Even if the application forgets the WHERE clause, the database policy silently filters to the current tenant's rows. RLS policies are invisible to the application — they behave like implicit WHERE clauses. Performance: RLS adds a small overhead per row evaluated; ensure tenant_id is indexed. For schema-per-tenant (bridge model), each tenant has their own PostgreSQL schema, so rows from different tenants never share a table — no tenant_id column needed. Isolation then depends on the connection manager setting search_path = tenant_schema_name correctly before each request, so that code path must be strictly enforced and audited.
How do you handle database migrations across thousands of tenant schemas?
Running ALTER TABLE statements against thousands of tenant schemas serially would take hours and risk partial failures leaving schemas in inconsistent states. The expand-contract (or parallel change) pattern minimizes disruption: (1) Expand: add the new column as nullable with no default (ALTER TABLE ... ADD COLUMN new_col type) — a metadata-only change in PostgreSQL, and since PostgreSQL 11 even a constant default is applied instantly. (2) Backfill: run a background job that batches UPDATE statements across tenants, updating rows with the new value. Run in small batches (e.g., 1,000 rows) with delays to avoid I/O spikes. (3) Switch application code to write both old and new columns. (4) Contract: once all rows are backfilled and code is deployed, add the NOT NULL constraint (PostgreSQL scans to validate — fast relative to a rewrite, but it takes a lock, so schedule it off-peak) and drop the old column. For the parallel schema runner: fetch all tenant schema names, distribute across a worker pool (e.g., 20 concurrent workers), track completion per schema in a migrations_log table, and retry failed schemas. Use schema versioning — track the current migration version per schema and only apply pending migrations.
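The backfill step can be sketched as a batched UPDATE loop — SQLite is used here only to make the sketch runnable; the same loop shape applies to PostgreSQL (add an app-side delay between batches in production), and the table/column names are illustrative:

```python
import sqlite3

def backfill_in_batches(conn, batch_size=1000):
    """Expand-contract step 2: populate the new column in small batches
    so no single UPDATE holds locks or spikes I/O for long."""
    total = 0
    while True:
        cur = conn.execute(
            """UPDATE users SET display_email = lower(email)
               WHERE id IN (SELECT id FROM users
                            WHERE display_email IS NULL LIMIT ?)""",
            (batch_size,),
        )
        conn.commit()  # release locks between batches
        if cur.rowcount == 0:
            return total  # nothing left to backfill
        total += cur.rowcount
```

The loop is idempotent — it only touches rows where the new column is still NULL, so a crashed backfill job can simply be restarted.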