Low-Level Design: Social Media Post Scheduler — Scheduling, Multi-Platform Publishing, and Analytics

Core Entities

Post: post_id, user_id, content, media_urls[], platforms[] (TWITTER, INSTAGRAM, LINKEDIN, FACEBOOK), scheduled_at (UTC), status (DRAFT, SCHEDULED, PUBLISHING, PUBLISHED, FAILED), created_at. PostPlatformResult: result_id, post_id, platform, external_post_id, status (SUCCESS, FAILED), error_message, published_at. PublishingAccount: account_id, user_id, platform, credentials (encrypted OAuth tokens), is_active. Analytics: post_id, platform, likes, shares, comments, impressions, fetched_at.

Scheduling Architecture

Posts are stored with a scheduled_at timestamp. A scheduler worker runs every minute and queries: SELECT * FROM posts WHERE status = 'SCHEDULED' AND scheduled_at <= NOW() AND scheduled_at > NOW() - INTERVAL '5 minutes' FOR UPDATE SKIP LOCKED LIMIT 50 (the lower bound caps how far back a single sweep reaches). FOR UPDATE SKIP LOCKED allows multiple workers to run concurrently without processing the same post. Once a post is selected: set its status to PUBLISHING (preventing re-pickup), then fan out to each target platform in parallel.
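The claim step can be sketched without a PostgreSQL instance. This runnable sketch uses SQLite, which lacks FOR UPDATE SKIP LOCKED, so it emulates the same exactly-once pickup with a per-row conditional UPDATE: the `AND status = 'SCHEDULED'` guard means a row already flipped to PUBLISHING by another worker is silently skipped. Table and column names follow the design above:

```python
import sqlite3


def claim_due_posts(conn, now, limit=50):
    """Claim due SCHEDULED posts by flipping them to PUBLISHING; return ids.

    In PostgreSQL this would be SELECT ... FOR UPDATE SKIP LOCKED plus the
    status update in one transaction; here the status check in the UPDATE's
    WHERE clause gives the same at-most-once pickup per row.
    """
    cur = conn.execute(
        "SELECT post_id FROM posts "
        "WHERE status = 'SCHEDULED' AND scheduled_at <= ? LIMIT ?",
        (now, limit))
    candidates = [r[0] for r in cur.fetchall()]
    claimed = []
    for pid in candidates:
        cur = conn.execute(
            "UPDATE posts SET status = 'PUBLISHING' "
            "WHERE post_id = ? AND status = 'SCHEDULED'",
            (pid,))
        if cur.rowcount == 1:       # we won the race for this row
            claimed.append(pid)
    conn.commit()
    return claimed


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE posts (post_id TEXT, status TEXT, scheduled_at TEXT)")
conn.executemany(
    "INSERT INTO posts VALUES (?, ?, ?)",
    [("p1", "SCHEDULED", "2025-01-01T10:00"),
     ("p2", "SCHEDULED", "2025-01-01T12:00"),   # not due yet
     ("p3", "DRAFT",     "2025-01-01T09:00")])  # not scheduled
claimed = claim_due_posts(conn, "2025-01-01T10:30")
```

A second call with the same clock returns nothing: p1 is now PUBLISHING, so it cannot be picked up again.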

Multi-Platform Publishing

from concurrent.futures import ThreadPoolExecutor, as_completed

class PublishingService:
    PLATFORM_ADAPTERS = {
        Platform.TWITTER:   TwitterAdapter,
        Platform.INSTAGRAM: InstagramAdapter,
        Platform.LINKEDIN:  LinkedInAdapter,
        Platform.FACEBOOK:  FacebookAdapter,
    }

    def publish(self, post: Post):
        # Mark as PUBLISHING so the scheduler cannot pick the post up again.
        self.db.update_status(post.id, Status.PUBLISHING)
        results = []
        # One thread per target platform; platform API calls are I/O-bound.
        with ThreadPoolExecutor(max_workers=len(post.platforms)) as ex:
            futures = {
                ex.submit(self.publish_to_platform,
                          post, platform): platform
                for platform in post.platforms
            }
            for future in as_completed(futures):
                results.append(future.result())
        # PUBLISHED only if every platform succeeded; per-platform
        # outcomes are preserved in PostPlatformResult either way.
        overall = (Status.PUBLISHED
                   if all(r.success for r in results)
                   else Status.FAILED)
        self.db.update_status(post.id, overall)
        self.db.save_results(post.id, results)

    def publish_to_platform(self, post: Post, platform: Platform):
        adapter = self.PLATFORM_ADAPTERS[platform]()
        credentials = self.db.get_credentials(post.user_id, platform)
        return adapter.publish(post, credentials)

Platform Adapters

Each platform has a different API: Twitter uses OAuth 1.0a with tweet creation endpoint; Instagram uses Graph API with media object creation (separate upload + publish steps); LinkedIn uses OAuth 2.0 with UGC posts API. Each adapter handles: OAuth token refresh (tokens expire; refresh before each publish), API rate limits (Twitter: 300 tweets/15min, LinkedIn: 100 posts/day), retry on transient errors (5xx responses, network timeouts), and platform-specific content constraints (Twitter: 280 chars, LinkedIn: 3000 chars). Adapters are stateless — credentials are fetched from the PublishingAccount table on each call.
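The adapter contract can be sketched as a small base class. This is a hypothetical shape (the base-class and method names are illustrative, not from the source); it shows the two properties the text requires — statelessness (credentials arrive as an argument, nothing is cached) and per-platform content validation using the character limits listed above:

```python
# Character limits quoted in the text above.
CHAR_LIMITS = {"TWITTER": 280, "LINKEDIN": 3000,
               "INSTAGRAM": 2200, "FACEBOOK": 63206}


class PlatformAdapter:
    """Stateless adapter: credentials are passed in per call, never stored."""
    platform: str = ""

    def validate(self, content: str) -> list[str]:
        """Return a list of constraint violations (empty list = valid)."""
        limit = CHAR_LIMITS[self.platform]
        warnings = []
        if len(content) > limit:
            warnings.append(
                f"{self.platform}: {len(content)} chars exceeds limit {limit}")
        return warnings

    def publish(self, post, credentials):
        raise NotImplementedError   # each platform implements its own API call


class TwitterAdapter(PlatformAdapter):
    platform = "TWITTER"


class LinkedInAdapter(PlatformAdapter):
    platform = "LINKEDIN"
```

A real adapter would add token refresh, rate-limit handling, and retries around `publish`; the validation path alone is enough to show why the same content can pass for LinkedIn and fail for Twitter.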

Failure Handling and Retry

Platform failures can be transient (rate limit, network error) or permanent (invalid token, content policy violation). Retry strategy: exponential backoff for transient errors (retry after 1min, 5min, 15min). Give up after 3 retries and mark the platform result as FAILED. Notify the user with the failure reason. Store the error_message on PostPlatformResult for debugging. Re-authentication: if a token is expired or revoked, prompt the user to re-connect the platform account. Do not delete scheduled posts on auth failure — allow the user to reconnect and retry.
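The retry policy above — back off 1, 5, then 15 minutes for transient errors, fail fast on permanent ones, give up after three retries — can be expressed as a small pure function. A sketch, with the exception class names as illustrative stand-ins for whatever the adapters raise:

```python
from datetime import timedelta


class TransientError(Exception):
    """Rate limit, network timeout, 5xx response: worth retrying."""


class PermanentError(Exception):
    """Invalid token, content policy violation: retrying cannot help."""


BACKOFF = [timedelta(minutes=1), timedelta(minutes=5), timedelta(minutes=15)]
MAX_RETRIES = 3


def next_action(attempt: int, error: Exception):
    """Decide the follow-up after a failed publish attempt (attempt is 0-based).

    Returns ("RETRY", delay) or ("FAILED", None).
    """
    if isinstance(error, PermanentError):
        return ("FAILED", None)            # e.g. revoked token: notify user
    if attempt >= MAX_RETRIES:
        return ("FAILED", None)            # transient, but retries exhausted
    return ("RETRY", BACKOFF[attempt])     # transient: back off and try again
```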

Analytics Collection

After publishing, schedule analytics fetch jobs at: 1 hour, 6 hours, 24 hours, 7 days post-publication. Each job calls the platform API for the external_post_id and stores likes, shares, comments, impressions in the Analytics table. Compare metrics across platforms for the same post (which platform got the most engagement?). Aggregate per user: total reach this week, best-performing platform, optimal posting time (time of day with highest average engagement from historical data). Platform API rate limits apply to analytics fetching too — batch requests where possible (Twitter supports 100 tweets per lookup).
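Two pieces of the paragraph above are mechanical enough to sketch: computing the fetch-job times from the publish time, and batching external ids into lookup chunks (100 per request, per the Twitter limit quoted above):

```python
from datetime import datetime, timedelta

# Fetch offsets from the text: 1h, 6h, 24h, 7 days post-publication.
FETCH_OFFSETS = [timedelta(hours=1), timedelta(hours=6),
                 timedelta(hours=24), timedelta(days=7)]


def analytics_fetch_times(published_at: datetime) -> list[datetime]:
    """Return the UTC times at which analytics jobs should run for a post."""
    return [published_at + off for off in FETCH_OFFSETS]


def chunk_ids(ids: list, size: int = 100) -> list[list]:
    """Split external post ids into batches for bulk lookup endpoints."""
    return [ids[i:i + size] for i in range(0, len(ids), size)]
```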

Timezone and Optimal Time Recommendations

Users schedule in their local timezone; store UTC in the database. Convert scheduled_at from the user's timezone to UTC on input; convert back when displaying. Best-time recommendations: analyze the user's historical post data — for each hour of the week (7 days * 24 hours = 168 buckets), compute the average engagement rate, then recommend the top 3 time slots. This is personalized per user and per platform (an Instagram audience and a LinkedIn audience have different peak times).
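Both halves of this section can be sketched with the standard library: zoneinfo for the local-to-UTC conversion, and a dictionary keyed by (day_of_week, hour) for the 168-bucket engagement aggregation. A minimal sketch (function names are illustrative):

```python
from collections import defaultdict
from datetime import datetime
from zoneinfo import ZoneInfo

UTC = ZoneInfo("UTC")


def to_utc(local_dt: datetime, tz_name: str) -> datetime:
    """Interpret a naive user-entered time in the user's timezone; store UTC."""
    return local_dt.replace(tzinfo=ZoneInfo(tz_name)).astimezone(UTC)


def engagement_buckets(posts):
    """posts: iterable of (published_at_local, engagement_rate) pairs.

    Returns average engagement per (day_of_week, hour) bucket -
    up to 7 * 24 = 168 buckets per user per platform.
    """
    sums, counts = defaultdict(float), defaultdict(int)
    for dt, rate in posts:
        key = (dt.weekday(), dt.hour)
        sums[key] += rate
        counts[key] += 1
    return {k: sums[k] / counts[k] for k in sums}


def top_slots(buckets, n=3):
    """Recommend the n buckets with the highest average engagement."""
    return sorted(buckets, key=buckets.get, reverse=True)[:n]
```

Bucketing on the user's local weekday/hour (rather than UTC) is what makes "Monday 9am" mean the same thing for every user.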

{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "How do you ensure a scheduled post is published exactly once with multiple workers?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Use FOR UPDATE SKIP LOCKED in the worker query: SELECT * FROM posts WHERE status = SCHEDULED AND scheduled_at <= NOW() FOR UPDATE SKIP LOCKED LIMIT 50. FOR UPDATE locks the selected rows. SKIP LOCKED means competing workers skip rows already locked by another worker — they see and process different rows. Immediately update selected posts to status=PUBLISHING before releasing the lock — this prevents a slow worker from having its selection re-picked by another worker after the lock is released. This pattern allows horizontal scaling of workers with guaranteed at-most-once pickup. For at-least-once with deduplication: also use an idempotency key on the platform publish API call."
}
},
{
"@type": "Question",
"name": "How do you handle OAuth token expiry for multiple social platforms?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Store access_token, refresh_token, and token_expires_at per PublishingAccount. Before each publish: check if token_expires_at < NOW() + 5 minutes. If so: call the platform token refresh endpoint with the refresh_token. Update access_token and token_expires_at in the database. Use the fresh token for the publish. If the refresh fails (refresh_token also expired or revoked): mark the account as REQUIRES_REAUTH and notify the user. Never hardcode token lifetimes — different platforms have different expiry (Twitter never expires, Instagram: 60 days, LinkedIn: 60 days). Some platforms (Instagram) require proactive token refresh before expiry via a cron job (refresh tokens that expire within 30 days)."
}
},
{
"@type": "Question",
"name": "How do you handle platform-specific content constraints?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Each platform has different limits: Twitter: 280 characters (URLs count as 23 chars regardless of length), 4 images or 1 video. Instagram: 2200 character caption, requires at least one image or video, no clickable links in captions. LinkedIn: 3000 characters, supports documents/articles. Facebook: 63,206 characters. Validation approach: validate against platform constraints before scheduling (not at publish time — fail fast). Store a content_warnings field per platform showing constraint violations. Allow the user to edit content per-platform (LinkedIn version vs Twitter version of the same post). At publish time, validate again in the adapter in case constraints changed — return a clear error if the platform API rejects the content."
}
},
{
"@type": "Question",
"name": "How do you implement best-time recommendations for posting?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Collect historical engagement data: for each published post, store publish_time (day_of_week, hour_of_day) and engagement metrics (likes, comments, shares, impressions). For each user, compute average engagement rate per (platform, day_of_week, hour_of_day) bucket. A user with 90 days of post history has data across 168 time buckets (7 days * 24 hours). Recommend top-3 buckets by average engagement rate. Cold start (new users): use global averages or platform-level research data (LinkedIn peaks Tuesday-Thursday 9am, Twitter peaks mornings/evenings). Update recommendations monthly as new post data accumulates. Show the recommendation with confidence level — a bucket with 2 data points is less reliable than one with 20."
}
},
{
"@type": "Question",
"name": "How would you scale this system to support millions of scheduled posts?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Partition the posts table by scheduled_at date — queries for today's due posts only scan one partition. Use a dedicated job queue (Celery, Temporal, AWS SQS) instead of DB polling — the scheduler enqueues publish tasks when a post becomes due; workers pull from the queue. This decouples scheduling (DB) from execution (workers) and handles bursty traffic better. For high-throughput publishing: separate worker pools per platform (Twitter workers, Instagram workers) with platform-specific concurrency limits. Rate limit tracking per platform account: use Redis counters to track API calls per time window and throttle workers accordingly. Store platform API credentials in a secrets manager (AWS Secrets Manager, HashiCorp Vault) rather than in the database."
}
}
]
}

Asked at: Twitter/X, LinkedIn, Snap, and Shopify (see the respective interview guides).
