Question 1

How do you implement geospatial search for properties within a radius?

Accepted Answer

Use a spatial index -- the standard approach is PostGIS in PostgreSQL. Store each property's location as a geometry point: ST_SetSRID(ST_MakePoint(lng, lat), 4326). Create a GIST index on the location column. For radius search: SELECT * FROM properties WHERE ST_DWithin(location::geography, ST_MakePoint(:lng, :lat)::geography, :radius_meters) AND status = FOR_SALE ORDER BY ST_Distance(location::geography, ST_MakePoint(:lng, :lat)::geography) ASC LIMIT 20. The geography type handles Earth curvature for accurate distances. ST_DWithin with a spatial index is O(log n + k) where k is the result count. For even larger scale: use Elasticsearch with geo_distance queries. Elasticsearch distributes across nodes and handles hundreds of millions of documents.

Question 2

How do you handle concurrent updates to property status (AVAILABLE u2192 SOLD)?

Accepted Answer

Property status changes (listing u2192 sale) can have race conditions: two buyers submit offers simultaneously. Optimistic locking: include a version number on the property record. On update: UPDATE properties SET status=UNDER_CONTRACT, version=version+1, buyer_id=X WHERE property_id=P AND status=FOR_SALE AND version=:current_version. If rows_affected == 0: another transaction modified the record -- retry or return a conflict error. This prevents two buyers from both "winning" the property. For the final status change to SOLD: require an agent action with a signed purchase agreement document ID. Audit log: record every status change with timestamp, user_id, and reason -- essential for disputes. Status transitions should form a state machine: FOR_SALE u2192 UNDER_CONTRACT u2192 SOLD (no backward transitions except with explicit agent override).

Question 3

How do you implement saved search alerts efficiently at scale?

Accepted Answer

Naive approach: when a new listing is added, scan all saved searches and evaluate each filter. With 10M saved searches and 10K new listings per day: 100B filter evaluations -- too slow. Efficient approach: inverted index on key filter dimensions. For each (city, property_type, price_bucket) combination, maintain a list of saved_search_ids that match. On new listing: look up saved searches for the listing's (city, type, price_bucket). Evaluate remaining filters (bedrooms, features) only for those matching searches. This reduces evaluations by 99%. Implementation: store the inverted index in Redis as sets: SADD searches:city:NYC:type:CONDO:price:500k-750k search_id. On new listing, SMEMBERS for the matching bucket, evaluate each returned search. Send alerts via a job queue (one job per alert to avoid blocking).

Question 4

How do you design property photo upload and display at scale?

Accepted Answer

Upload flow: client requests a pre-signed S3 URL from the API (valid for 15 minutes). Client uploads photo directly to S3 -- never through the server. S3 triggers a Lambda on upload completion. Lambda: (1) validates the file (MIME type check, virus scan), (2) calls a resize function to generate thumbnails: 1200x800 for full-view, 400x267 for list cards, 200x133 for map thumbnails. Store thumbnails in S3 under predictable keys: photos/{property_id}/{photo_id}/{size}.webp. Serve via CloudFront CDN -- edge-cached globally. Photo ordering: stored as order_index on the photo record. Drag-and-drop reordering updates order_index values in batch. Primary photo (first in order) is used as the listing card image. Lazy load non-primary photos on scroll.

Question 5

How do you handle search ranking for property listings?

Accepted Answer

Simple: sort by price (ascending/descending), date listed (newest), or distance. Better: relevance ranking. Score each property: freshness_score (recently listed properties rank higher -- decay function on days_since_listed), completeness_score (properties with more photos, virtual tours, and description length rank higher -- better listings convert better), match_score (how well the property matches the search criteria -- exact bedroom match scores higher than inexact), agent_response_rate (agents who respond to inquiries quickly get a boost). Combine: final_score = 0.4 * freshness + 0.3 * match + 0.2 * completeness + 0.1 * agent_quality. Store the pre-computed score on the property record, updated nightly. Use Elasticsearch function_score query to blend the relevance score with filter results.

Low-Level Design: Real Estate Listing Platform — Property Search, Geospatial Queries, and Agent Matching

Core Entities

Property Search and Filtering

Map-Based Search

Agent Matching

Saved Searches and Alerts

Media and Virtual Tours