Why Randomized Algorithms?
Randomized algorithms use random choices to achieve good expected performance, to simplify implementation, or to defeat adversarial worst-case inputs that hurt deterministic algorithms. Two flavors: Las Vegas algorithms (always correct, random runtime, e.g. randomized QuickSort) and Monte Carlo algorithms (always fast, probabilistically correct, e.g. approximate counting). Interview favorites: reservoir sampling, QuickSelect, and randomized QuickSort.
Reservoir Sampling
Sample k items uniformly at random from a stream of unknown length n, using O(k) memory. Algorithm: fill the reservoir with the first k items. For each subsequent item i (0-indexed, i >= k): generate a random integer j in [0, i]; if j < k, replace reservoir[j] with item i. Proof of uniformity: each item ends up in the final sample with probability exactly k/n. For k=1 (items numbered 1..n): item i is selected at its own step with probability 1/i, and survives each later step j > i with probability (j-1)/j, so its survival probability telescopes to i/(i+1) * … * (n-1)/n = i/n. Overall probability = 1/i * i/n = 1/n. Generalizes to weighted reservoir sampling (each item's inclusion probability proportional to its weight).
import random

def reservoir_sample(stream, k):
    reservoir = []
    for i, item in enumerate(stream):
        if i < k:
            reservoir.append(item)
        else:
            # Random index in [0, i] (inclusive); item i replaces a
            # reservoir slot with probability k / (i + 1).
            j = random.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir
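As a sanity check (a quick simulation, not part of the algorithm), the uniformity claim can be verified empirically: over many runs, each item should land in the sample with frequency close to k/n. The function is repeated here so the snippet runs standalone:

```python
import random
from collections import Counter

def reservoir_sample(stream, k):
    reservoir = []
    for i, item in enumerate(stream):
        if i < k:
            reservoir.append(item)
        else:
            j = random.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir

random.seed(0)
trials = 20000
counts = Counter()
for _ in range(trials):
    for x in reservoir_sample(range(10), 2):
        counts[x] += 1

# Each of the 10 items should appear with frequency near k/n = 2/10.
for item in range(10):
    freq = counts[item] / trials
    assert abs(freq - 0.2) < 0.03, (item, freq)
```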
QuickSelect: k-th Largest Element
Find the k-th largest element in an unsorted array. Average O(n), worst case O(n^2). Partition around a pivot: elements larger than or equal to the pivot go left, smaller go right (a descending partition, so position p holds the (p+1)-th largest). After partitioning: if the pivot's position p equals k-1 (0-indexed from the largest), return the pivot; if p > k-1, recurse on the left subarray; otherwise recurse on the right. Random pivot selection avoids the worst-case O(n^2) behavior on already-sorted inputs. Expected recurrence: T(n) = T(n/2) + O(n), which solves to O(n) via the geometric series n + n/2 + n/4 + …. For guaranteed O(n): use median-of-medians pivot selection (interview overkill). LC 215. Alternative: a min-heap of size k in O(n log k) — simpler and often preferred in interviews for clarity.
def find_kth_largest(nums, k):
    def partition(lo, hi):
        # Random pivot, swapped to the end; partition in descending
        # order so elements >= pivot end up left of its final position.
        pivot_idx = random.randint(lo, hi)
        nums[pivot_idx], nums[hi] = nums[hi], nums[pivot_idx]
        pivot = nums[hi]
        i = lo
        for j in range(lo, hi):
            if nums[j] >= pivot:
                nums[i], nums[j] = nums[j], nums[i]
                i += 1
        nums[i], nums[hi] = nums[hi], nums[i]
        return i

    lo, hi = 0, len(nums) - 1
    while lo <= hi:
        p = partition(lo, hi)
        if p == k - 1:
            return nums[p]
        elif p > k - 1:
            hi = p - 1
        else:
            lo = p + 1
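The min-heap alternative mentioned above can be sketched with heapq (the function name is illustrative):

```python
import heapq

def find_kth_largest_heap(nums, k):
    # Maintain a min-heap of the k largest elements seen so far;
    # the root is then the k-th largest overall. O(n log k) time, O(k) space.
    heap = []
    for x in nums:
        heapq.heappush(heap, x)
        if len(heap) > k:
            heapq.heappop(heap)  # discard the smallest of the k+1
    return heap[0]

print(find_kth_largest_heap([3, 2, 1, 5, 6, 4], 2))  # 5
```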
Randomized QuickSort Analysis
Expected time O(n log n) regardless of input order. Each element is compared to the pivot at most once per partition level. Expected number of comparisons: sum over all pairs (i,j) of Pr[i and j are compared]. Two elements i and j (where i < j in sorted order) are compared iff one of them is chosen as the pivot before any element between them. Probability = 2/(j-i+1). Total expected comparisons = sum over pairs = O(n log n) by harmonic series. Random pivot selection is the key — it breaks adversarial inputs that cause O(n^2) on deterministic pivots (first or last element).
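The analysis above can be illustrated with a minimal randomized QuickSort sketch (a non-in-place version for clarity, rather than the usual in-place partition):

```python
import random

def quicksort(nums):
    # Randomized QuickSort: a uniformly random pivot makes the expected
    # comparison count O(n log n) for every input, including sorted ones.
    if len(nums) <= 1:
        return nums
    pivot = random.choice(nums)
    less = [x for x in nums if x < pivot]
    equal = [x for x in nums if x == pivot]
    greater = [x for x in nums if x > pivot]
    return quicksort(less) + equal + quicksort(greater)

print(quicksort([5, 3, 8, 1, 9, 2]))  # [1, 2, 3, 5, 8, 9]
```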
Skip Lists
A probabilistic sorted data structure with O(log n) expected insert, delete, and search. Structure: multiple linked list levels. Bottom level has all elements. Each higher level skips ahead (each element promoted with probability 1/2). Search: start at the top level, advance right while next element <= target, drop down a level when overshooting. Expected height: O(log n). Skip lists are simpler to implement than balanced BSTs and support range queries naturally. Used in Redis sorted sets (for O(log n) rank queries), LevelDB (memtable). For interviews: explain the probabilistic leveling and the O(log n) expected complexity from the geometric distribution of level heights.
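A minimal sketch of the structure described above, with coin-flip promotion and insert/search only (no delete, and `MAX_LEVEL` is an arbitrary cap chosen here for illustration):

```python
import random

class SkipNode:
    def __init__(self, value, level):
        self.value = value
        self.forward = [None] * level  # one forward pointer per level

class SkipList:
    MAX_LEVEL = 16

    def __init__(self):
        self.head = SkipNode(None, self.MAX_LEVEL)  # sentinel head
        self.level = 1

    def _random_level(self):
        # Promote with probability 1/2: heights are geometrically
        # distributed, giving expected height O(log n).
        lvl = 1
        while random.random() < 0.5 and lvl < self.MAX_LEVEL:
            lvl += 1
        return lvl

    def insert(self, value):
        update = [self.head] * self.MAX_LEVEL
        node = self.head
        for i in range(self.level - 1, -1, -1):
            while node.forward[i] and node.forward[i].value < value:
                node = node.forward[i]
            update[i] = node  # last node before the insert point at level i
        lvl = self._random_level()
        self.level = max(self.level, lvl)
        new = SkipNode(value, lvl)
        for i in range(lvl):
            new.forward[i] = update[i].forward[i]
            update[i].forward[i] = new

    def search(self, value):
        node = self.head
        for i in range(self.level - 1, -1, -1):
            while node.forward[i] and node.forward[i].value < value:
                node = node.forward[i]
        node = node.forward[0]
        return node is not None and node.value == value

sl = SkipList()
for v in [5, 1, 9, 3]:
    sl.insert(v)
print(sl.search(3), sl.search(4))  # True False
```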
Monte Carlo vs Las Vegas
Las Vegas algorithms: always produce the correct answer; randomness affects runtime only. Examples: randomized QuickSort (always correctly sorts), randomized QuickSelect (always finds the correct k-th element), randomized hashing. Expected runtime is the performance guarantee. Monte Carlo algorithms: always run in bounded time; correctness is probabilistic. Examples: Miller-Rabin primality test (fast but has a small error probability per round, repeat to reduce error), random projection (approximate nearest neighbor), Monte Carlo integration. Reduce error by running multiple independent rounds (error decreases exponentially). For interviews: QuickSelect and reservoir sampling are Las Vegas; Miller-Rabin is Monte Carlo.
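As a concrete Monte Carlo example, here is a sketch of Miller-Rabin (a minimal implementation using the standard decomposition n - 1 = 2^r * d with d odd; the small-prime pre-check is an added convenience, not part of the core test):

```python
import random

def is_probably_prime(n, rounds=20):
    # Miller-Rabin: Monte Carlo. Bounded running time; a composite n
    # passes each round with probability at most 1/4, so the error
    # probability is at most (1/4) ** rounds. Primes always pass.
    if n < 2:
        return False
    for p in (2, 3, 5, 7):
        if n % p == 0:
            return n == p
    # Write n - 1 = 2^r * d with d odd.
    d, r = n - 1, 0
    while d % 2 == 0:
        d //= 2
        r += 1
    for _ in range(rounds):
        a = random.randrange(2, n - 1)
        x = pow(a, d, n)
        if x == 1 or x == n - 1:
            continue
        for _ in range(r - 1):
            x = pow(x, 2, n)
            if x == n - 1:
                break
        else:
            return False  # a witnesses that n is composite
    return True

print(is_probably_prime(561))   # False (Carmichael number, 3 * 11 * 17)
print(is_probably_prime(1009))  # True
```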
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "How does reservoir sampling guarantee uniform probability?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Reservoir sampling guarantees that each element has equal probability k/n of being in the final sample, even though n is unknown during processing. Proof for k=1 (select 1 item from n): item i is selected with probability 1/i (it wins the random draw at step i). It survives each subsequent step j > i with probability (j-1)/j (the new item does NOT replace it). Product of survival probabilities: (i/(i+1)) * ((i+1)/(i+2)) * … * ((n-1)/n) = i/n. Combined with selection probability 1/i: overall probability = 1/i * i/n = 1/n. Every item has probability 1/n — uniform. For k > 1, the same telescoping product argument applies, giving each item probability k/n."
      }
    },
    {
      "@type": "Question",
      "name": "What is the expected time complexity of QuickSelect and why?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "QuickSelect finds the k-th smallest element in expected O(n) time with random pivot selection. At each step, the pivot splits the array into two parts. We recurse only on the relevant part (the one containing the k-th element). Expected recurrence: if the pivot lands in the middle half (probability 1/2), we recurse on at most 3n/4 elements. T(n) = n + T(3n/4) with probability 1/2, giving T(n) = O(n) in expectation (geometric series: n + 3n/4 + 9n/16 + … = 4n). More precisely, expected comparisons = 2n + o(n). Compare to worst case O(n^2) when the pivot is always the minimum or maximum — random pivot selection makes this event exponentially unlikely."
      }
    },
    {
      "@type": "Question",
      "name": "What is a skip list and how does it compare to a balanced BST?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "A skip list is a probabilistic sorted linked list with multiple levels. The bottom level has all elements. Each element is promoted to the next level with probability 1/2. Higher levels act as express lanes for faster traversal. Expected height: O(log n). Search: descend from the top level, move right as long as the next node is less than or equal to the target, drop down when the next node exceeds it. Expected O(log n) per operation. Compared to balanced BSTs (AVL, red-black): skip lists are simpler to implement correctly (no rotations), support concurrent operations more easily (lock-coupling on fewer nodes), and support range queries naturally. Red-black trees have deterministic O(log n) while skip lists have expected O(log n). Redis sorted sets use skip lists."
      }
    },
    {
      "@type": "Question",
      "name": "What is the difference between Monte Carlo and Las Vegas algorithms?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Las Vegas algorithms always produce the correct output; only their running time is random. Examples: randomized QuickSort (always sorts correctly, expected O(n log n)), QuickSelect (always finds the correct element), randomized hashing. The guarantee is on expected runtime. Monte Carlo algorithms always run in bounded deterministic time but may produce incorrect results with some probability. Examples: Miller-Rabin primality test (may incorrectly classify a composite as prime with probability at most 1/4 per round), approximate counting, Monte Carlo integration. Reduce error by repeating: k independent Monte Carlo rounds reduce error probability to (1/4)^k. For interviews: randomized sorting and selection are Las Vegas; probabilistic primality tests are Monte Carlo."
      }
    },
    {
      "@type": "Question",
      "name": "How is randomized hashing used to prevent worst-case hash collisions?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "A deterministic hash function can be attacked: an adversary knowing the hash function can craft inputs that all map to the same bucket, causing O(n) lookup time (hash DoS). Solution: universal hashing. At startup, randomly choose a hash function from a family of universal hash functions — the adversary does not know which one was chosen. A family H is universal if for any two distinct keys x,y: Pr[h(x)==h(y)] <= 1/m for a random h in H (m = table size). This guarantees expected O(1) per lookup even for adversarial inputs. In practice: Python dictionaries use a randomized hash seed (PYTHONHASHSEED) per process invocation, making hash DoS attacks impractical."
      }
    }
  ]
}