Q: What problems reduce to minimum spanning tree?

Network design: connect N cities with minimum cable length (classic MST). Find the minimum cost to make a graph connected. Cluster analysis: run MST, remove the K-1 most expensive edges to get K clusters (single-linkage hierarchical clustering). LC 1168 Water Distribution: add a virtual well node with edge weight = well digging cost; MST gives the optimal combination of wells and pipes. Bottleneck spanning tree: a spanning tree that minimizes the maximum edge weight. The MST is also the bottleneck spanning tree (the min bottleneck path between any two vertices in the MST equals the max edge weight on the MST path -- a property of MSTs). Max reliability spanning tree: maximize the product of edge reliabilities -- equivalent to maximum spanning tree on log(reliability) weights. Approximation algorithm: the MST is a 2-approximation for metric TSP (MST weight

Question 1

What is the difference between Kruskal and Prim for minimum spanning tree?

Accepted Answer

Kruskal: sorts all edges globally by weight, then greedily adds the cheapest edge that does not form a cycle (checked with Union-Find). O(E log E) time (dominated by sorting). Works on any representation. Best for sparse graphs (E close to V). Prim: grows the MST from a starting vertex. Uses a priority queue to always extend with the cheapest edge connecting the MST to a non-MST vertex. O((V+E) log V) with a binary heap. O(V^2) with an adjacency matrix (better for dense graphs). Best for dense graphs or when given an adjacency matrix. Both algorithms produce the same MST (or one of the possible MSTs if there are equal-weight edges). In interviews: Kruskal is simpler to implement with Union-Find and handles disconnected graphs naturally (produces a minimum spanning forest).

Question 2

How does Kruskal algorithm use Union-Find to avoid cycles?

Accepted Answer

Kruskal adds edges in non-decreasing weight order. Adding an edge (u,v) creates a cycle if and only if u and v are already in the same connected component. Union-Find tracks connected components efficiently. For each candidate edge (u,v,w): find(u) and find(v) return the component roots. If roots are equal: u and v are connected -- skip this edge (would form a cycle). If roots differ: this edge connects two different components -- add to MST, call union(u,v) to merge the components. This is the cycle property of MSTs: the cheapest edge crossing any cut (between two components) is always in the MST. Union-Find with path compression and union by rank gives O(alpha(V)) amortized per operation. Total time: O(E log E) sorting + O(E alpha(V)) union-find = O(E log E).

Question 3

How do you find the minimum spanning tree of a complete graph efficiently?

Accepted Answer

A complete graph has E = V*(V-1)/2 edges. Running Kruskal on all edges is O(V^2 log V). Prim with an adjacency matrix is O(V^2) -- optimal for complete graphs. LC 1584 Min Cost to Connect All Points is a classic example: given V points on a 2D plane, connect all with minimum total Manhattan distance. There are V^2/2 potential edges. Prim O(V^2): maintain min_dist[v] = minimum edge weight from v to the current MST. In each of V iterations, find the unvisited vertex with minimum min_dist (O(V) scan), add it to the MST, and update min_dist for its neighbors. Total: O(V^2). This beats Kruskal O(V^2 log V) for dense graphs. For the specific case of Euclidean MST (points in a plane), there are O(V log V) algorithms using Delaunay triangulation.

Question 4

What is the cut property and why does it prove MST algorithms are correct?

Accepted Answer

The cut property states: for any cut of the graph (partition of vertices into two non-empty sets S and V-S), the minimum weight edge crossing the cut is in some MST. Proof: suppose edge e = (u,v) is the minimum crossing edge but not in the MST T. T must have a path from u to v (since T is spanning). This path contains some edge e prime crossing the cut. Adding e to T creates a cycle; removing e prime breaks the cycle, giving a tree T prime with weight T - weight(e prime) + weight(e) < weight(T) (since e is cheaper than e prime). Contradiction -- T was not the MST. Why it proves Kruskal: when Kruskal adds an edge (u,v), the cut is (MST so far, remaining vertices). The edge (u,v) is the minimum crossing edge for some cut. Why it proves Prim: at each step, Prim adds the minimum edge from the current tree to any non-tree vertex.

Question 5

What problems reduce to minimum spanning tree?

Accepted Answer

Network design: connect N cities with minimum cable length (classic MST). Find the minimum cost to make a graph connected. Cluster analysis: run MST, remove the K-1 most expensive edges to get K clusters (single-linkage hierarchical clustering). LC 1168 Water Distribution: add a virtual well node with edge weight = well digging cost; MST gives the optimal combination of wells and pipes. Bottleneck spanning tree: a spanning tree that minimizes the maximum edge weight. The MST is also the bottleneck spanning tree (the min bottleneck path between any two vertices in the MST equals the max edge weight on the MST path -- a property of MSTs). Max reliability spanning tree: maximize the product of edge reliabilities -- equivalent to maximum spanning tree on log(reliability) weights. Approximation algorithm: the MST is a 2-approximation for metric TSP (MST weight

Minimum Spanning Tree Interview Patterns: Kruskal, Prim, and Network Design Problems (2025)

What Is a Minimum Spanning Tree?

Kruskal Algorithm

Prim Algorithm

Maximum Spanning Tree

Common Interview Problems

Kruskal vs Prim