Question 1

What is the difference between a service mesh and an API gateway?

Accepted Answer

An API gateway handles north-south traffic (external clients to internal services): authentication, rate limiting, SSL termination, routing, API versioning. It is an entry point — one per cluster or per API surface. A service mesh handles east-west traffic (service to service inside the cluster): mTLS, retries, circuit breaking, distributed tracing, canary deployments. It operates transparently via sidecar proxies without changing application code. They complement each other: API gateway at the edge, service mesh inside the cluster. Kong and AWS API Gateway for external traffic; Istio/Linkerd for internal. Confusion arises because both do routing and traffic management — the scope is what differs.

Question 2

How does Envoy implement circuit breaking and what are the configuration parameters?

Accepted Answer

Envoy circuit breaking operates at the connection pool and outlier detection level. Connection pool limits: max_connections (TCP), max_pending_requests (HTTP/1), max_requests (HTTP/2), max_retries — these prevent a single slow upstream from consuming all resources. Outlier detection (circuit breaker): consecutiveGatewayErrors (5xx count before ejection), interval (evaluation window, e.g., 10s), baseEjectionTime (how long the host is ejected, e.g., 30s), maxEjectionPercent (max fraction of hosts ejectable at once, e.g., 50%). When a host exceeds consecutiveErrors within the interval, it is ejected from the load balancing pool. After baseEjectionTime, one probe request is sent — if successful, the host rejoins. This is the half-open state in standard circuit breaker terminology.

Question 3

How does mTLS work in a service mesh and what identity system does it use?

Accepted Answer

In a service mesh, every workload gets a cryptographic identity via SPIFFE (Secure Production Identity Framework For Everyone). The identity is encoded in an X.509 certificate as a SPIFFE URI: spiffe://cluster.local/ns/default/sa/payment-service. The control plane (Istio's istiod or SPIRE) acts as a certificate authority: it signs short-lived certificates (default 24h) for each service account. Envoy sidecars handle TLS handshakes — both sides present and validate certificates. mTLS provides: (1) encryption of all traffic, (2) strong service identity — payment-service can only accept connections from authorized callers, not just any pod in the cluster, (3) zero-trust networking — network-level firewall rules alone are no longer sufficient. Certificate rotation is automatic and transparent to the application process.

System Design Interview: Microservices and Service Mesh (Envoy, Istio, mTLS)

What Is a Service Mesh?

Why Microservices Fail Without a Mesh

Sidecar Proxy Architecture

Service Discovery

mTLS — Mutual TLS

Circuit Breaker in the Mesh

Traffic Management

Observability

Interview Framework