Load Balancing
Category: infrastructure
The process of distributing incoming network traffic across multiple servers.
Load balancing acts as a "traffic controller": it ensures no single server bears too much demand. Spreading the load improves resource utilization, increases throughput, and reduces response time. It also provides fault tolerance: if one server crashes, its traffic is redirected to the servers that remain healthy.
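As an illustration of the idea, here is a minimal sketch of one common strategy, round-robin, with a simple notion of server health. The class name, method names, and server names are hypothetical and chosen only for this example; real load balancers use richer health checks and often other strategies such as least-connections.

```python
class RoundRobinBalancer:
    """Toy round-robin balancer that skips servers marked unhealthy."""

    def __init__(self, servers):
        self.servers = list(servers)      # fixed rotation order
        self.healthy = set(self.servers)  # servers currently eligible for traffic
        self._next = 0                    # index of the next server to try

    def mark_down(self, server):
        # Stop sending traffic to a server that failed a health check.
        self.healthy.discard(server)

    def mark_up(self, server):
        # Return a known server to the rotation once it recovers.
        if server in self.servers:
            self.healthy.add(server)

    def pick(self):
        # Try each server at most once, starting from the rotation pointer.
        for _ in range(len(self.servers)):
            server = self.servers[self._next]
            self._next = (self._next + 1) % len(self.servers)
            if server in self.healthy:
                return server
        raise RuntimeError("no healthy servers available")


# Hypothetical server names, for illustration only.
lb = RoundRobinBalancer(["app-1", "app-2", "app-3"])
first_four = [lb.pick() for _ in range(4)]
lb.mark_down("app-2")  # simulate a crash
after_failure = [lb.pick() for _ in range(2)]
```

Requests rotate evenly across the three servers; once "app-2" is marked down, the rotation simply skips it, which is the fault-tolerance behavior described above.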
Common Examples
- We use a layer-7 load balancer to route traffic based on the user’s region, drastically reducing latency for our global policyholders.
- Load balancing is the primary mechanism that prevents our API gateway from crashing during heavy peak-hour traffic spikes.
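The region-based routing in the first example can be sketched roughly as follows. A layer-7 balancer can inspect HTTP request data, so this sketch assumes the region arrives in a request header; the header name `X-User-Region`, the pool names, and the default region are all hypothetical.

```python
# Hypothetical backend pools keyed by region code.
POOLS = {
    "eu": ["eu-app-1", "eu-app-2"],
    "us": ["us-app-1", "us-app-2"],
}
DEFAULT_REGION = "us"  # assumed fallback when the region is missing or unknown


def choose_pool(headers):
    """Return the backend pool for a request based on its region header."""
    region = headers.get("X-User-Region", "").strip().lower()
    return POOLS.get(region, POOLS[DEFAULT_REGION])


eu_pool = choose_pool({"X-User-Region": "EU"})
fallback_pool = choose_pool({})
```

A per-pool strategy such as round-robin would then pick one server from the returned list.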