AWS ELB Surge Queue | Blue Matador

Effects

It is normal for a high-traffic load balancer to have a non-zero surge queue length. An increase in this metric, however, can mean delayed response times for your users. Once the queue reaches its limit of 1,024 requests, the load balancer rejects any additional requests.
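
One way to keep an eye on the queue is to read the `SurgeQueueLength` metric that Classic Load Balancers publish in the `AWS/ELB` CloudWatch namespace. The following is a minimal sketch, assuming a boto3 CloudWatch client is passed in; the function names, the load balancer name, and the 80% warning threshold are illustrative assumptions rather than anything AWS defines.

```python
from datetime import datetime, timedelta, timezone

SURGE_QUEUE_LIMIT = 1024  # queue limit described above


def peak_surge_queue(cloudwatch, lb_name, minutes=60):
    """Return the highest SurgeQueueLength seen in the last `minutes` minutes.

    `cloudwatch` is expected to behave like boto3.client("cloudwatch").
    Returns 0 when CloudWatch has no datapoints for the window.
    """
    end = datetime.now(timezone.utc)
    start = end - timedelta(minutes=minutes)
    response = cloudwatch.get_metric_statistics(
        Namespace="AWS/ELB",
        MetricName="SurgeQueueLength",
        Dimensions=[{"Name": "LoadBalancerName", "Value": lb_name}],
        StartTime=start,
        EndTime=end,
        Period=60,
        Statistics=["Maximum"],
    )
    return max((p["Maximum"] for p in response["Datapoints"]), default=0)


def describe_queue(peak, warn_ratio=0.8):
    """Classify a peak queue length; warn_ratio is an arbitrary example threshold."""
    if peak >= SURGE_QUEUE_LIMIT:
        return "full"
    if peak >= SURGE_QUEUE_LIMIT * warn_ratio:
        return "near-limit"
    return "ok"
```

With real credentials this could be called as `describe_queue(peak_surge_queue(boto3.client("cloudwatch"), "my-load-balancer"))`. A non-zero `Sum` of the `SpilloverCount` metric over the same window would confirm that requests were actually rejected.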

Troubleshooting

Use application access logs or ELB access logs to investigate the traffic being sent to your application. A high-traffic endpoint whose response time has increased can easily back up a web server. Additionally, any high-latency endpoint that sees an increase in traffic can cause the entire web server to back up.
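
To find such endpoints, ELB access logs can be aggregated by request path. The sketch below assumes the Classic Load Balancer access log format, where the sixth space-separated field is `backend_processing_time` and the twelfth is the quoted request line; the function name `slowest_paths` is hypothetical, and the lines are assumed to have already been downloaded from the log bucket.

```python
import shlex
from collections import defaultdict
from urllib.parse import urlsplit


def slowest_paths(log_lines, top=5):
    """Return (path, request_count, mean_backend_seconds) tuples, slowest first."""
    totals = defaultdict(lambda: [0, 0.0])  # path -> [count, summed backend time]
    for line in log_lines:
        try:
            fields = shlex.split(line)  # keeps quoted request and user agent intact
        except ValueError:
            continue  # skip lines with unbalanced quotes
        if len(fields) < 12:
            continue  # not a complete access log entry
        try:
            backend_time = float(fields[5])  # backend_processing_time
        except ValueError:
            continue
        if backend_time < 0:
            continue  # -1 means the request never reached a backend
        parts = fields[11].split()  # "METHOD URL PROTOCOL"
        if len(parts) != 3:
            continue
        path = urlsplit(parts[1]).path or "/"
        totals[path][0] += 1
        totals[path][1] += backend_time
    rows = [(path, n, total / n) for path, (n, total) in totals.items()]
    rows.sort(key=lambda row: row[2], reverse=True)
    return rows[:top]
```

Comparing the output for a window before the queue grew with one during the incident shows whether a path became slower, busier, or both.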

In the short term, you can add backend capacity to the load balancer to relieve a high surge queue. This is an appropriate response to a higher request rate. If the load balancer’s instances are managed by an Auto Scaling group, make sure that the group is scaling up and down correctly in response to traffic.

The long-term solution is to find the bottleneck in your application code and fix it. This can mean adjusting thread pool sizes, caching expensive operations, or simply scaling up.
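
As a rough guide when sizing thread pools, Little's Law relates steady-state concurrency to throughput and latency: requests in flight ≈ arrival rate × mean response time. The following is a minimal sketch of that arithmetic; the function name, the example numbers, and the 25% headroom are assumptions chosen for illustration, not recommended values.

```python
import math


def workers_needed(requests_per_second, mean_latency_seconds, headroom=0.25):
    """Estimate concurrent workers via Little's Law (L = lambda * W), plus headroom."""
    in_flight = requests_per_second * mean_latency_seconds
    return math.ceil(in_flight * (1 + headroom))
```

For example, 200 requests per second at 250 ms mean latency keeps about 50 requests in flight, so `workers_needed(200, 0.25)` suggests 63 workers. If latency doubles to 500 ms, the same traffic needs 125, which is why a single slow endpoint can exhaust a pool that was sized for normal conditions.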

Resources

Elastic Load Balancing Metrics and Dimensions (AWS Documentation)
Access Logs for Elastic Load Balancers (AWS News Blog)