The problem with this kind of theoretical analysis is that most load balancers *...

zer00eyz · 2026-06-20T14:55:19 1781967319

> can only handle a single concurrent request, and has no internal queuing

And the systems that load balancers front almost never behave this way...

Dont even get me started on client performance here as well, latency, speed, caching -- that can all be impacted by payload size.

The article is interesting, but it is an ideal that almost never turns up in the real world.

jiggawatts · 2026-06-20T21:15:45 1781990145

Precisely. I was going to further add that most load balancers are network appliances and hence:

a) There must be network packet buffers on each receiver, which is a per-server "queue".

b) The optimal performance can only be achieved by pipelining requests sent to the servers, otherwise there is "dead time" caused by the network latency.

Perfect queuing to workers that act as a "slot" can only be implemented in-process on a single machine, and even there having small queues or buffers per core can improve throughput!