Putting a queue in front of an overloaded service feels like relief โ spikes disappear, callers get instant acknowledgements. But a queue adds no capacity. It borrows capacity from the future.
If arrival rate exceeds processing rate for any sustained window, the backlog grows without bound and latency grows with it. The queue was never a fix; it was hiding the fact that you are simply too slow. The debt comes due as ballooning queue depth and work thatโs stale by the time itโs handled.
Queues absorb bursts, not trends. They smooth spiky load, decouple producers from consumers, and buy time to scale. What they cannot do is repay a throughput deficit.
Treat queue depth as a leading indicator of trouble, and pair every queue with backpressure so the loan has a credit limit โ better to reject fast than to accept work youโll never finish.