Latency articles

11/1/2025 • EN

⚡️Does saving 1 millisecond really matter?* Answer: more than you’d think.

Explores the compounding impact of shaving milliseconds off microservice latency in distributed systems, affecting throughput and scalability.

Backend Development distributed systems latency Microservices performance optimization

Benjamin Cane

9/27/2025 • EN

Improve performance and reduce chances of request failures with this one simple trick! Avoid cross-region calls.

Explains how avoiding cross-region calls in microservices improves performance and resilience, and discusses the complexities of designing for regional isolation.

Cell Based Architecture distributed systems latency performance optimization Resilience

Benjamin Cane

8/23/2025 • EN

Sometimes when I tell people that logging can impact a microservices response time, I get strange looks. 🤨

Explains how improper logging can severely impact microservice latency and offers solutions like adjusting log levels and using async logging.

Asynchronous latency Logging Microservices performance

Benjamin Cane

7/26/2025 • EN

These 3 alerts catch the most issues

A software engineer shares three highly effective production alerts for catching bugs and system issues, based on real-world experience.

Alerting latency Monitoring observability sql

Swizec Teller

1/6/2025 • EN

RAG Isn’t a Modeling Problem. It’s a Data Engineering Problem.

Argues that RAG system failures stem from data engineering issues like fragmented data and governance, not from model or vector database choices.

Data Engineering Hybrid Search latency Rag Vector Databases

Alex Merced

1/10/2023 • EN

Monitoring latency in a Microservice Architecture

A case study on implementing a custom microservice (Chronos) to measure end-to-end latency in a microservice architecture.

distributed tracing latency Microservices Monitoring observability

Thomas Uhrig

2/3/2021 • EN

Where's the fastest place to put my server? How much does it matter?

Analyzes how server location and CDNs impact website speed using real latency data and access logs.

API Optimization cdn latency Network Performance Server Location

Cal Paterson

7/14/2020 • EN

The Emerging Landscape of Edge-Computing

Explores the evolution of edge computing from its initial consumer-focused vision to its current industrial applications and challenges.

cloud computing Cyber Foraging Edge Computing latency Mobile Devices

Mikhail Shilkov

12/24/2017 • EN

Computer latency: 1977-2017

A technical analysis measuring computer latency from 1977-2017, finding that some modern machines are slower than 40-year-old computers.

benchmark hardware keyboard input latency performance

Dan Luu

1/24/2016 • EN

Sampling v. tracing

Explains why sampling profilers fail at debugging tail latency and introduces Google's event tracing framework as a solution.

latency performance profiling sampling tracing

Dan Luu

11/1/2015 • EN

Infinite disk

Explores how hardware latency, especially disk vs. network speeds, enables the concept of 'infinite' disaggregated storage in data centers.

distributed storage hardware performance latency network latency ssd

Dan Luu

3/5/2015 • EN

Goodhearting IQ, cholesterol, and tail latency

Explores the challenge of measuring long-term success, using the Perry Preschool Study and Head Start program as examples of initial vs. lasting outcomes.

latency measurement optimization problem solving system design

Dan Luu

6/23/2014 • EN

Measuring the impact of the .NET Garbage Collector - An Update

An update on measuring .NET GC performance, correcting methodology and interpreting results with expert feedback.

.net Garbage Collector latency measurement performance

Matt Warren

3/5/2013 • EN

Latency mitigation strategies (by John Carmack)

John Carmack's archived article on reducing latency in virtual reality systems to improve user experience and prevent simulator sickness.

head-mounted display latency performance optimization real-time systems virtual reality

Dan Luu

2/8/2013 • EN

Save Bytes, Your Sanity and Money

A guide to optimizing web performance by reducing request latency, streamlining responses, and leveraging modern infrastructure to handle traffic efficiently.

Infrastructure latency optimization Scalability web performance

Simon Waight