Benjamin Cane

Benjamin Cane shares insights on distributed systems, reliability patterns, performance testing, and engineering leadership, focusing on practical lessons for building resilient software.

https://bencane.com

RSS Feed

1/22/2026

Distributed Systems Performance Testing Reliability Patterns Engineering Leadership Scalability Software Architecture Benchmarking System Monitoring DevOps #Bengineering

Articles from this Blog

28 articles from this blog

3/13/2026 • EN

You may be building for availability, but are you building for resiliency?

Explains the difference between high availability and high resiliency in system design, and why both are crucial.

distributed systems system design fault tolerance

3/6/2026 • EN

When your coding agent doesn’t understand your project, you’ll get junk

Explains how to improve AI coding agent results by providing project context via an AGENTS.md file.

software development developer productivity AI Coding Assistants

2/13/2026 • EN

Why is Infrastructure-as-Code so important? Hint: It's correctness

Explains why Infrastructure-as-Code's primary benefit is correctness and consistency, not just speed, leading to stable production environments.

DevOps software development automation

2/6/2026 • EN

Optimizing the team’s workflow can be more impactful than building business features

Explains why optimizing team workflows and fixing inefficiencies can have a greater long-term impact than just shipping new business features.

software development Team Productivity Codebase Maintenance

1/30/2026 • EN

I follow an architecture principle I call The Law of Collective Amnesia

A software architect introduces 'The Law of Collective Amnesia' to explain how system design intent fades over time and offers strategies to defend architecture.

software architecture code maintainability technical debt

1/23/2026 • EN

Performance testing without a target is like running a race with no finish line

Explains the critical importance of defining clear performance targets and monitoring production metrics for effective software performance testing.

software development qa benchmarking

1/16/2026 • EN

Many teams think performance testing means throwing traffic at a system until it breaks. That approach is fine, but it misses how systems are actually stressed in the real world.

Explains the difference between benchmark and endurance performance testing, and why both are needed for real-world system reliability.

software quality performance testing Load Testing

1/9/2026 • EN

Pre-populating caches is a “bolt-on” cache-optimization I've used successfully in many systems. It works, but it adds complexity

Explores pre-populating caches as a performance optimization, discussing its benefits, implementation trade-offs, and added complexity.

performance optimization caching Data Synchronization

1/2/2026 • EN

Don't be afraid to build a tool. Just don't become too attached to it.

A guide on when and how to build internal tools for development teams, emphasizing practicality over attachment.

DevOps software development cli

12/27/2025 • EN

One of the toughest engineering skills to develop is accepting a decision you disagree with. 😖

A guide for engineers on when to challenge technical decisions and when to accept and support them for team cohesion.

software engineering decision making engineering culture

12/20/2025 • EN

Canary deployments are an operational superpower, but the complexity they bring isn’t for everyone.

Compares Canary and Blue/Green deployment strategies, explaining their complexities, use cases, and when each is optimal for software releases.

DevOps software architecture Canary Deployments

12/13/2025 • EN

Everyone has bias, yes, even you. 🫵

A guide to recognizing and managing personal bias in technical decision-making, focusing on objective data and open-minded discussions.

software development decision making Team Collaboration

12/6/2025 • EN

Do you use Architecture Decision Records?* I’m a big fan, and I think they’re a best practice every engineering org should adopt.

Explains the value of Architecture Decision Records (ADRs) for documenting technical choices and fostering a collaborative engineering culture.

software development decision making engineering culture

11/29/2025 • EN

Does resource usage within your application or database suddenly spike periodically?* Does it cause system slowdown?

Explains how adding random jitter to scheduled tasks can prevent synchronized resource spikes and improve application performance.

performance optimization scheduled-tasks Resource Management

11/22/2025 • EN

When you shut down an application instance, don't stop the listener immediately — that's how you end up with failed requests during every application rollout. 😢

Explains why stopping a listener immediately during app shutdown causes failed requests and details the correct graceful shutdown sequence.

DevOps Kubernetes Graceful Shutdown

11/15/2025 • EN

A common issue I see when teams first adopt `gRPC` is managing persistent connections, especially during failovers.

Explains gRPC's persistent connection challenges during failover and offers solutions like HTTP/2-aware load balancers.

http/2 Grpc Failover

11/8/2025 • EN

A dangerous mindset I’ve seen—and been guilty of—is assuming code doesn't change.

A developer discusses the dangers of assuming code won't change or be misunderstood, advocating for defensive programming practices.

api design code quality software maintenance

11/1/2025 • EN

⚡️Does saving 1 millisecond really matter?* Answer: more than you’d think.

Explores the compounding impact of shaving milliseconds off microservice latency in distributed systems, affecting throughput and scalability.

performance optimization distributed systems latency

10/28/2025 • EN

Have you heard of Store and Forward?* It’s a resiliency design prevalent in card & bank payments, telecommunications, and other industries.

Explains the Store and Forward resiliency design pattern for handling service dependencies in tech systems like payments and telecom.

distributed systems Message Queue Service Architecture

10/25/2025 • EN

When Building Low-Latency, High-Scale Systems, Push as Much Processing as Possible to Later

A strategy for building low-latency systems by deferring non-essential processing to an event-driven platform to optimize real-time performance.

system design real-time Low Latency

1 2 Next