You may be building for availability, but are you building for resiliency?
Explains the difference between high availability and high resiliency in system design, and why both are crucial.
Explains the difference between high availability and high resiliency in system design, and why both are crucial.
A developer shares insights and design considerations for building a distributed actor framework, based on experience with Dapr, Orleans, and a custom Go project.
Part 2 of a guide on running Apache Flink on Kubernetes, covering fault tolerance, high availability, savepoints, and observability.
Part 2 of a guide on running Apache Flink on Kubernetes, covering fault tolerance, high availability, savepoints, and observability.
A review and analysis of the classic 1989 paper on using timed leases for cache consistency in distributed file systems.
Announcing the 1.0.0 Release Candidate for IdempotentAPI, a .NET library for building fault-tolerant, idempotent Web APIs in distributed systems.
Analysis of how 'limping' or slow hardware nodes can severely degrade distributed system performance, based on academic research.
Introduces Akka, an open-source toolkit for building concurrent, scalable, and fault-tolerant applications using the Actor Model in Scala and Java.
Introduces Active Objects, a Scala library combining actors, supervisor hierarchies, and OO methods for fault-tolerant, concurrent asynchronous components.
Introduces a Scala library implementing Erlang-style Supervisor hierarchies for building fault-tolerant actor systems.