Fault Tolerance

Feeds to Scour
SubscribedAll
Scoured 152 posts in 7.3 ms

Maintain observability during cloud outages with Datadog Disaster Recovery

 🛡️Reliability Engineering  Content type: Blog
datadoghq.com·

melancholictheory/wellcake: A Kubernetes operator for Valkey — Standalone / Replication / Sentinel / Cluster, operator-driven failover, proactive zero-downtime rolling restarts, Atomic Slot Migration, S3 backups, multi-region replication.

 🌲Persistent Data Structures  Content type: Code
github.com··r/devops

Building a Concurrent AI Call Center for Field Service Operations — VAPI, Idempotency, and What…

 ⏸️Backpressure  Content type: Blog
medium.com
·

re:Invent 2022 Building Confidence Through Chaos Engineering on AWS

 🛡️Reliability Engineering  Content type: Blog
blog.domb.net·

Introducing the Snowflake and AWS Custom Lens for the AWS Well-Architected Framework

 ⏸️Backpressure  Content type: Blog
aws.amazon.com·

Our DNS servers use GeoDNS to direct connections to the lowest latency servers and implement automatic failover via health checks and 5 minute expiry for the...

 🔀Concurrency
grapheneos.social·

SQL Server Always On Availability Groups and Database Master Keys: A Hidden Failover Pitfall

 🛡️Reliability Engineering  Content type: Blog
dbi-services.com·

When Your AI Provider Fails: Building a Resilient Fallback System

 ⏸️Backpressure
gist.github.com··DEV

Designing for High Availability: The Operational Reference for Running a Geo-Replicated ACR

 ⏸️Backpressure

New comment by mattmanser in "LLMs are eroding my software engineering career and I don't know what to do"

 🔄REPL-Driven Development  Content type: Discussion

Cascading Failure: The Spanish Navy’s Reserve Squadron and the Tragedy of Unpreparedness

 📚Etymology
warontherocks.com·

Pgpool-II 4.7.2, 4.6.7, 4.5.12, 4.4.17 and 4.3.20 released.

 ⏸️Backpressure
postgresql.org·

Gauging the Spacetime Code

 🤖Claude Code  Content type: Academic
arxiv.org·

The Server That Does Nothing Is Often the Most Critical

 🔀Concurrency
siliconopera.com·

Take LangGraph to Production | Durable AI Agents

 🔀Concurrency  Content type: Discussion  Content type: Tutorial
diagrid.io·

Cache Stampede Prevention: Distributed Locking, Pub/Sub, and Request Coalescing

 🤖Claude Code  Content type: News  Content type: Blog

Nemotron 3 Ultra now available on AI Gateway

 ⏸️Backpressure
vercel.com·

Route public traffic to private applications with Cloudflare

 ⏸️Backpressure  Content type: Blog
blog.cloudflare.com·

The Hidden HA Gaps Costing You Uptime | Trey Isaac, SIOS Technology

 🛡️Reliability Engineering  Content type: Video
youtube.com·

Cascading failures triggered by localized vulnerabilities in higher-order networks

 🛡️Reliability Engineering  Content type: Academic
sciencedirect.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help