Reliability

Feeds to Scour
SubscribedAll
Scoured 173 posts in 7.1 ms

Ops I did it again: The SRE Extension is out!

 🔬eBPF  Content type: Blog
medium.com
·

Komodor Brings Autonomous AI to SRE With Reliability-First Cloud Optimization

 📦Containerization
cloudnativenow.com·

The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure

 📈Scalability
devops.com·

Maintain observability during cloud outages with Datadog Disaster Recovery

 📈Scalability  Content type: Blog
datadoghq.com·

Observability overload is drowning engineers

 🔬eBPF
thenewstack.io·

melancholictheory/wellcake: A Kubernetes operator for Valkey — Standalone / Replication / Sentinel / Cluster, operator-driven failover, proactive zero-downtime rolling restarts, Atomic Slot Migration, S3 backups, multi-region replication.

 📦Containerization  Content type: Code
github.com··r/devops

Practice like you play: How Amazon scales resilience to new heights (ARC316)

 🏗️Systems Design  Content type: Blog
blog.domb.net·

The single-cloud trap: why UK businesses’ multi-cloud strategy risks leaving them exposed

 📈Scalability
techradar.com
·

Azure Availability Zone Mapping and VM Resilience Analysis Guidance using SRE.AZURE.COM Agent

 📈Scalability

Our DNS servers use GeoDNS to direct connections to the lowest latency servers and implement automatic failover via health checks and 5 minute expiry for the...

 ⚖️Load Balancing
grapheneos.social·

New comment by RomainB_ in "Ask HN: Who wants to be hired? (June 2026)"

 📦Containerization  Content type: Discussion

Elastic brings AI-driven incident investigation to Kubernetes and observability tools

 📦Containerization
helpnetsecurity.com·

SwarmSense-DNN: A Trustworthy and Decentralized Neural Framework for Proactive Anomaly Defense in Consumer IoT

 🤝Consensus Protocols  Content type: Academic
arxiv.org·

The Server That Does Nothing Is Often the Most Critical

 📈Scalability
siliconopera.com·

SRE Weekly Issue #520

 ⏱️Performance
sreweekly.com·

Explore OpenSearch 3.7

 🔬eBPF  Content type: Blog
opensearch.org·

The Hidden HA Gaps Costing You Uptime | Trey Isaac, SIOS Technology

 📈Scalability  Content type: Video
youtube.com·

The Four Knobs of AI Agent Reliability: A DevOps View

 🔧Microservices  Content type: Blog
talent500.com·

SQL Server Always On Availability Groups and Database Master Keys: A Hidden Failover Pitfall

 📈Scalability  Content type: Blog
dbi-services.com·

Scale. Speed. Trust: Three Imperatives for the AI Era

 🌐Distributed Systems  Content type: Blog
blogs.cisco.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help