SRE

site reliability engineering, observability, incident response, SLOs

Feeds to Scour
SubscribedAll
Scoured 239 posts in 7.2 ms

Ops I did it again: The SRE Extension is out!

 ☁️GCP  Content type: Blog
medium.com
·

Komodor Brings Autonomous AI to SRE With Reliability-First Cloud Optimization

 ☸️Kubernetes
cloudnativenow.com·

The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure

 🚀LLM Serving
devops.com·

Explore OpenSearch 3.7

 ⚙️ML Infrastructure  Content type: Blog
opensearch.org·

Profile-Driven Observability

 🚀LLM Serving

The Ultimate Windows Security Event ID Cheatsheet for Blue Teams & DFIR

 🤝AI-Assisted Coding  Content type: Blog
medium.com
·

Cloud Post-Mortem #3: The AI That Didn’t Invent Anything — But Weaponized Everything

 🤝AI-Assisted Coding  Content type: Blog
medium.com
·

Monitor and govern AI agents in production with AgentOps - Azure AI Tech Accelerator

 🤝AI-Assisted Coding

Cybersecurity graduate seeking Information Security Analyst, Cyber Security Anal...

 🦀Systems Programming  Content type: Discussion

The Four Knobs of AI Agent Reliability: A DevOps View

 🕸️Distributed Systems  Content type: Blog
talent500.com·

Building a Zero-Server Network Forensics Suite with Rust and WebAssembly

 🦀Systems Programming  Content type: Code
github.com··DEV

Trace n8n workflow and node executions with OpenTelemetry

 📋Kueue  Content type: Blog
blog.n8n.io·

Benchmarking and Exploring the Capabilities of LLMs for Attack Investigations

 🚀LLM Serving  Content type: Academic
arxiv.org·

Microsoft releases incident response playbook for Copilot and Azure AI

 🤝AI-Assisted Coding
4sysops.com·

Full Observability for Pinecone: Introducing an Open-Source Monitoring Stack for SaaS and BYOC

 ☸️Kubernetes  Content type: Blog
pinecone.io·

How Cisco IT cut observability costs by 86% and eliminated major network outages

 🕸️Distributed Systems  Content type: News
networkworld.com·

How Threat Intelligence Improves Detection and Response Across Digital Enterprises?

 🌐Semiconductor Geopolitics  Content type: Blog
medium.com·

Facebook spent billions on the metaverse and all they got was a new name

 🌐Cilium
disassociated.com·

How 24/7/365 SOC Improves Incident Response Times?

 🕸️Distributed Systems  Content type: Blog
medium.com·

How DevOps Engineers Can Use AI to Triage Production Incidents Faster

 ☸️Kubernetes  Content type: Blog
devopsaitoolkit.com··DEV

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help