DevOps-to-SRE Transition: On-Call Without Runbooks Is Obsolete
blog.devops.dev
·21h
🔧DevOps
Preview
Report Post

5 min read5 days ago

1. The 3 A.M. Page That Should Never Exist

It’s 3:07 a.m. A service is down. The alert tells you something is wrong, not why. You Slack the one engineer who “knows this system.” They remember a similar incident from six months ago. A manual restart. A config tweak. The page clears. Everyone goes back to sleep.

By 10 a.m., nobody has written anything down. The system is still fragile. The next page is inevitable.

Now contrast that with another team: the same failure condition occurs — and nothing pages. Traffic is throttled automatically. A dependency is drained. The system degrades gracefully and recovers without human intervention.

Press enter or click to view image in full size

This isn’t about smarter engineers. It’s about arc…

Similar Posts

Loading similar posts...

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help