Add SRE improvisation
This commit is contained in:
parent
326bad03ea
commit
618c2c7ff2
@ -1083,6 +1083,12 @@ Also see the Debugging section in this doc.
|
||||
- [Incident Management Resources](https://resources.sei.cmu.edu/library/asset-view.cfm?assetID=505044), Carnegie Mellon University
|
||||
- [Sterile flight deck rule](https://en.wikipedia.org/wiki/Sterile_flight_deck_rule), Wikipedia
|
||||
- [Shamir Secret Sharing It’s 3am.](https://max.levch.in/post/724289457144070144/shamir-secret-sharing-its-3am-paul-the-head-of)
|
||||
- [Site Reliability Engineering and the Art of Improvisation](https://thenewstack.io/site-reliability-engineering-and-the-art-of-improvisation/) has lots of good training ideas
|
||||
- Walkthroughs of observability toolsets
|
||||
- Decision requirements table building
|
||||
- Team knowledge elicitation
|
||||
- Asking the question, “Why do we have on-call?”
|
||||
- Spin the Wheel of Expertise!
|
||||
|
||||
Alerting:
|
||||
|
||||
@ -1105,7 +1111,6 @@ Alerting:
|
||||
- [Human error models and management](https://app.box.com/s/7z35l09amvr1vwxouh2s)
|
||||
- High reliability organisations — which have less than their fair share of accidents — recognise that human variability is a force to harness in averting errors, but they work hard to focus that variability and are constantly preoccupied with the possibility of failure
|
||||
|
||||
|
||||
> "Let’s plan for a future where we’re all as stupid as we are today."
|
||||
>
|
||||
> – Dan Milstein
|
||||
|
Loading…
Reference in New Issue
Block a user