Game Day
ELI5 — The Vibe Check
A Game Day is when your engineering team deliberately causes a production incident in a controlled setting to practice: Who gets paged? Who's on call? Do the runbooks make sense? Does your monitoring actually catch the thing? Think of it as a fire drill, but instead of exiting the building, you try to stop the building from burning while everyone watches.
Real Talk
undefined
When You'll Hear This
undefined
Related Terms
Chaos Engineering
Imagine stress-testing a bridge by parking trucks on it before opening day instead of hoping it holds.
Incident Response
Incident Response is the process your team follows when production breaks. Who gets paged? Who's the incident commander?
Observability
Observability is the ability to understand what's happening inside your system from the outside, using three types of data: metrics (numbers), logs (events...