[ ] Monitoring architecture (Tier 1/2/3) - [ ] Alert threshold matrix - [ ] Production runbooks (at least 6) - [ ] DR design with RTO/RPO tiers - [ ] DR test plan and schedule