Details
-
Feature
-
Could have
-
None
-
None
-
LOW ART
-
-
-
Inter Program, Intra Program
-
3
-
3
-
0
-
-
Description
This feature will cover the initial development of an evolutionary Site Reliability Engineering Mgt Plan for the Infrastructure, Computing and Network systems needed to support the SKA Low telescope operations.
Some possible areas that may be covered are:
- SRE intro and benefits
- Monitoring, alerting and response procedures
- RAMS analysis -> SLOs and error budgets
- Automation
- On call arrangements
- Incident Management
- Release Mgt
- Change Mgt
- Postmortems
- Communications and notifications