Loading...

Change Owns to Parent Ofs

Set start and due date...

Xporter

XML

Word

Printable

Details

Type: Enabler
Priority: Should have
Fix Version/s: PI17
Component/s: COM Software Services
Labels:
- Team_BANG

ARTs:

Services
Benefit hypothesis:
Hide

In order to support the efficient use of AA0.5 ITF Platform Environments, A solution is required that provides observability, and notification of platform state/events.

This requires a decoupling of the infrastructure monitoring/logging from the services and application layers

The introduction of targeted alerts - eg: network/compute/storage/accessibility (include core services eg: ceph/elastic/k8s

Log tagging, views, and dashboards that provide platform insight

Visualisations that provide infra and core services insight
Show
In order to support the efficient use of AA0.5 ITF Platform Environments, A solution is required that provides observability, and notification of platform state/events. This requires a decoupling of the infrastructure monitoring/logging from the services and application layers The introduction of targeted alerts - eg: network/compute/storage/accessibility (include core services eg: ceph/elastic/k8s Log tagging, views, and dashboards that provide platform insight Visualisations that provide infra and core services insight
Acceptance criteria:
Hide

monitoring and logging deployment and node/service integration is abstracted away (in Ansible) so that it can be independently deployed

Probes exist for basic compute, network storage for each node - can be used for VMs and baremetal

Probes exist for core service availability - ceph, elastic, core databases (Prometheus?), k8s (platform availability and loading)

Alerts exist that will show core service accessibility/health issues

Alerts exist that expose core infra/service resource issues such as excess load, diskspace

Log tagging, views, and dashboards that provide platform insight

Visualisations that provide infra and core services insight - OS level metrics
Show
monitoring and logging deployment and node/service integration is abstracted away (in Ansible) so that it can be independently deployed Probes exist for basic compute, network storage for each node - can be used for VMs and baremetal Probes exist for core service availability - ceph, elastic, core databases (Prometheus?), k8s (platform availability and loading) Alerts exist that will show core service accessibility/health issues Alerts exist that expose core infra/service resource issues such as excess load, diskspace Log tagging, views, and dashboards that provide platform insight Visualisations that provide infra and core services insight - OS level metrics
Feature Points:
3
Initial Size:
3
WSJF:
0
Epic Link:
Infrastructure & Platforms
Agile Teams:

Team_BANG
Due Sprint:
Sprint 5
Story Point Burn-up:
Overdue:
Outcomes:
Hide

monitoring and logging deployment and node/service integration is abstracted away (in Ansible) so that it can be independently deployed

Probes exist for basic compute, network storage for each node - can be used for VMs and baremetal

Probes exist for core service availability - ceph, elastic, core databases (Prometheus?), k8s (platform availability and loading)

Alerts exist that will show core service accessibility/health issues

Alerts exist that expose core infra/service resource issues such as excess load, diskspace

Log tagging, views, and dashboards that provide platform insight

Visualisations that provide infra and core services insight - OS level metrics
Show
monitoring and logging deployment and node/service integration is abstracted away (in Ansible) so that it can be independently deployed Probes exist for basic compute, network storage for each node - can be used for VMs and baremetal Probes exist for core service availability - ceph, elastic, core databases (Prometheus?), k8s (platform availability and loading) Alerts exist that will show core service accessibility/health issues Alerts exist that expose core infra/service resource issues such as excess load, diskspace Log tagging, views, and dashboards that provide platform insight Visualisations that provide infra and core services insight - OS level metrics
Resolved PI.Sprint:
17.6

Feature Checklist:

Stories Completed, Integrated, Outcomes Reviewed, Satisfies Acceptance Criteria, Accepted by FO

Requirement Status:

PI24 - UNCOVERED
Labels_MIRO:
Team_BANG

Description

Refactor existing platform monitoring/logging/alerting to provide infra observability: service performance, availability, and capacity

Attachments

Issue Links

clones

SP-2681 KVM/VM management on BM

Done

is cloned by

SP-2694 Evolutionary network requirements for Infra AA0.5

Funnel

SP-2699 Pivot Kubernetes deployment/management to clusterapi

Discarded

mentioned in: Page Loading...; Page Loading...

Structure

Activity

People

Assignee:: Deegan, Miles

Reporter:: Harding, Piers

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Feature Progress

Story Point Burn-up: (100.00%)

Feature Estimate: 3.0

	Issues	Story Points
To Do	0	0.0
In Progress	0	0.0
Complete	5	10.5
Total	5	10.5

Dates

Created:: 08/Aug/22 7:53 AM

Updated:: 16/Feb/24 12:07 PM

Resolved:: 27/Feb/23 1:10 PM

Extract/refactor monitoring and logging for Infra layer