Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-4468

Draft: Health of the infrastructure to be captured in the Operations Admin portal

Change Owns to Parent OfsSet start and due date...
    XporterXMLWordPrintable

Details

    • Feature
    • Not Assigned
    • PI25
    • None
    • SRCnet
    • Hide

      The SOG needs to know when a site is down. This will help debug service level issues, identify how the SRCNet needs to respond (eg, are all services down or is the infrastructure down?), and understand load distribution across SRCNodes. 

      Eventually, this might help gain insight to how user payloads perform on different sites (if the metrics reported are rich and enable this)

      Show
      The SOG needs to know when a site is down. This will help debug service level issues, identify how the SRCNet needs to respond (eg, are all services down or is the infrastructure down?), and understand load distribution across SRCNodes.  Eventually, this might help gain insight to how user payloads perform on different sites (if the metrics reported are rich and enable this)
    • Hide

      Node exporter is an industry standard cloud native tool that gathers kubernetes nodes metrics to be served into Prometheus/Grafana. Prometheus configured to point to node-exporter services at 2 different sites. Grafana dashboards to view node health per site.

      Show
      Node exporter is an industry standard cloud native tool that gathers kubernetes nodes metrics to be served into Prometheus/Grafana. Prometheus configured to point to node-exporter services at 2 different sites. Grafana dashboards to view node health per site.
    • PI24 - UNCOVERED

    • SRCNet0.x

    Description

      Feature: Health of the infrastructure to be captured in the Operations Admin portal

       

      https://docs.google.com/presentation/d/1EXSiIoot-8gPEXNoTaLirPTpwvkhFrEeat0iifzm3kg/edit?pli=1#slide=id.g21ba435453c_0_1 

      Attachments

        Structure

          Activity

            People

              b.mort Mort, Ben
              r.joshi Joshi, Rohini
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Feature Progress

                Story Point Burn-up: (0%)

                Feature Estimate: 0.0

                IssuesStory Points
                To Do00.0
                In Progress   00.0
                Complete00.0
                Total00.0

                Dates

                  Created:
                  Updated:

                  Structure Helper Panel