Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-4468

Draft: Health of the infrastructure to be captured in the Operations Admin portal

Change Owns to Parent OfsSet start and due date...
    XporterXMLWordPrintable

Details

    • SRCnet
    • Hide

      The SOG needs to know when a site is down. This will help debug service level issues, identify how the SRCNet needs to respond (eg, are all services down or is the infrastructure down?), and understand load distribution across SRCNodes. 

      Eventually, this might help gain insight to how user payloads perform on different sites (if the metrics reported are rich and enable this)

      Show
      The SOG needs to know when a site is down. This will help debug service level issues, identify how the SRCNet needs to respond (eg, are all services down or is the infrastructure down?), and understand load distribution across SRCNodes.  Eventually, this might help gain insight to how user payloads perform on different sites (if the metrics reported are rich and enable this)
    • Hide

      Node exporter is an industry standard cloud native tool that gathers kubernetes nodes metrics to be served into Prometheus/Grafana. Prometheus configured to point to node-exporter services at 2 different sites. Grafana dashboards to view node health per site.

      Show
      Node exporter is an industry standard cloud native tool that gathers kubernetes nodes metrics to be served into Prometheus/Grafana. Prometheus configured to point to node-exporter services at 2 different sites. Grafana dashboards to view node health per site.
    • PI24 - UNCOVERED

    • SRCNet0.x

    Description

      Feature: Health of the infrastructure to be captured in the Operations Admin portal

       

      https://docs.google.com/presentation/d/1EXSiIoot-8gPEXNoTaLirPTpwvkhFrEeat0iifzm3kg/edit?pli=1#slide=id.g21ba435453c_0_1 

      Attachments

        Issue Links

          Structure

            Activity

              People

                b.mort Mort, Ben
                r.joshi Joshi, Rohini
                Votes:
                0 Vote for this issue
                Watchers:
                0 Start watching this issue

                Feature Progress

                  Story Point Burn-up: (0%)

                  Feature Estimate: 0.0

                  IssuesStory Points
                  To Do00.0
                  In Progress   00.0
                  Complete00.0
                  Total00.0

                  Dates

                    Created:
                    Updated:

                    Structure Helper Panel