Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-2589

Centralised Logging and Monitoring with zero-trust principles

Change Owns to Parent OfsSet start and due date...
    XporterXMLWordPrintable

Details

    • Feature
    • Must have
    • PI15
    • None
    • None
    • Services
    • Hide

      Having a centralised logging and monitoring solution will contribute the following:

      • A more robust implementation of logging and monitoring
      • Eliminate many dashboards
      • Aggregate data from different datacenters for further analysis
      • Lower maintenance overhead for management of the services
      Show
      Having a centralised logging and monitoring solution will contribute the following: A more robust implementation of logging and monitoring Eliminate many dashboards Aggregate data from different datacenters for further analysis Lower maintenance overhead for management of the services
    • Hide
      • Centralised Monitoring solution is implemented based on Thanos and Prometheus with all of the ST clusters: STFC Techops, STFC SDH&P, EngageSKA, PSI-Low (PSI-Mid) with HA and zero-trust network setup
      • Grafana dashboards are updated to work with the above monitoring solution
      • ELK is deployed and configured with a HA and zero-trust network setup with all of the ST clusters: STFC Techops, STFC SDH&P, EngageSKA, PSI-Low (PSI-Mid)
      Show
      Centralised Monitoring solution is implemented based on Thanos and Prometheus with all of the ST clusters: STFC Techops, STFC SDH&P, EngageSKA, PSI-Low (PSI-Mid) with HA and zero-trust network setup Grafana dashboards are updated to work with the above monitoring solution ELK is deployed and configured with a HA and zero-trust network setup with all of the ST clusters: STFC Techops, STFC SDH&P, EngageSKA, PSI-Low (PSI-Mid)
    • 2
    • 0
    • Team_SYSTEM
    • 15.6
    • PI22 - UNCOVERED

    Description

      Centralised Logging and Monitoring is implemented so that there's no need to have different managed grafana dashboards and kibana.

      This means that we only have the dashboards on STFC for global usage with integrated dashboards and all logging and monitoring is aggregated using in-transit encrypted traffic for security. 

      Current logging is based on Prometheus and on PI14, as part of ST-1195 a preliminary analysis of Thanos is investigated as a tool for centralised logging solution. (Report is here: https://confluence.skatelescope.org/display/SE/Thanos+Investigation )

      Attachments

        Issue Links

          Structure

            Activity

              People

                m.deegan Deegan, Miles
                U.Yilmaz Yilmaz, Ugur
                Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Feature Progress

                  Story Point Burn-up: (100.00%)

                  Feature Estimate: 2.0

                  IssuesStory Points
                  To Do00.0
                  In Progress   00.0
                  Complete49.0
                  Total49.0

                  Dates

                    Created:
                    Updated:
                    Resolved:

                    Structure Helper Panel