Details
-
Spike
-
Not Assigned
-
None
-
Obs Mgt & Controls
-
-
- Demonstrate ability to debug a Configure command across all the devices involved TM, CSP and SDP.
- Call graph is not required, but at least see which devices were involved.
-
1
-
1
-
8
-
Team_KAROO
-
Sprint 3
-
-
-
6.5
-
Team_KAROO goal_O3
Description
Timeboxing this into a Spike for PI5. Further work could be done as a follow up captured in Feature SP710 (Clone)
Syslog and the Elastic stack were identified for logging solution in Pre-construction and the containerisation standards, with TANGO as telescope SCADA.
Come up with solution for distributed tracing events throughout the system, including what is handled via e.g. ELK, and TANGO. For instance, tracing from scheduling block execution to downstream events.
As a stretch, this ticket could be expanded to also integrate proposed/selected solutions for monitoring of virtual and physical compute infrastructure as well as logs and TANGO.
Example solutions (to give an idea of what is meant by tracing)
- ISTIO
- Using APM agents, Elastic APM and APM Kibana GUI supports distributed tracing. Elastic APM also automatically collects unhandled errors and exceptions.
This feature will be tested as follows:
A design review of the documentation will be held with Architects after which all of their observations are addressed. However, the design review should occur in an iterative fashion to get continuos and quick feedback. The test will, therefore, be a demonstration that all observations from architects were addressed and found acceptable. The documentation will need to be made official and be seen as part of solution intend.
Attachments
Issue Links
- is cloned by
-
SP-710 Solution for tracing events throughout the system Further work
- Discarded
- is required by
-
SP-676 Tracing solution extended to include compute infrastructure metrics
- Analyzing
- relates to
-
SP-427 Implement Platform Monitoring and Observability to support the MVP
- Done
-
SP-1122 Enable use of transaction IDs to aid in tracing of commands through the system
- Done
-
SP-395 Alarm Reporting and Logging Functionality
- Discarded