Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-4502

Monitoring of processing scripts and workflows

Change Owns to Parent OfsSet start and due date...
    XporterXMLWordPrintable

Details

    • Feature
    • High
    • PI24
    • COM SDP SW
    • None
    • Data Processing
    • Hide

      As a AIV Engineer / Telescope Operator, I want to be informed immediately if there is a error in real-time processing, either in the processing scripts which orchestrate the execution of processing blocks, or in the execution engines and workflows that they deploy.

      Show
      As a AIV Engineer / Telescope Operator, I want to be informed immediately if there is a error in real-time processing, either in the processing scripts which orchestrate the execution of processing blocks, or in the execution engines and workflows that they deploy.
    • Hide
      1. Each processing script monitors the execution engines and workflows it deploys, and reports any errors encountered, including in the script itself.
      2. The processing controller monitors the processing scripts and, as a last resort, reports if a processing script has failed.
      3. The Helm Deployer collects error messages from deployments and stores the information in deployment state.
      4. Each subarray monitors its associated real-time processing blocks and reports errors via its errorMessage attribute.
      5. Each subarray sets its ObsState to FAULT and HealthState to DEGRADED/FAILED when (agreed) conditions are met.
      6. Implementation covers all six scenarios described in the Design link below (Confluence page).
      Show
      Each processing script monitors the execution engines and workflows it deploys, and reports any errors encountered, including in the script itself. The processing controller monitors the processing scripts and, as a last resort, reports if a processing script has failed. The Helm Deployer collects error messages from deployments and stores the information in deployment state. Each subarray monitors its associated real-time processing blocks and reports errors via its errorMessage attribute. Each subarray sets its ObsState to FAULT and HealthState to DEGRADED/FAILED when (agreed) conditions are met. Implementation covers all six scenarios described in the Design link below (Confluence page).
    • Intra Program
    • 3
    • 3
    • 0
    • Team_ORCA
    • Sprint 5
    • PI24 - UNCOVERED

    Description

      See Feature frame on DP ART board.

      Context

      See Capabilities SS-155, SS-156 and SS-185.

      At present the SDP does not systematically monitor processing scripts and execution engines / workflows for any errors or failures that may occur. It only happens during some of the "long-running" subarray commands. The AssignResources command waits for the real-time processing scripts to indicate they have started successfully, and the Configure command waits for the execution engine deployments to succeed. If an error does occur it is not reported immediately, but only indirectly when the command times out. If an error occurs outside one of these commands, then it is not reported.

      References

      Attachments

        Issue Links

          Structure

            Activity

              People

                m.ashdown Ashdown, Mark
                m.ashdown Ashdown, Mark
                Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Feature Progress

                  Story Point Burn-up: (0%)

                  Feature Estimate: 3.0

                  IssuesStory Points
                  To Do10.0
                  In Progress   37.0
                  Complete00.0
                  Total47.0

                  Dates

                    Created:
                    Updated:

                    Structure Helper Panel