Details
-
Enabler
-
Must have
-
None
-
Obs Mgt & Controls
-
-
-
2
-
0
-
-
-
15.6
Description
Refactor current Telescope On/OFF functionality so that it allows for the TMC to handle "compute infrastructure-related" failures on SDP, CSP, and Dishes whilst performing the act of switching on (e.g. after returning OK but before successfully switched ON/OFF).
Note the handling of this shall be limited to "passive" measures only: E.g appropriate log messages on the client-side to indicate to an investigator when and why something went wrong in the execution of ON/OFF on one of the servers. Also, the scope of these messages/notifications will be limited to leaf nodes monitoring their counterpart devices (e.g. CSP Master Leaf node monitoring CSP Controller counterpart)
Verification shall be done by means of tests on the TMC repo side in which server-side failures can be injected using mocks.
Also, see below for technical discussion:
https://docs.google.com/document/d/15LAXGcrQVfUEQp3-1nPpcUaNNxhH8oszA2u8z1nlS3w/edit#