Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-3823

Investigate functionality, performance and scaling of other existing pipelines

Details

    • Enabler
    • Should have
    • PI21
    • COM SDP SW
    • None
    • Data Processing
    • Hide

      See Why? in description

      Show
      See Why? in description
    • Hide

      See What? in description

      Show
      See What? in description
    • 8
    • 8
    • 0
    • Team_HIPPO, Team_YANDA
    • Sprint 5
    • Overdue
    • Hide

      HIPPO PI21:
      We have investigated the image quality produced and performance of the DDF (DDFacet and killMS) pipeline on the same meerKAT dataset as was used for the MID Self-Cal pipeline. DDF pipeline is using a different set of software tools and algorithms to produce radio interferometry images. For the purpose of imaging MeerKAT data the DDF pipeline had to be modified. A summary, that describes installation, configuration and changes together with the final image can be found on the confluence page https://confluence.skatelescope.org/display/SE/HIP-766%3A+Run+ddf-pipeline+with+MeerKAT+data. There was no direct comparison done between MID Self-Cal and DDFacet pipeline this PI (PI21).
      The benchmarks were limited to the total execution time of 12 hours on 1TB, 96-core 2x AMD EPYC 9454 node. Further benchmarks will be required. 

      Show
      HIPPO PI21: We have investigated the image quality produced and performance of the DDF (DDFacet and killMS) pipeline on the same meerKAT dataset as was used for the MID Self-Cal pipeline. DDF pipeline is using a different set of software tools and algorithms to produce radio interferometry images. For the purpose of imaging MeerKAT data the DDF pipeline had to be modified. A summary, that describes installation, configuration and changes together with the final image can be found on the confluence page https://confluence.skatelescope.org/display/SE/HIP-766%3A+Run+ddf-pipeline+with+MeerKAT+data . There was no direct comparison done between MID Self-Cal and DDFacet pipeline this PI (PI21). The benchmarks were limited to the total execution time of 12 hours on 1TB, 96-core 2x AMD EPYC 9454 node. Further benchmarks will be required. 
    • PI22 - UNCOVERED

    • Low G4 Mid G3

    Description

      See frame in PI21 Backlog board


      Who? (Beneficiaries)

      • Pipeline developers.
      • System scientists (Commissioning).
      • Commissioning and Operations staff planning for science commissioning & verification ahead of and during AA2.

      Why? (Benefit hypothesis)

      • We need to ensure that we maintain a good idea of what the state-of-the art is in terms of radio astronomy pipelines.
      • As the large data amounts will make it tricky to test externally, we will likely even need to make preparations for running external pipelines on the SDP system in order to do testing.

      What? (Acceptance criteria)

      • (Re-)establish "radar" of pipelines and technologies - idea is that we periodically check their relevance to the SKA. (DDFacet: HIPPO, ASKAPSoft: YANDA) Recommended engagement levels might be something along the lines of:
        • Hold (we don't currently think they have something to offer us, but track them in case something happens);
        • Assess (there are interesting aspects to it that we might want to adopt, so we'd look into the design / trial integration);
        • Trial (we actually do end-to-end runs with our datasets, and aim for loose integration into SDP as batch processing blocks);
        • Adopt (we fully integrate, and actively work on all qualities relevant to us).
      • Do this for some reasonably broad categorisation:
        • Preprocessing/self-calibration/instrumental calibration/data product preparation pipelines, individual algorithms etc.?
        • Architectural qualities we are interested in would be - as usual - modifiability and scalability & performance,
        • Functional qualities as of SKA requirements
        • YANDA: Analysis of ASKAPSoft continuum imaging with distributed facets (scalability, performance and perhaps modifiability)
      • Do deeper analysis / test runs of pipelines in "assess" or better.

      Attachments

        Issue Links

          Structure

            Activity

              People

                b.nikolic Nikolic, Bojan
                m.ashdown Ashdown, Mark
                Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Feature Progress

                  Story Point Burn-up: (89.74%)

                  Feature Estimate: 8.0

                  IssuesStory Points
                  To Do12.0
                  In Progress   00.0
                  Complete1117.5
                  Total1219.5

                  Dates

                    Created:
                    Updated:

                    Structure Helper Panel