Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-4110

Self-calibration performance improvements through reduced I/O

Details

    • Feature
    • Must have
    • PI22
    • COM SDP SW
    • None
    • Data Processing
    • Hide

      Similar as with the scaling feature, for the benefit of system scientists (Commissioning) and operations we need to care about raw performance of (self-)calibration and imaging pipeline. As we are dealing with an exceptionally data-intensive domain, one of the main ways that we can improve performance (or unlock further performance improvements down the line) is to carefully engineer our pipelines for maximising the efficiency of our I/O - specifically reading and writing visibilities and images (the largest data items) from or to storage as rarely as we can get away with.

      For self-calibration specifically, the theoretical optimum is to read visibilities only once per major loop or self-calibration iteration. At present, the situation seems to be that we are rather reading measured visibilities once per facet every major loop, as well as an entire read-write cycle for the visibility models. Furthermore, we are writing and re-writing visibilities between DP3 and WSClean. All of these are  obvious inefficiencies that we should work towards resolving, even if they might not necessarily give us immediate returns on performance.

      Show
      Similar as with the scaling feature, for the benefit of system scientists (Commissioning) and operations we need to care about raw performance of (self-)calibration and imaging pipeline. As we are dealing with an exceptionally data-intensive domain, one of the main ways that we can improve performance (or unlock further performance improvements down the line) is to carefully engineer our pipelines for maximising the efficiency of our I/O - specifically reading and writing visibilities and images (the largest data items) from or to storage as rarely as we can get away with. For self-calibration specifically, the theoretical optimum is to read visibilities only once per major loop or self-calibration iteration. At present, the situation seems to be that we are rather reading measured visibilities once per facet every major loop, as well as an entire read-write cycle for the visibility models. Furthermore, we are writing and re-writing visibilities between DP3 and WSClean. All of these are  obvious inefficiencies that we should work towards resolving, even if they might not necessarily give us immediate returns on performance.
    • Hide

      WSClean optimisations

      • Introduce a way for WSClean to overlap I/O with computation (can boil down to just running instances / tasks in parallel - but need to consider memory usage!) [PANDO]
      • Allow sharing of visibilities between facets without the need to make copies for applying calibration (ideally for both predict and invert) [PANDO]

      Improve storage interface

      • Support for writing data from DP3
      • Support for reading data into WSClean
      Show
      WSClean optimisations Introduce a way for WSClean to overlap I/O with computation (can boil down to just running instances / tasks in parallel - but need to consider memory usage!) [PANDO] Allow sharing of visibilities between facets without the need to make copies for applying calibration (ideally for both predict and invert) [PANDO] Improve storage interface Support for writing data from DP3 Support for reading data into WSClean
    • 8
    • 8
    • 0
    • Team_PANDO
    • Sprint 3
    • Overdue
    • PI22 - UNCOVERED

    Attachments

      Structure

        Activity

          People

            p.wortmann Wortmann, Peter
            D.Fenech Fenech, Danielle
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Feature Progress

              Story Point Burn-up: (84.21%)

              Feature Estimate: 8.0

              IssuesStory Points
              To Do11.0
              In Progress   12.0
              Complete716.0
              Total919.0

              Dates

                Created:
                Updated:

                Structure Helper Panel