SAFe Program / SP-2375

Distributed Dask Performance


Details

    • Enabler
    • Should have
    • PI14
    • COM SDP SW
    • None
    • Data Processing
    • See description
    • Further implementation refactoring of streaming FFT performance to provide clear processing function interface to core algorithm (i.e. identify planning data and functions)
    • Investigate whether we can demonstrate the ability of the algorithm to distribute image and subgrid data within Dask
    • Inter Program
    • 5
    • 5
    • 5
    • 1
    • Team_ORCA
    • Sprint 5
    • We have refactored the distributed Fourier transform code to provide a clean separation of the processing functions and the Dask workflow. The implementation is described in the API section of the documentation.

      We have started to investigate the memory consumption of the algorithm in the various scenarios described in ORC-1273. The results are shown on that ticket and in Memory Usage Investigation.
    • 17.4
    • Stories Completed, Outcomes Reviewed, NFRs met, Demonstrated, Satisfies Acceptance Criteria, Accepted by FO

    Description

      Follows on from the work in SP-2086. Aims to address the questions posed in goal SPO-1593.

      Who?

      SDP pipeline developers, SDP architects

      What?

      • Identify and investigate the performance of Dask through focused testing, using an I/O-insensitive distributed processing pattern that targets one of the most challenging SDP imaging use cases.
        • This specifically means addressing the central scaling challenge: keeping both image and grid data (and therefore visibilities, once we get to gridding) distributed independently.
        • To prove this, we should show that we can effectively distribute both the image and subgrid load (i.e. demonstrate that memory requirements per node decrease as we add more nodes) while still being able to correctly transform one into the other.
        • Note that checking correctness would generally require solving a full-size FFT. It is therefore advisable to test with simple patterns where only a few (known) image or uv grid points are set in the input, so that any output uv grid or image point can be predicted relatively cheaply by direct evaluation of the Fourier transform.
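The sparse-input correctness check described in the last bullet can be sketched as follows. This is a minimal illustration only, not the project's test code: the function name `direct_dft_point`, the grid size and the source positions are all hypothetical, and the sign and index convention is assumed to match `numpy.fft.fft2` (no centring, no normalisation), which may differ from the convention used in the actual implementation. The full-size FFT is computed here only because the grid is small; at real sizes only the direct evaluation would be affordable.

```python
import numpy as np


def direct_dft_point(sources, n, u, v):
    """Evaluate a single output point (u, v) of an n x n 2D DFT directly.

    `sources` lists the few non-zero input pixels as (row, col, amplitude).
    The cost is proportional to the number of sources, not to n * n.
    Convention assumed: same as numpy.fft.fft2.
    """
    return sum(
        amp * np.exp(-2j * np.pi * (u * row + v * col) / n)
        for row, col, amp in sources
    )


# Hypothetical test pattern: three known point sources on a small grid.
n = 64
sources = [(3, 5, 1.0), (40, 17, 0.5), (10, 60, 2.0)]

# Build the dense image only to obtain a reference result for comparison.
image = np.zeros((n, n), dtype=complex)
for row, col, amp in sources:
    image[row, col] = amp
reference = np.fft.fft2(image)

# Spot-check a few output points against the direct evaluation.
checks = [(0, 0), (1, 2), (31, 63)]
max_error = max(
    abs(direct_dft_point(sources, n, u, v) - reference[u, v])
    for u, v in checks
)
print(f"largest deviation over {len(checks)} points: {max_error:.2e}")
```

In a distributed run, the same spot check could be applied to whichever output points happen to live on a given worker, so correctness can be verified without ever assembling the full grid in one place.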

      Why?

      In the not too distant future (ideally by the end of PI14) we need to make a pivot-or-persevere decision: either Dask will be used as an execution framework (EF) for SDP development, or we will need to focus on other options. Dask is an attractive option because it has great community support and may improve interoperability with other astronomy software outside of SKA. However, we believe there may be a number of problems with Dask scaling, and investigating them is clearly very important if we are to adopt Dask for most of the SKA pipeline development for the foreseeable future.

              People

                p.wortmann Wortmann, Peter
                b.mort Mort, Ben

                Feature Progress

                  Story Point Burn-up: (100.00%)

                  Feature Estimate: 5.0

                  Status        Issues   Story Points
                  To Do         0        0.0
                  In Progress   0        0.0
                  Complete      14       41.5
                  Total         14       41.5
