SAFe Program: SP-3509

SRC workflow example: Image cutouts


Details

    • Feature
    • Should have
    • PI19
    • None
    • None
    • SRCnet

      SRCs will provide pre-built scripts for the standard workflows that users expect to run. We aim to implement some of these and to test their effectiveness with regard to hardware limitations and data storage schemes.

      This particular workflow provides the user with many small cutouts of individual sources, based on an input list of RA/Dec positions. Initially it operates on a single image in one location; it is then extended to search multiple images in the same location.

      In the future, this will be developed into a more expansive test in which thousands of images are distributed across storage systems, and the workflow searches all of them for the user's requested sources and returns the cutouts to a single location for the user.


      AC1: Have a containerised script that downloads a LOFAR LoTSS image, then extracts cutout images of individual sources based on a user's list of RA/Dec positions and sizes.

      AC2: Extend this script to download multiple LOFAR LoTSS images and search through them in the same storage location to get cutouts, e.g. by checking the FITS header information of the accessible images.

      (not done) Stretch AC3: Upload a handful of LOFAR LoTSS images (that overlap with SDSS and other relevant multi-wavelength data) to a single Rucio RSE.

      (not done) Stretch AC4: Enable the script to download data from Rucio instead of the LOFAR archive.

      (not done) Stretch AC5: Obtain cutouts from a distributed survey data set and return those cutouts to a single location for the user.

    • 1
    • 1
    • 0
    • Team_CORAL
    • Sprint 3
    • Demo presented at System Demo 19.4 Part 1 (27th July).
      Slides: https://docs.google.com/presentation/d/159tl5ZOG3DLq4s74-PgsDzZpYxxRa7I3fUzmuL4Ez78/edit?usp=sharing
      Recording: https://confluence.skatelescope.org/display/SRCSC/2023-07-27+SRC+ART+System+Demo+19.4+Part+1
      Outcomes:
      Repository: https://gitlab.com/ska-telescope/src/src-workloads
    • 20.6
    • Stories Completed, Integrated, Solution Intent Updated, BDD Testing Passes (no errors), Outcomes Reviewed, NFRS met, Demonstrated, Satisfies Acceptance Criteria, Accepted by FO
    • PI24 - UNCOVERED

    Description

      Input data: Large images.

      Output data: Small images cut out around individual sources.

      Software involved: Python (astropy).

      Workflow steps: Define the coordinates and the area, then run a script to cut that region out of the image.
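
The workflow steps above can be sketched with the standard library alone. This is a minimal illustration, not the script in the repository: it assumes a north-up image with no rotation and a field small enough for a flat-sky approximation, and it reads only the standard FITS keywords CRVAL1/2, CRPIX1/2, CDELT1/2 and NAXIS1/2 from a plain dict. The function names (`sky_to_pixel`, `cutout_bounds`, `cut_out`) are hypothetical. In practice astropy, the software this ticket names, provides a full WCS transform and `astropy.nddata.Cutout2D` for this job.

```python
import math


def sky_to_pixel(header, ra_deg, dec_deg):
    """Approximate 0-based (x, y) pixel position of a sky coordinate.

    Assumption: north-up image, no rotation, flat-sky approximation.
    Real survey images need a proper WCS transform instead.
    """
    # Wrap the RA offset into [-180, 180) and scale it by cos(dec).
    d_ra = (ra_deg - header["CRVAL1"] + 180.0) % 360.0 - 180.0
    d_ra *= math.cos(math.radians(header["CRVAL2"]))
    d_dec = dec_deg - header["CRVAL2"]
    # CRPIX is 1-based in FITS, so subtract 1 for 0-based indices.
    x = header["CRPIX1"] - 1 + d_ra / header["CDELT1"]
    y = header["CRPIX2"] - 1 + d_dec / header["CDELT2"]
    return x, y


def cutout_bounds(header, ra_deg, dec_deg, size_deg):
    """Return (x0, x1, y0, y1) slice bounds clipped to the image, or None."""
    x, y = sky_to_pixel(header, ra_deg, dec_deg)
    cx, cy = round(x), round(y)
    half_x = round(0.5 * size_deg / abs(header["CDELT1"]))
    half_y = round(0.5 * size_deg / abs(header["CDELT2"]))
    x0, x1 = max(0, cx - half_x), min(header["NAXIS1"], cx + half_x + 1)
    y0, y1 = max(0, cy - half_y), min(header["NAXIS2"], cy + half_y + 1)
    if x0 >= x1 or y0 >= y1:
        return None  # the requested region lies entirely outside the image
    return x0, x1, y0, y1


def cut_out(image, bounds):
    """Slice a row-major image (a list of rows) with the given bounds."""
    x0, x1, y0, y1 = bounds
    return [row[x0:x1] for row in image[y0:y1]]
```

Here a list of rows stands in for the pixel array. A request that falls partly outside the image is clipped to the image edge, and one that falls entirely outside returns `None`, so the caller can move on to the next image.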

      State of existing workflows: Scripts exist, but they depend on how the data are made available, e.g. on how to search through thousands of images to find the relevant one. Writing a script from scratch should be fairly straightforward: a small amount of work to cut out from one image, and a moderate amount of work to search thousands of images for the cutout. The data can also be distributed, and users could give a list of sources as input to obtain hundreds of cutouts.
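
One way to approach the search through many images mentioned above is a cheap pre-filter on header information: compare each source position with the image centre and keep only images whose bounding circle contains it. The sketch below is an assumption about how this could be done, not the repository's method. It takes headers as plain dicts, treats (CRVAL1, CRVAL2) as the image centre (which assumes the reference pixel sits near the middle of the image), and uses hypothetical function and file names.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(a))))


def image_radius_deg(header):
    """Half-diagonal of the image in degrees, from its size and pixel scale."""
    width = header["NAXIS1"] * abs(header["CDELT1"])
    height = header["NAXIS2"] * abs(header["CDELT2"])
    return 0.5 * math.hypot(width, height)


def candidate_images(headers, ra_deg, dec_deg):
    """Names of images whose bounding circle contains the position.

    `headers` maps an image name to a dict of header keywords.
    """
    return [
        name for name, hdr in headers.items()
        if angular_separation_deg(hdr["CRVAL1"], hdr["CRVAL2"], ra_deg, dec_deg)
        <= image_radius_deg(hdr)
    ]


def match_sources(headers, sources):
    """Map each (name, ra_deg, dec_deg) source to its candidate images."""
    return {name: candidate_images(headers, ra, dec) for name, ra, dec in sources}
```

The bounding circle over-covers the image corners, so this filter is conservative: every image that contains the source is kept, but a kept image still needs a pixel-level check before cutting out.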

      Architecture and hardware: CPU. Distributed data. 
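
The user-supplied list of sources mentioned in the description could be read as follows. This is a sketch under stated assumptions: the ticket does not define the input format, so the CSV layout, the column names `ra`, `dec` and `size`, and the choice of degrees for all three are hypothetical, as are the names `CutoutRequest` and `read_requests`.

```python
import csv
import io
from typing import NamedTuple


class CutoutRequest(NamedTuple):
    ra_deg: float
    dec_deg: float
    size_deg: float


def read_requests(stream):
    """Parse cutout requests from CSV text with ra, dec and size columns.

    Assumption: all three columns are in degrees.
    """
    reader = csv.DictReader(stream)
    requests = []
    for row in reader:
        line_no = reader.line_num
        try:
            req = CutoutRequest(float(row["ra"]), float(row["dec"]),
                                float(row["size"]))
        except (KeyError, TypeError, ValueError) as exc:
            raise ValueError(f"line {line_no}: bad or missing value") from exc
        if not (0.0 <= req.ra_deg < 360.0 and -90.0 <= req.dec_deg <= 90.0):
            raise ValueError(f"line {line_no}: position out of range")
        if req.size_deg <= 0.0:
            raise ValueError(f"line {line_no}: size must be positive")
        requests.append(req)
    return requests


# Example with an in-memory file; a real run would pass an open file instead.
example = io.StringIO("ra,dec,size\n180.0,45.0,0.05\n10.5,-30.25,0.1\n")
requests = read_requests(example)
```

Validating the list up front means a malformed row is reported with its line number before any image is downloaded or searched.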


              People

                A.Clarke Clarke, Alex
                r.bolton Bolton, Rosie

                Feature Progress

                  Story Point Burn-up: (100.00%)

                  Feature Estimate: 1.0

                                Issues   Story Points
                  To Do         0        0.0
                  In Progress   0        0.0
                  Complete      5        8.0
                  Total         5        8.0
