  1. SAFe Program
  2. SP-3928

SRC workflow example: deployment of processMeerKAT pipeline


Details

    • SRCnet

      Integrating the ProcessMeerKAT pipeline into the SRCNet workflow repository will give us the opportunity to test and study how to fulfil different requirements. In particular, this workflow:

      • requires a different computing environment (SLURM)
      • is designed to handle big datasets and will produce high workloads

      This feature will also produce documentation on how to adapt the pipeline to SRCNet computing infrastructures, ensuring that it is compatible with, and accessible from, several SRCNet infrastructures.

      By achieving this integration, the SRCNet workflow repository will gain a powerful pipeline for MeerKAT data processing, facilitating collaborative research and helping to identify the needs of the computing infrastructures within the SRCNet community.


      AC1: Successful deployment of a ProcessMeerKAT workflow able to analyse big datasets. The SRC workflow example, which deploys the ProcessMeerKAT pipeline, must be successfully implemented within the SRCNet environment in at least one SRC with access to the data. Verification: a scientific team confirms that the ProcessMeerKAT workflow has been deployed, executed, and completed successfully within the SRCNet environment. This will include evidence of calibrated and imaged MeerKAT data.

      AC2: The SRCNet repository has a new entry for the ProcessMeerKAT workflow, including all the elements (documentation, input data and parameters) needed to execute it and produce a high workload. Verification: a demo.

      AC3: Documentation on how to deploy the pipeline. Documentation must be created detailing the deployment of the ProcessMeerKAT pipeline, including step-by-step instructions, configuration settings, and any computing dependencies. Verification: a wiki document.
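
      As an illustration of AC2, the following is a minimal sketch of how a repository entry could be checked for the three elements the criterion names (documentation, input data and parameters). The file and directory names (`README.md`, `config.txt`, `data`) and the function `missing_elements` are hypothetical; the ticket does not define a layout for SRCNet repository entries.

```python
from pathlib import Path

# Hypothetical layout: these names are assumptions made for illustration,
# not an SRCNet repository convention.
REQUIRED_ELEMENTS = {
    "documentation": "README.md",
    "parameters": "config.txt",
    "input data": "data",
}


def missing_elements(entry_dir):
    """Return the AC2 elements that are absent from a workflow entry directory."""
    entry = Path(entry_dir)
    return sorted(
        label
        for label, name in REQUIRED_ELEMENTS.items()
        if not (entry / name).exists()
    )
```

      An empty result would mean that the entry carries every element listed in AC2; it says nothing about whether the workflow actually runs, which remains the subject of the demo.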

    • Intra Program
    • 4
    • 0
    • PI24 - UNCOVERED

    Description

      IDIA has developed ProcessMeerKAT, a pipeline for the calibration and imaging of MeerKAT interferometric data. This pipeline is published in this repository: https://idia-pipelines.github.io/docs/processMeerKAT

       

      This pipeline has been designed to be executed on a SLURM cluster, and the espSRC team did some work in previous PIs to get it deployed on the espSRC platform (https://jira.skatelescope.org/browse/SP-3082). In particular, the SLURM cluster was deployed, the pipeline was installed, and some small tests were run. During that deployment, some issues were found when trying to analyse bigger datasets.

       

      The work proposed here is to 1) scale up the ProcessMeerKAT deployment so that it can be used with big datasets and 2) include this pipeline in the SRCNet repository. Since the pipeline software is already published in the IDIA repository, the work would mainly consist of gathering a parameter file and sample input data to run the pipeline with a significant workload. Documentation will be added explaining how to run the workflow. The SRCNet repository will also contain specific documentation to facilitate the deployment of the pipeline, including its hardware requirements.
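
      One way the hardware requirements could be recorded alongside the workflow is as SLURM batch directives. The sketch below renders such a header from a handful of values. The function name `sbatch_header` and every value passed to it are hypothetical; the ticket does not state the actual resources ProcessMeerKAT needs, and this is not how the pipeline generates its own job scripts.

```python
def sbatch_header(job_name, nodes, tasks_per_node, mem_gb, time_limit):
    """Render the #SBATCH header lines describing one job's resource request."""
    return "\n".join(
        [
            "#!/bin/bash",
            f"#SBATCH --job-name={job_name}",
            f"#SBATCH --nodes={nodes}",
            f"#SBATCH --ntasks-per-node={tasks_per_node}",
            f"#SBATCH --mem={mem_gb}G",      # memory per node
            f"#SBATCH --time={time_limit}",  # wall-clock limit, HH:MM:SS
        ]
    )


# Placeholder values only; real figures would come from the scaling tests.
print(sbatch_header("example_step", nodes=1, tasks_per_node=8, mem_gb=64, time_limit="12:00:00"))
```

      Keeping the request in one place like this would make it easier to compare what each SRCNet site can offer against what the workflow asks for.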

       

      By leveraging the pipeline developed by IDIA, SRCNet users can benefit from a standardised and well-documented workflow, making the MeerKAT data processing pipeline more accessible and more effective across several computing platforms.

              People

                r.bolton Bolton, Rosie
                M.Parra Parra, Manuel

                Feature Progress

                  Story Point Burn-up: (0%)

                  Feature Estimate: 4.0

                                Issues   Story Points
                  To Do         0        0.0
                  In Progress   0        0.0
                  Complete      0        0.0
                  Total         0        0.0
