SAFe Program / SP-3867

Scale up a task in the Example workflow repository


Details

    • SRCnet
    • All of the current tasks in the workloads GitLab repository use small/minimal datasets to demonstrate the task. Moving forward we want to scale some of these up to use much larger datasets. This is partly to test that the task can handle significant datasets, and partly to test the compute infrastructure running the task. Note that this is not just running the same task over and over again, but using a significantly larger dataset.
    • AC1: Develop a scaled-up version of PYBDSF: run on multiple images.

      AC2: Add the alternate version to GitLab.
    • 1
    • 1
    • 0
    • Team_MAGENTA
    • Sprint 5
    • Example notebook: https://gitlab.com/ska-telescope/src/src-workloads/-/blob/master/tasks/source-finding-pybdsf/jupyter/scripts/pybdsf-sf-with-rucio.ipynb?ref_type=heads
      Demoed: https://confluence.skatelescope.org/display/SRCSC/2024-02-15+SRC+ART+System+Demo+21.5+Part+2+PM
    • 21.6
    • Stories Completed, Demonstrated, Satisfies Acceptance Criteria, Accepted by FO
    • PI24 - UNCOVERED

    • Team_Magenta

    Description

      All of the current tasks in the workloads GitLab repository use small/minimal datasets to demonstrate the task. Moving forward we want to scale some of these up to use much larger datasets. This is partly to test that the task can handle significant datasets, and partly to test the compute infrastructure running the task. This will significantly increase the compute time, and will help test more realistic workloads being run at SRCs. Note that this is not just running the same task over and over again, but using a significantly larger dataset.

      This feature would select at least one of the tasks and scale it up. It could be split into multiple features, one per task that is relevant to scale up. Candidate tasks include:

      • Run PYBDSF source-finding on a significantly larger set of images (see the sketch after this list).
      • Mosaicking large areas of sky.
      • Image cutouts for thousands/millions of sources from multiple images.
      • Image convolution: convolving many images to alter the resolution.
      • CNN image classifier: use a much larger dataset of images.
      • Develop a script that runs all tasks in their default state, to test the functionality of an SRC node.
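
      The PYBDSF item corresponds directly to AC1. Below is a minimal Python sketch of what a scaled-up run could look like, assuming PyBDSF is installed as the bdsf package and the input FITS images have already been staged locally (the linked task notebook retrieves data via Rucio). The directory names and source-finding thresholds are illustrative assumptions, not values taken from the repository.

          # Sketch: run the PyBDSF source finder over a directory of FITS images
          # and write one source catalogue per image. Paths and thresholds are
          # example values, not the repository's actual configuration.
          from pathlib import Path

          import bdsf  # PyBDSF

          image_dir = Path("images")          # assumed local staging area for FITS images
          catalogue_dir = Path("catalogues")  # output directory for per-image catalogues
          catalogue_dir.mkdir(exist_ok=True)

          for fits_path in sorted(image_dir.glob("*.fits")):
              # Source finding on a single image; thresholds are illustrative.
              result = bdsf.process_image(str(fits_path), thresh_isl=3.0, thresh_pix=5.0)
              # Write the source list (srl) catalogue for this image.
              result.write_catalog(
                  outfile=str(catalogue_dir / (fits_path.stem + "_srl.fits")),
                  format="fits",
                  catalog_type="srl",
                  clobber=True,
              )

      Scaling up then amounts to pointing the same loop at a much larger image set, which also exercises the compute infrastructure (memory, I/O and runtime) of the SRC node running the task.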


People

    Bolton, Rosie (r.bolton)
    Clarke, Alex (A.Clarke)
    Votes: 0
    Watchers: 1

Feature Progress

    Story Point Burn-up: 100.00%
    Feature Estimate: 1.0

                   Issues   Story Points
    To Do          0        0.0
    In Progress    0        0.0
    Complete       3        5.0
    Total          3        5.0

Dates

    Created:
    Updated:
    Resolved:
