Loading...

Xporter

XML

Word

Printable

Details

Type: Enabler
Priority: Should have
Fix Version/s: PI21
Component/s: COM SDP SW, COM Software Platform
Labels:
None

ARTs:

Data Processing
Benefit hypothesis:

Hide

Provide fundamental service to enable the ability to ingest, track, trace, find and access data products of any kind within the SKAO. This will be using a generic data management schema initially, but allow for product specific schemata as well.

Show
Provide fundamental service to enable the ability to ingest, track, trace, find and access data products of any kind within the SKAO. This will be using a generic data management schema initially, but allow for product specific schemata as well.
Acceptance criteria:
Hide

Introduce service that can be used to track for intermediate and final data products by ID (as of ~~ADR-54~~):

data product metadata (initially as of ~~ADR-54~~)

Location information (should allow a number of storage backends, including informal ones like something that is in storage on an HPC system like CSD3 or Pawsay)

Lifecycle information (at minimum lifetime)

Updateable user annotations?

Integrate into SDP

Stretch - have data product dashboard show information from it

Stretch - consider how this could be ported/synchronised to enable a global SRC infrastructure

Stretch - consider authentication (this would need to go via execution block ID metadata)
Show
Introduce service that can be used to track for intermediate and final data products by ID (as of ADR-54 ): data product metadata (initially as of ADR-54 ) Location information (should allow a number of storage backends, including informal ones like something that is in storage on an HPC system like CSD3 or Pawsay) Lifecycle information (at minimum lifetime) Updateable user annotations? Integrate into SDP Stretch - have data product dashboard show information from it Stretch - consider how this could be ported/synchronised to enable a global SRC infrastructure Stretch - consider authentication (this would need to go via execution block ID metadata)
Expect Dependencies:

Inter Program, Intra Program
Feature Points:
5
Initial Size:
5
WSJF:
0
Informed By:

SP-3667 Review and consolidate design/architecture for data management
Epic Link:
AA0.5 Data Management & Analysis
Agile Teams:

Team_YANDA
Due Sprint:
Sprint 5
Story Point Burn-up:
Overdue:
Outcomes:
Hide

The implemented services can be used to track for intermediate and final data products by UID, OID and the semantic ID (as of ~~ADR-54~~):

Location information allows multiple storage backends, including informal ones like something that is in storage on local storage and Acacia Pawsay)

Lifecycle information both for the whole set of related products, replicas and individual items.

Updateable user annotations as JSON keyword/value pairs.

Integrated into SDP with make process, helm charts, testing and readthedocs.

data product dashboard has been deferred due to reduce availability of Naledi team during this PI.

Consideration how this could be ported/synchronised to enable data movements across a global SRC infrastructure.

AAA is currently a stub and will be started next PI.
Show
The implemented services can be used to track for intermediate and final data products by UID, OID and the semantic ID (as of ADR-54 ): Location information allows multiple storage backends, including informal ones like something that is in storage on local storage and Acacia Pawsay) Lifecycle information both for the whole set of related products, replicas and individual items. Updateable user annotations as JSON keyword/value pairs. Integrated into SDP with make process, helm charts, testing and readthedocs. data product dashboard has been deferred due to reduce availability of Naledi team during this PI. Consideration how this could be ported/synchronised to enable data movements across a global SRC infrastructure. AAA is currently a stub and will be started next PI.

Requirement Status:

PI22 - UNCOVERED
Goals_MIRO:
Low G3 Mid G3

Description

See feature breakdown on PI21 backlog board.

Old description:

Following ~~SP-3667~~ we need to make a start on the implementation of a service underlying the SKA Data Management.

This acknowledges that at this point in time we can not really make final decisions on:

Responsibilities (setup, design, operations)
Technology choice and installation (e.g. Postgres)
DB schema design

Therefore we should first focus on building a service-like layer, with a proof-of-concept implementation behind it.

Links and References

SI: Data Management: https://confluence.skatelescope.org/pages/viewpage.action?pageId=159387101

Attachments

Issue Links

is informed by

SP-3667 Review and consolidate design/architecture for data management

Done

Structure

Activity

People

Assignee:: Bartolini, Marco

Reporter:: Andreas Wicenec

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Feature Progress

Story Point Burn-up: (100.00%)

Feature Estimate: 5.0

	Issues	Story Points
To Do	0	0.0
In Progress	0	0.0
Complete	20	33.0
Total	20	33.0

Dates

Created:: 09/Oct/23 3:00 AM

Updated:: 06/Mar/24 6:09 AM

Initial implementation of data management