Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-2920

Platforms for testing AA2+ DP pipelines

Change Owns to Parent OfsSet start and due date...
    XporterXMLWordPrintable

Details

    • Services
    • Hide

      In order to demonstrate DP pipelines running at AA2+ scales access to significant computational resources is required (but at fairly low duty cycle).  A plan is needed on when and how these resources will be accessed so DP can plan accordingly.

      Show
      In order to demonstrate DP pipelines running at AA2+ scales access to significant computational resources is required (but at fairly low duty cycle).  A plan is needed on when and how these resources will be accessed so DP can plan accordingly.
    • Hide

      Catalogue of currently accessible facilities that could be used. A plan of future resources. Plan for access for these resources in period Q1 2023->Q1 2024

      Show
      Catalogue of currently accessible facilities that could be used. A plan of future resources. Plan for access for these resources in period Q1 2023->Q1 2024
    • Inter Program
    • 16.6
    • PI22 - UNCOVERED

    Description

      As the epic progress the DP team will need to carry out tests of DP pipelines at significant computational scale.

      Precise requirements are not known but basic modelling indicates DP tests will need to handle data sets up to of order 100 TB size, with  networked storage I/O greater than 10GB/s by the end of 2023. In 2023 Q1  we probably need smaller system (e.g., ~10TB size storage, scaled...) and as initial tests progress better estimates of future needs will be known.

      It is likely DP will need to test a range of machine configurations, e.g., large nodes with ~2TB RAM and 8 GPUS down to smaller single GPU nodes, possibly also just CPU nodes.

      The duty cycle of the tests is expected to be low, perhaps a few hours / week, with the most demanding tests only carried out a few times.

      Need to develop a plan for where, when and how these platforms will be accessible.

      Attachments

        Issue Links

          Structure

            Activity

              People

                m.deegan Deegan, Miles
                b.nikolic Nikolic, Bojan
                Votes:
                0 Vote for this issue
                Watchers:
                0 Start watching this issue

                Feature Progress

                  Story Point Burn-up: (0%)

                  Feature Estimate: 0.0

                  IssuesStory Points
                  To Do00.0
                  In Progress   00.0
                  Complete00.0
                  Total00.0

                  Dates

                    Created:
                    Updated:
                    Resolved:

                    Structure Helper Panel