Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-3512

SRC workflow example: Source Classification (not CNN)

Change Owns to Parent OfsSet start and due date...
    XporterXMLWordPrintable

Details

    • Feature
    • Could have
    • PI20
    • None
    • SRCnet
    • Hide

      SRCs will provide pre-built scripts for standard workflows that users expect to run. We aim to implement some of these, and test their effectiveness with regards to hardware limitations and data storage schemes. 

      This particular workflow looks at source classification using a Random Forest or similar parameter space exploration.

      It can make use of distributing the model training or classification.

      Show
      SRCs will provide pre-built scripts for standard workflows that users expect to run. We aim to implement some of these, and test their effectiveness with regards to hardware limitations and data storage schemes.  This particular workflow looks at source classification using a Random Forest or similar parameter space exploration. It can make use of distributing the model training or classification.
    • Hide

      AC1: Deploy the SDC1 source classification workflow on Git.

      AC2: Adapt the workflow to distribute the model training and classifying.

      Show
      AC1: Deploy the SDC1 source classification workflow on Git. AC2: Adapt the workflow to distribute the model training and classifying.
    • 1
    • 0
    • Overdue
    • PI23 - UNCOVERED

    • PI20-PB

    Description

      Input data: catalog data, preferably cross-matched with other wavelengths

      Output data: catalog of class labels with confidence 

      Software involved: Python machine learning libraries

      Workflow steps: Train a model on some labeled data, apply the model to unlabeled data.

      State of existing workflows: SDC1 solution has these workflows done already, but only applies to simulated SDC1 radio data, so the results are not very representative of actual science. Alex’s SDSS & WISE star/galaxy/quasar classification paper is a complete multi-wavelength example, but excludes radio. Lots of options here to distribute data, build models separately and then combine them later in one location. Distributed data workflow would be a lot of work to implement.

      Architecture and hardware: Multithreaded CPU. Flexible RAM. Data in a single location, or can distribute data for training/classification and combine models afterwards.

      Attachments

        Issue Links

          Structure

            Activity

              People

                r.bolton Bolton, Rosie
                r.bolton Bolton, Rosie
                Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Feature Progress

                  Story Point Burn-up: (0%)

                  Feature Estimate: 1.0

                  IssuesStory Points
                  To Do00.0
                  In Progress   00.0
                  Complete00.0
                  Total00.0

                  Dates

                    Created:
                    Updated:

                    Structure Helper Panel