Uploaded image for project: 'SAFe Program'
  1. SAFe Program
  2. SP-4871

Data Management v0.1 - Roadmap

Change Owns to Parent OfsSet start and due date...
    XporterXMLWordPrintable

Details

    • Data Management v0.1 - Roadmap
    • SRCnet
    • 0

    Description

      Implementation Overview for Local Data Management and Proxy Service

      To facilitate proxying requests to a potentially firewalled SODA/visualization service while managing permissions checks, we propose developing a Local Data Management Service (LDMS). This service will serve as an intermediary to link data files from the Rucio Data Lake to local user environments, ensuring secure and authorized access to data.

      Key Functionalities of the Local Data Management Service (LDMS)

      1. *User Token Validation*:

      • The LDMS will accept user tokens to authenticate users against the SRCNet Identity and Access Management (IAM) system.
      • It will determine the corresponding local user using the CANFAR PosixMapper, ensuring consistent mapping of network users to local users.

      2. *Data File Linking*:

      • Upon receiving a request with specific data file IDs, the LDMS will create symbolic links to the requested files in a scratch area attached to the local computing environment.
      • The service will ensure the local RSE (Rucio Storage Element) is mounted in read-only mode to safeguard against unauthorized modifications.

      3. *Proxying Requests*:

      • The LDMS will act as a proxy, relaying requests from the client to the SODA service while enforcing permissions checks based on the IAM group memberships linked to the user's access token.
      • The service will implement WLCG path-based authorization, ensuring that tokens provide access to specific paths on the storage.

      4. *Configuration and Synchronization*:

      • The LDMS will require modifications to the Rucio server configuration to point to SRCNet IAM for permissions checks.
      • The sync script in the Rucio task manager will need adjustments to accommodate this new configuration.

      5. *API Development*:

      • The LDMS will implement a RESTful API in line with the interface defined in JIRA SP-4678, including methods for linking data files and validating permissions.

      Architecture Components

      • *Web Service*: The LDMS will be implemented as a web service that can be installed at each SRCNet site, designed to handle incoming requests and manage local file links.
      • *Integration with CANFAR*: By leveraging CANFAR’s PosixMapper, the service will translate network user IDs from the user tokens to local filesystem paths.
      • *Storage Access*: Ensure that the service can interact with Rucio and other storage technologies, such as S3 and Ceph, to facilitate data access and management.

      Pending Actions

      1. *Modify Rucio Server Configuration*:

      • Update configurations to redirect permission checks to SRCNet IAM.
      • Synchronize user accounts and permissions using the updated configuration.

      2. *Develop the LDMS*:

      • Create the service architecture, including the API, user authentication, and data linking functionality.
      • Implement logging and error handling mechanisms for robust operation.

      3. *Testing*:

      • Conduct thorough testing to ensure that permissions checks are functioning correctly and that data linking works as intended.
      • Test the proxy functionality to ensure that the SODA service can process requests securely.

      4. *Documentation*:

      • Document the service setup, configuration changes, and API usage for future reference and training of site operators.

      Additional Considerations

      • *Performance*: Benchmark the LDMS to ensure it can handle the expected load, particularly with large datasets.
      • *Scalability*: Design the service with scalability in mind to accommodate growing data volumes and user requests.
      • *Monitoring*: Integrate with existing monitoring solutions (e.g., Grafana) to track usage metrics and service health.

      By implementing the Local Data Management Service with these functionalities and considerations, we will enhance the SRCNet ecosystem, providing a secure and efficient means to manage and access data across firewalled environments.

      Attachments

        Issue Links

          Structure

            Activity

              People

                Debashis.Mitra Mitra, Debashis
                Debashis.Mitra Mitra, Debashis
                Votes:
                0 Vote for this issue
                Watchers:
                0 Start watching this issue

                Dates

                  Created:
                  Updated:

                  Structure Helper Panel