Details
Spike
Should have
SRCnet
3
3
0
Team_CORAL
Sprint 5
24.3
Stories Completed, Outcomes Reviewed
science-platform-services
Description
Some applications and tools are designed to be cloud-native and assume that infrastructure is available through Kubernetes interfaces. The container-based Science Platform developed by CADC is one such example: it treats all workloads as Kubernetes-native Jobs. This presents a challenge when the HPC resources are not part of the Kubernetes infrastructure but belong to a separate system.
This work aims to make these cloud-native applications interoperable with HPC batch systems that are not Kubernetes-based.
Tools like https://github.com/IBM/Bridge-Operator achieve a loose coupling between Kubernetes-based systems (such as the CANFAR Science Platform) and separate batch systems.
In this feature, we will explore this approach, starting with the Bridge Operator in particular and migrating to other solutions if need be.
The goal is to understand how to build an interface that allows Kubernetes-based applications to run their containerised services on HPC batch systems. We will use CANFAR to guide this exploration and to provide a practical view of how to build this interface.
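To make the shape of such an interface concrete, here is a minimal sketch of one possible translation step: rendering a Kubernetes Job manifest as a batch script. It is not how the Bridge Operator works internally, and it rests on assumptions that are not in this ticket: the batch system is assumed to be Slurm, the container runtime on the HPC side is assumed to be Apptainer, and the helper names (`job_to_sbatch`, `cpu_to_slurm`, `mem_to_slurm`) are hypothetical.

```python
import math
import shlex


def cpu_to_slurm(cpu) -> int:
    """Convert a Kubernetes CPU quantity ("2" or "500m") to whole CPUs, rounding up."""
    cpu = str(cpu)
    if cpu.endswith("m"):  # millicores, e.g. "500m" is half a core
        return max(1, math.ceil(int(cpu[:-1]) / 1000))
    return max(1, math.ceil(float(cpu)))


def mem_to_slurm(memory: str) -> str:
    """Convert "4Gi" or "512Mi" to "4G" or "512M"; other units are not handled in this sketch."""
    for k8s_unit, slurm_unit in (("Gi", "G"), ("Mi", "M")):
        if memory.endswith(k8s_unit):
            return memory[: -len(k8s_unit)] + slurm_unit
    raise ValueError(f"unsupported memory quantity: {memory}")


def job_to_sbatch(job: dict) -> str:
    """Render the first container of a Kubernetes Job as a Slurm batch script.

    Assumption: the HPC site runs Slurm and can pull OCI images with Apptainer.
    """
    name = job["metadata"]["name"]
    container = job["spec"]["template"]["spec"]["containers"][0]
    limits = container.get("resources", {}).get("limits", {})
    command = container.get("command", []) + container.get("args", [])
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={name}",
        f"#SBATCH --cpus-per-task={cpu_to_slurm(limits.get('cpu', '1'))}",
        f"#SBATCH --mem={mem_to_slurm(limits.get('memory', '1Gi'))}",
        # Run the same container image that the Job would have run in a pod.
        f"apptainer exec {shlex.quote('docker://' + container['image'])} "
        f"{shlex.join(command)}",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical example manifest, reduced to the fields the sketch reads.
example_job = {
    "metadata": {"name": "demo"},
    "spec": {"template": {"spec": {"containers": [{
        "image": "python:3.12",
        "command": ["python", "-c"],
        "args": ["print(1)"],
        "resources": {"limits": {"cpu": "2500m", "memory": "4Gi"}},
    }]}}},
}

if __name__ == "__main__":
    print(job_to_sbatch(example_job))
```

A real bridge would also need to submit the script, poll the batch system for job state, and report that state back to Kubernetes. Those steps are where the loose coupling described above does most of its work, and they are left out of this sketch.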