Use Arrikto vGPUs¶
Currently, Kubernetes assigns GPUs exclusively to Pods. This is especially inefficient in interactive scenarios, such as development using a Jupyter notebook server, in which a Pod, or application, has large idle periods.
Kiwi is a mechanism that enables multiple containers (belonging to the same or different Pods) to run on the same GPU concurrently, each one having the whole GPU memory available for use.
This section will guide you through using vGPUs for your workloads. Here is what you’ll need to follow this guide:
In order to consume an Arrikto vGPU, you need to request exactly one
arrikto.com/gpu device in the
limits section of the
of your container. To do this, add the following snippet to the desired
When creating a Jupyter notebook server, specify exactly one
arrikto.com/gpu in the
GPUs section of the Jupyter Web App.
Additionally, make sure to use a GPU-enabled Kale image, as you would normally do for your GPU-enabled notebook.
Specify exactly one Arrikto vGPU in the GPU-specific annotation modal that appears after clicking the GPU button when annotating a step.