Upgrade NVIDIA Device Plugin

EKF 2.0 uses NVIDIA device plugin v0.12.2, which uses Ubuntu as its base image. This guide will walk you through upgrading NVIDIA device plugin from version v1.0.0-beta4 to version v0.12.2.

Optional

This guide is optional. If you have not created a GPU-enabled node group or are not planning to create one, proceed to the What’s Next section.

Procedure

  1. Go to your GitOps repository, inside your rok-tools management environment:

    root@rok-tools:~# cd ~/ops/deployments
  2. Apply the kustomization:

    root@rok-tools:~/ops/deployments# rok-deploy --apply rok/nvidia-device-plugin/overlays/deploy

Verify

  1. Verify that the NVIDIA device plugin is up and running. Verify that the value of field READY is equal to the value of field DESIRED:

    root@rok-tools:~# kubectl get -n kube-system daemonset/nvidia-device-plugin-daemonset NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE nvidia-device-plugin-daemonset 2 2 2 2 2 <none> 38d

Summary

You have successfully upgraded the NVIDIA device plugin of your EKF cluster.

What’s Next

The next step is to upgrade ExternalDNS.