When I deployed using the following command, the pod was able to run normally.
helm upgrade -i nvdp nvdp/nvidia-device-plugin
--namespace nvidia-device-plugin
--create-namespace
--version 0.17.1 (or 1.9.0)
Then, according to the recommendation on the main page(https://github.com/NVIDIA/k8s-device-plugin/tree/main), when using the following command, the pod failed to start and an Error occurred.
cat << EOF > /tmp/dp-example-config0.yaml
version: v1
flags:
migStrategy: "none"
failOnInitError: true
nvidiaDriverRoot: "/"
plugin:
passDeviceSpecs: false
deviceListStrategy: envvar
deviceIDStrategy: uuid
EOF
helm upgrade -i nvdp nvdp/nvidia-device-plugin
--version=0.17.1
--namespace nvidia-device-plugin
--create-namespace
--set-file config.map.config=/tmp/dp-example-config0.yaml
Versions tried were 0.17.1, 0.17.2, 0.18.1, and 0.19.0, all resulting in the same outcome (pod startup error)
My environment is as follows:
OS :ubuntu 20.04
K8S: v1.29.15
Helm: v3.18.6
GPU: 2 * RTX 4060 (2882)
When I deployed using the following command, the pod was able to run normally.
helm upgrade -i nvdp nvdp/nvidia-device-plugin
--namespace nvidia-device-plugin
--create-namespace
--version 0.17.1 (or 1.9.0)
Then, according to the recommendation on the main page(https://github.com/NVIDIA/k8s-device-plugin/tree/main), when using the following command, the pod failed to start and an Error occurred.
cat << EOF > /tmp/dp-example-config0.yaml
version: v1
flags:
migStrategy: "none"
failOnInitError: true
nvidiaDriverRoot: "/"
plugin:
passDeviceSpecs: false
deviceListStrategy: envvar
deviceIDStrategy: uuid
EOF
helm upgrade -i nvdp nvdp/nvidia-device-plugin
--version=0.17.1
--namespace nvidia-device-plugin
--create-namespace
--set-file config.map.config=/tmp/dp-example-config0.yaml
Versions tried were 0.17.1, 0.17.2, 0.18.1, and 0.19.0, all resulting in the same outcome (pod startup error)
My environment is as follows:
OS :ubuntu 20.04
K8S: v1.29.15
Helm: v3.18.6
GPU: 2 * RTX 4060 (2882)