한 번만 더 해보자

[tao-toolkit] nvidia-container-cli: initialization error: nvml error: driver not loaded: unknown


정 하임 2024. 6. 27. 22:22

Error message

nvidia-container-cli: initialization error: nvml error: driver not loaded: unknown

Situation

The error above appeared when I tried to start training with TAO Toolkit.
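Before picking one of the fixes below, it may help to see which piece is actually missing. The following is only a minimal sketch and not part of the original troubleshooting: the function name `check_gpu_stack` is made up for illustration, and it assumes a Linux host, where the NVIDIA kernel module exposes /proc/driver/nvidia/version while it is loaded.

```shell
#!/bin/sh
# Hypothetical helper: report which part of the GPU container stack is missing.
check_gpu_stack() {
    # Case 1: is the NVIDIA Container Toolkit CLI on PATH?
    if command -v nvidia-container-cli >/dev/null 2>&1; then
        echo "nvidia-container-cli: found"
    else
        echo "nvidia-container-cli: missing"
    fi

    # Case 2: is the NVIDIA kernel driver loaded?
    # This file is created by the kernel module, so it is absent when the driver is not loaded.
    if [ -r /proc/driver/nvidia/version ]; then
        echo "nvidia driver: loaded"
    else
        echo "nvidia driver: not loaded"
    fi
}

check_gpu_stack
```

If the first line reports "missing", the toolkit installation (case 1) is the likely fix; if the second line reports "not loaded", the driver (case 2) is the more likely culprit.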

 

 

Solution

  1. If nvidia-container-cli is not installed

Install the NVIDIA Container Toolkit by following the official guide: "Installing the NVIDIA Container Toolkit" (NVIDIA Container Toolkit 1.15.0 documentation, docs.nvidia.com)

 


 

 

Installation (Debian/Ubuntu, using apt)

# Add NVIDIA's signing key and the libnvidia-container apt repository
$ curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
  && curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
    sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
    sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list

# Refresh the package index so the new repository is visible
$ sudo apt-get update

# Install the toolkit, which provides nvidia-container-cli
$ sudo apt-get install -y nvidia-container-toolkit
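After the package is installed, Docker still has to be told to use the NVIDIA runtime. The sketch below follows the configuration step from the same NVIDIA guide. It assumes Docker is the container engine and is managed by systemd, and the wrapper name `configure_docker_runtime` is hypothetical; only the three commands inside it matter.

```shell
#!/bin/sh
# Hypothetical wrapper around the post-install steps; nothing runs until it is called.
configure_docker_runtime() {
    # Register the NVIDIA runtime in /etc/docker/daemon.json.
    sudo nvidia-ctk runtime configure --runtime=docker || return 1

    # Restart the Docker daemon so it picks up the new runtime.
    sudo systemctl restart docker || return 1

    # Smoke test: run nvidia-smi inside a throwaway container.
    sudo docker run --rm --runtime=nvidia --gpus all ubuntu nvidia-smi
}
```

Calling `configure_docker_runtime` once should be enough. If the last command prints the usual GPU table, the TAO Toolkit container should be able to see the GPU as well; if it still fails with "driver not loaded", the problem is more likely the driver itself.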
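  2. If the NVIDIA driver is not installed

Download the latest official NVIDIA drivers

From the link above, download the driver version that matches your system, then install it.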
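Once the driver is installed, its kernel module has to be loaded before containers can use the GPU, which usually means a reboot. A small check can confirm that the driver responds before retrying TAO Toolkit. This is again only a sketch, and the function name `report_driver` is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print the driver version and GPU name, or a hint if that fails.
report_driver() {
    if ! command -v nvidia-smi >/dev/null 2>&1; then
        echo "nvidia-smi not found: the driver is probably not installed"
        return 1
    fi

    # Query the driver directly; this fails when the kernel module is not loaded.
    nvidia-smi --query-gpu=driver_version,name --format=csv,noheader ||
        echo "nvidia-smi could not reach the driver: try rebooting"
}
```

Running `report_driver` after the reboot should print one line per GPU with the driver version and the GPU name.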
