Node prerequisites for binding to FEDML Platform
In order to bind node to FEDML Platform, you need to have NVIDIA GPU(s) with CUDA drivers installed.
To check if Nvidia drivers are installed, run the following command:
nvidia-smi
If you see similar output as below with GPU details, you are good to go:
tip
The precise results you see on your terminal would vary may vary based on the number of GPU cards and the type of GPU cards available on your node. So consider the screenshots above as mere examples to ensure it appears somewhat similar.
If you don't have Nvidia drivers installed, you can first try to install them using the following command:
sudo ubuntu-drivers autoinstall
sudo apt install nvidia-cuda-toolkit
sudo reboot
nvidia-smi
If the above commands don't work, you can follow the instructions on NVIDIA's official website: https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html