I have followed every version of the instructions on the AWS-EC2 setup for RAPIDS.ai: https://rapids.ai/cloud#AWS-EC2
I can confirm that I am using the exact instance type in the instructions, and following the steps exactly.
When I try to use the Docker approach, the --gpus all flag is not accepted.
When I try to use the conda approach, the install fails with the error:
PackageNotFoundError: Packages missing in current channels:
- glibc
I have tried many of the solutions suggested for both of these problems, and none of them seem to work. I really just need to test some Python code with cuml and cudf imports in a notebook. I have been at this for 7 hours (after giving up on my local machine and SageMaker).
2 Answers
Turns out, the first AMI suggested in the documentation is not compatible. Use the Deep Learning NVIDIA one instead.
You note that the --gpus all flag is not accepted, which suggests that you do not have the NVIDIA Docker runtime installed. I followed the instructions you linked and did run into an issue where the sudo yum install -y nvidia-docker2 command failed; I needed to disable an Amazon yum repo that was causing some conflicts, as outlined in this issue. Once I'd done that and run sudo systemctl restart docker, I was able to start the RAPIDS container.
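Since the goal is just to confirm that cudf and cuml import in a notebook, a quick check along these lines may help once the container is up. This is only a sketch: check_imports is a hypothetical helper name, not part of RAPIDS, and it assumes the notebook kernel is running inside the RAPIDS environment.

```python
import importlib


def check_imports(names):
    """Try to import each module by name and report which ones succeed."""
    results = {}
    for name in names:
        try:
            importlib.import_module(name)
            results[name] = True
        except Exception as exc:
            # Catches ImportError as well as any driver-related error
            # raised while the module is being imported.
            results[name] = False
            print(f"{name}: import failed ({type(exc).__name__}: {exc})")
    return results


# The two libraries named in the question.
status = check_imports(["cudf", "cuml"])
print(status)
```

If either entry comes back False, the printed exception usually indicates whether the package is missing from the environment or whether the GPU is not visible to the container.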