0

I have followed every version of the instructions on the AWS-EC2 setup for RAPIDS.ai: https://rapids.ai/cloud#AWS-EC2

I can confirm that I am using the exact instance type in the instructions, and following the steps exactly.

When I try to use the docker approach, the --gpus all command is not accepted.

When I try to use the conda approach, the install fails with the error:

PackageNotFoundError: Packages missing in current channels:
            
 - glibc

I have tried (many) different solutions provided to solve both of these problems, none of them seem to work. I really just need to test some python code with cuml and cudf imports in a notebook. Been at this for 7 hours (after giving up on my local and SageMaker).

stephenlcurtis
  • 180
  • 1
  • 7

2 Answers2

2

You note that the --gpus all command is not accepted, which suggests that you do not have the NVIDIA Docker runtime installed.

I followed the instructions you linked and I did run into an issue where the sudo yum install -y nvidia-docker2 command failed and I needed to disable an Amazon yum repo that was causing come conflicts as outlined in this issue.

$ sudo yum-config-manager --disable amzn2-graphics

$ sudo yum install -y nvidia-docker2

$ sudo yum-config-manager --enable amzn2-graphics

Once I'd done that and run sudo systemctl restart docker I was able to start the RAPIDS container.

$ docker run --gpus all --rm -it -p 8888:8888 -p 8787:8787 -p 8786:8786     rapidsai/rapidsai:cuda11.2-runtime-ubuntu18.04-py3.7
This container image and its contents are governed by the NVIDIA Deep Learning Container License.
By pulling and using the container, you accept the terms and conditions of this license:
https://developer.download.nvidia.com/licenses/NVIDIA_Deep_Learning_Container_License.pdf

A JupyterLab server has been started!
To access it, visit http://localhost:8888 on your host machine.
Ensure the following arguments were added to "docker run" to expose the JupyterLab server to your host machine:
      -p 8888:8888 -p 8787:8787 -p 8786:8786
Make local folders visible by bind mounting to /rapids/notebooks/host
(rapids) root@be7253bb4fdb:/rapids/notebooks#
Jacob Tomlinson
  • 3,341
  • 2
  • 31
  • 62
1

Turns out, the frist AMI suggested in the documentation is not compatible. Use the Deep Learning NVIDIA one instead.

stephenlcurtis
  • 180
  • 1
  • 7