Questions tagged [rapids]

RAPIDS is a framework for accelerated machine learning and data science on GPUs.

Questions pertaining to RAPIDS. From https://rapids.ai/ :

The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposes that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

RAPIDS also focuses on common data preparation tasks for analytics and data science. This includes a familiar DataFrame API that integrates with a variety of machine learning algorithms for end-to-end pipeline accelerations without paying typical serialization costs. RAPIDS also includes support for multi-node, multi-GPU deployments, enabling vastly accelerated processing and training on much larger dataset sizes.

195 questions
0
votes
1 answer

How do I use RAPIDS in Python if it is not working

Hi guys, I am trying to access GPU-accelerated regression trees through cuML from RAPIDS. However, I can't seem to download RAPIDS as instructed. I am trying to download it through the Anaconda prompt in the working directory using the stated…
0
votes
1 answer

How can I get cuML RandomForestClassifier leaves?

I'm new to cuML and I have a decision tree classifier using scikit-learn. I would like to perform some hyperparameter search using the GPU, so I started looking at cuML. There is no DecisionTreeClassifier in cuML, but it can be reproduced by using…
xerac
  • 147
  • 8
0
votes
1 answer

How to save the data drawn by cuxfilter as an image

I am considering drawing a network. For example, as shown in the demo, we can use the dashboard to get the chart, but there is no "save button" on the right side, as is often the case. cux_df = cuxfilter.DataFrame.load_graph((nodes, edges)) chart0…
felntc
  • 13
  • 2
0
votes
1 answer

Cupy config shows no cuda device

I built a Docker container to be deployed on HPC+GPU via Singularity. When I run cp.show_config(): OS: Linux-5.4.0-135-generic-x86_64-with-glibc2.31 Python Version: 3.9.12 CuPy Version: 9.6.0 CuPy Platform: NVIDIA CUDA NumPy Version: 1.21.5 SciPy…
fabio.geraci
  • 305
  • 2
  • 5
  • 18
0
votes
0 answers

Docker exec container pytest fail

I am using the dev image so I can have the CUDA compiler. Now the issue is that when running the CI as below, I get that error, but if I build the standard container (commented line in Dockerfile). CONTAINER=$(docker run -d gpu-test) docker exec…
fabio.geraci
  • 305
  • 2
  • 5
  • 18
0
votes
1 answer

Clear all cached kernels from CuPy to force kernel compilation

In the CuPy documentation, it is stated that "CuPy caches the kernel code sent to GPU device within the process, which reduces the kernel compilation time on further calls." This means that when one calls a function from CuPy, subsequent calls to…
JOKKINATOR
  • 356
  • 1
  • 11
0
votes
1 answer

RuntimeError: CUDA error encountered, when using cuml

When I use RAPIDS I always get errors. Now, I run: from cuml.datasets.regression import make_regression data, values = make_regression(n_samples=200000, n_features=50, n_informative=7, bias=-4.2, …
Chao Li
  • 11
  • 3
0
votes
0 answers

Error while importing cuml, ModuleNotFoundError: No module named 'pylibraft'

While importing the library cuml in WSL Ubuntu. import cuml Traceback (most recent call last): File "", line 1, in File "/usr/local/lib/python3.8/dist-packages/cuml/__init__.py", line 17, in from cuml.common.base…
abhipraja
  • 11
  • 3
0
votes
0 answers

Fail to extend existing docker file - rapidsai/rapidsai - adding packages that would be installed using conda

I am trying to create a new Dockerfile that builds on an existing one, rapidsai/rapidsai:22.04-cuda11.0-runtime-ubuntu18.04-py3.8, and installs two new packages into the existing conda environment. What I tried: FROM…
Ilan Geffen
  • 179
  • 8
0
votes
1 answer

apache-spark GPU installation with RAPIDS

Sorry, I'm new to the world of Spark. I need to install it on a cluster that has a node with 2 NVIDIA Ampere GPUs. In the Install Spark paragraph of this NVIDIA link, I find written that: "note that only scala version 2.12 is currently supported by the…
chpiero
  • 41
  • 5
0
votes
2 answers

cugraph create NoneType

I tried to create a Graph from a dask_cudf DataFrame, but the Graph comes out as NoneType without an error message. I also tried it with the same data set as a pandas DataFrame. Then I tried it with three sample edges. Each time I got a NoneType object. However, if…
padul
  • 134
  • 11
0
votes
1 answer

User-defined function to combine cuDF DataFrame columns

As per the title, I am trying to combine the row values from different cudf.DataFrame columns. The following code works for a standard pandas.DataFrame: import pandas as pd data = {'a': [1], 'b': [2], 'c': [3], 'd': [4]} df =…
epifanio
  • 1,228
  • 1
  • 16
  • 26
0
votes
0 answers

cuDF support for emoji_patterns

Is there a faster way to clean emojis from a cuDF string series? I am currently using emoji == 1.7.0 and retrieving the regex emoji patterns (since cuDF doesn't support the emoji library directly to do an emoji.get_emoji_regexp().sub("", string)…
ZooPanda
  • 331
  • 3
  • 11
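
For the emoji-cleaning question above, the regex-substitution idea can be sketched on the CPU with Python's standard `re` module. This is only an illustration under stated assumptions: `EMOJI_PATTERN` and `strip_emojis` are hypothetical names, and the character ranges are a deliberately narrow stand-in, not the pattern produced by the emoji library.

```python
import re

# Hypothetical, deliberately narrow pattern: a few common emoji blocks only.
# It is NOT the pattern returned by emoji.get_emoji_regexp().
EMOJI_PATTERN = re.compile(
    "["
    "\U0001F300-\U0001F5FF"  # symbols and pictographs
    "\U0001F600-\U0001F64F"  # emoticons
    "\U0001F680-\U0001F6FF"  # transport and map symbols
    "\U0001F900-\U0001F9FF"  # supplemental symbols and pictographs
    "\u2600-\u27BF"          # miscellaneous symbols and dingbats
    "]+"
)


def strip_emojis(text: str) -> str:
    """Remove every run of characters matched by EMOJI_PATTERN."""
    return EMOJI_PATTERN.sub("", text)


# Apply the substitution to a small list of sample strings.
samples = ["good morning 😀", "🚀 launch day", "no emoji here"]
cleaned = [strip_emojis(s) for s in samples]
print(cleaned)
```

On a cuDF string series the same substitution would presumably be expressed through the series' regex-based string replacement rather than a Python loop. Whether cuDF's regex engine accepts these exact character ranges is not verified here.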
0
votes
1 answer

GPU vs CPU memory usage in RAPIDS

I understand that the GPU and CPU have their own RAM, but what I don't understand is why the same dataframe, when loaded in pandas vs RAPIDS cuDF, has drastically different memory usage. Can somebody explain?
ZooPanda
  • 331
  • 3
  • 11
0
votes
1 answer

Merge output from cugraph over vertex_id with input data

If I create a graph with cugraph and then calculate positions for the nodes or communities, I get a dataframe with information and a vertex id. So I have three questions: How is the vertex id created? Is there a way to merge the output data over…
padul
  • 134
  • 11