Questions tagged [rapids]

RAPIDS is a framework for accelerated machine learning and data science on GPUs

Questions pertaining to RAPIDS. From https://rapids.ai/ :

The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposes that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

RAPIDS also focuses on common data preparation tasks for analytics and data science. This includes a familiar DataFrame API that integrates with a variety of machine learning algorithms for end-to-end pipeline accelerations without paying typical serialization costs. RAPIDS also includes support for multi-node, multi-GPU deployments, enabling vastly accelerated processing and training on much larger dataset sizes.

195 questions
1
vote
1 answer

ModuleNotFoundError: No module named 'cudf' in google colab

I tried importing cudf and get the following error: ModuleNotFoundError Traceback (most recent call last) in () ----> 1 import cudf; print('cuDF Version:', cudf.__version__) ModuleNotFoundError: No module…
Keerthi Sree
  • 13
  • 1
  • 3
1
vote
2 answers

Interpreting package requests conflicts for a failed conda install

Attempting the following conda install operation (derived from the NVIDIA RAPIDS installation instructions): conda config --prepend channels rapidsai && \ conda config --prepend channels nvidia && \ conda config --set channel_priority strict &&…
Aleksey Bilogur
  • 3,686
  • 3
  • 30
  • 57
1
vote
0 answers

cuML vs sklearn: different accuracies for random forest classifier

I am using the rapidsai docker container as obtained via docker pull rapidsai/rapidsai:cuda10.0-runtime-ubuntu18.04 docker run --gpus all --rm -it -p 8888:8888 -p 8787:8787 -p 8786:8786 \ rapidsai/rapidsai:cuda10.0-runtime-ubuntu18.04 and have…
cryo111
  • 4,444
  • 1
  • 15
  • 37
1
vote
1 answer

Has anyone see error global_output_type with cuml RAPIDS?

I am new to RAPIDS, and just tried to replicate a logisticregression model previously with sklearn, but now with cuml, but it the LogisticRegression() call does not work. from cuml.linear_model import LogisticRegression,…
alf
  • 11
  • 2
1
vote
1 answer

`pip install cudf-cuda100` results in "ERROR: No matching distribution found for cudf-cuda100"

I run Windows 10 and have installed Anaconda. I am trying to install cudf but I repeatedly fail: (tf2) C:\WINDOWS\system32>pip install cudf-cuda100 ERROR: Could not find a version that satisfies the requirement cudf-cuda100 (from versions:…
user8270077
  • 4,621
  • 17
  • 75
  • 140
1
vote
1 answer

cuML functions running on DASK? and dask_cudf manipulation?

How to run dask_cuML (logistic regression for example) on a large dataset, dask_cudf? I can not run cuML on my cudf dataframe because dataset is large so "OUT of MEMORY" as soon as I try anything. Bright side is I got 4 GPUs to use with…
Salchem
  • 118
  • 1
  • 2
  • 11
1
vote
1 answer

Convert cuDF data frame column to 1 or 0 for “true”/“false” values

I am using RAPIDS (0.9 release) docker container. How can I do the following with RAPIDS cuDF? df['new_column'] = df['column_name'] > condition df[['new_column']] *= 1
rnyai
  • 25
  • 3
1
vote
1 answer

How to use cudf.Series.applymap()?

Can someone please provide a few examples of how to use the applymap method on a cuDF Series? Below is copied from the docs and here is a link to the documentation. applymap(self, udf, out_dtype=None) Apply a elemenwise function to transform the…
gumdropsteve
  • 70
  • 1
  • 14
1
vote
3 answers

How to apply if condition in GPU DataFrame- cuDF to filter the DataFrame?

I'd like to filter a cuDF data frame based on a column value, and then create a new column based on a condition specified. Basically, how can I apply the following in cuDF? df.loc[df.column_name condition, 'new column name'] = 'value if condition is…
rnyai
  • 25
  • 3
1
vote
1 answer

Python Nvidia rapids memory error when using cuml for training machine learning model

I am using python 3 with nvidia Rapids in order to speed up machine learning training using cuml library and a GPU. My scrips also uses keras with GPU training (over tf) and when I reach the stage where I try to use CUML I get memory error. I…
thebeancounter
  • 4,261
  • 8
  • 61
  • 109
1
vote
2 answers

How to drop columns with NA using cudf?

Pandas: data = data.dropna(axis = 'columns') I am trying to do something similar using a cudf dataframe but the apis don't offer this functionality. My solution is to convert to a pandas df, do the above command, then re-convert to a cudf. Is…
Sterls
  • 723
  • 12
  • 22
1
vote
2 answers

How to implement SQL Groupby in RAPIDS

I'm seeking to translate an SQL query to use RAPIDS. Consider the simplified query below: (SELECT min(a), max(b), c FROM T GROUP BY c) AS result I have validated the code below, but is this the optimal solution? Is sorting on the group key…
0
votes
1 answer

Solving environment: failed when install RAPIDS using conda

In order to install RAPIDS, I get the command from the site below and run it, but the following error occurs. https://docs.rapids.ai/install conda create --solver=libmamba -n rapids-23.08 -c rapidsai -c conda-forge -c nvidia rapids=23.08 python=3.10…
Tio
  • 944
  • 3
  • 15
  • 35
0
votes
0 answers

Can I run the cuML RandomForestClassifier with a sklearn MultiOutputClassifier wrapper on a dataset for a multilabel classification task?

I am currently working on an Excel datasheet where I have rows with several features for which I want to predict multiple labels. The features are either 0 or 1 because they describe whether something occurs in that row or not. The labels are also…
0
votes
0 answers

Huge variance for RandomForestRegressor models

The experiment is the following: train a RFR with a 15k train rows get predictions on 8k test rows, save predictions as y_hat0 remove 1 random row from the training set and retrain the RFR save prediction for the newly trained model as…
Oleg
  • 161
  • 1
  • 14