Questions tagged [rapids]

RAPIDS is a framework for accelerated machine learning and data science on GPUs.

Questions pertaining to RAPIDS. From https://rapids.ai/ :

The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposes that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

RAPIDS also focuses on common data preparation tasks for analytics and data science. This includes a familiar DataFrame API that integrates with a variety of machine learning algorithms for end-to-end pipeline accelerations without paying typical serialization costs. RAPIDS also includes support for multi-node, multi-GPU deployments, enabling vastly accelerated processing and training on much larger dataset sizes.

195 questions
0
votes
0 answers

TypeError: First element of field tuple is neither a tuple nor str

I am a beginner with RAPIDS. I am trying to run the following code on Colab, but it results in an error: TypeError: First element of field tuple is neither a tuple nor str. Similar code runs fine with pandas but fails with cudf…
T. Hanuman
  • 11
  • 1
0
votes
1 answer

How can I test HDBSCAN using RAPIDS without getting an error?

Good morning, I want to test HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise) on a GPU, so I need to use the RAPIDS framework. When I tried to follow the steps described here…
aydi
  • 11
  • 2
0
votes
1 answer

How to implement Bokeh heatmaps using NVIDIA RAPIDS library for interactive dashboard visualisation?

I am currently experimenting with heatmap-based interactive visualisation using the RAPIDS library. I was able to generate an interactive dashboard (figure 2) using the Cuxfilter plugin (which uses the Bokeh server and library); however, I am only…
Sri
  • 51
  • 1
  • 3
0
votes
1 answer

ModuleNotFoundError: No module named 'BlazingSQL' error after installing RAPIDS with 'python rapidsai-csp-utils/colab/install_rapids.py'

Hello, I successfully ran the following script on Google Colab to install RAPIDS: 'python rapidsai-csp-utils/colab/install_rapids.py'. The options are 'stable' and 'nightly'. Leaving it blank or adding any other words will default to…
0
votes
1 answer

Is there a way to use the entire memory of my GPU for cuML calculations?

I am new to the RAPIDS AI world and decided to try cuML and cuDF for the first time. I am running Ubuntu 18.04 on WSL 2. My main OS is Windows 11. I have 64 GB of RAM and a laptop RTX 3060 GPU with 6 GB of memory. At the time I am writing this post, I am…
0
votes
1 answer

How to accelerate finding all-pairs shortest paths on a GPU using RAPIDS cuGraph?

I'm trying to get a list of nodes via APSP (all-pairs shortest path) and want to use RAPIDS cuGraph for GPU acceleration. I researched a bit and created this script, which works but is very slow. I suppose I'm iterating the wrong way and there…
0
votes
0 answers

Runtime error when running simple cuML code in a Dask environment

I'm trying to test some simple code using two remote workers. I don't know what is going on or what the error refers to. The code is simple: #!/usr/bin/python3 from cuml.dask.cluster import KMeans from cuml.dask.datasets import make_blobs from…
jcfaracco
  • 853
  • 2
  • 6
  • 21
0
votes
1 answer

Implementing GridSearchCV and Pipelines to perform Hyperparameter Tuning for the KNN Algorithm

I have been reading about performing hyperparameter tuning for the KNN algorithm, and understood that the best practice is to make sure that, for each fold, my dataset is normalized and oversampled using a pipeline (to avoid…
0
votes
0 answers

How do you do a grid search with cuml without a datatype error?

I tried doing a grid search with cuml (RAPIDS 21.10) and I get a cupy conversion error. This doesn't happen if I build the model with the same dataset without a grid search. It also works when the data is not in video memory, but it is then…
0
votes
1 answer

Why do I get a CUDA memory error when using RAPIDS in WSL?

I installed WSL 2 (5.10.60.1-microsoft-standard-WSL2) under Windows 21H2 (19044.1348) and am using NVIDIA driver 510.06 with a Pascal GPU (1070). I use the default Ubuntu version in WSL (20.04.3 LTS). I tried both the Docker and Anaconda versions. I can run…
0
votes
0 answers

cuDF rolling UDF not working with CuPy functions

I am trying to write a cuDF UDF that computes the Pearson autocorrelation with lag 1 of a cuDF series. I have defined the following UDF: import cupy as cp def cuda_corr(x): xx=x[:-1] yy=x[1:] coef=cp.corrcoef(xx,y=yy, rowvar=False) …
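For readers unfamiliar with the quantity being computed, here is a minimal sketch of lag-1 Pearson autocorrelation on a plain array. It is not the asker's UDF and does not address the rolling-window problem itself. The helper name `lag1_autocorr` is hypothetical, and the fallback to NumPy is an assumption added so the sketch also runs on a machine without a GPU.

```python
# Use CuPy when a working GPU is present; otherwise fall back to NumPy.
# The fallback is an assumption of this sketch, not part of the question.
try:
    import cupy as xp
    xp.zeros(1)  # touch the device so a missing GPU triggers the fallback
except Exception:
    import numpy as xp


def lag1_autocorr(x):
    """Pearson correlation between a series and itself shifted by one step.

    Hypothetical helper for illustration only.
    """
    x = xp.asarray(x, dtype=xp.float64)
    # corrcoef on two 1-D arrays returns a 2x2 matrix; the off-diagonal
    # entry is the correlation between x[:-1] and x[1:].
    coef = xp.corrcoef(x[:-1], x[1:])
    return float(coef[0, 1])
```

A steadily increasing series such as `[1, 2, 3, 4, 5]` is perfectly correlated with its shifted copy, while a strictly alternating series is perfectly anti-correlated.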
0
votes
1 answer

How can I run GridSearchCV in dask_ml despite this error?

This is my code in Google Colab: import cupy as cp import numpy as np import joblib import dask_ml.model_selection as dcv def ParamSelection(X, Y, nfolds): param_grid = {'C': [0.001, 0.01, 0.1, 1, 10, 100],'kernel':['linear'], 'gamma':[0.001,…
0
votes
1 answer

cuml DBSCAN running slow with numba device array

I'm trying to use cuml from RAPIDS to accelerate DBSCAN clustering of 15 million float64 data points. ''' pp = nb.cuda.to_device(ps) # ps is a (15636915,2) cupy array with cuml.using_output_type('input'): db_gpu = cumlDBSCAN(eps=0.8,…
0
votes
1 answer

What is the most efficient way to do `diff` for a `cudf` DataFrame?

The rapids.ai cudf type is somewhat compatible with pandas, but here is a strange incompatibility. cudf.Series has a .diff() method, but a cudf.DataFrame does not appear to. This is super-annoying (consider, for example, a data frame of stock…
Igor Rivin
  • 4,632
  • 2
  • 23
  • 35
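One common workaround when a frame-level `diff` is unavailable is to subtract a shifted copy of the frame. The sketch below illustrates that idea only; it is not taken from the question or its answer, and makes no claim about being the most efficient approach. The helper name `frame_diff` and the sample data are hypothetical, and the pandas fallback is an assumption added so the sketch runs without a GPU.

```python
# Use cuDF when a working GPU is present; otherwise fall back to pandas.
# The fallback is an assumption of this sketch, not part of the question.
try:
    import cudf as xdf
    xdf.Series([0])  # touch the device so a missing GPU triggers the fallback
except Exception:
    import pandas as xdf


def frame_diff(df, periods=1):
    """Row-wise difference of every column, computed as df - df.shift(periods).

    Hypothetical helper for illustration only.
    """
    return df - df.shift(periods)


# Made-up sample data standing in for a frame of prices.
prices = xdf.DataFrame({"a": [1.0, 3.0, 6.0], "b": [10.0, 8.0, 9.0]})
changes = frame_diff(prices)
```

The first row of `changes` is missing in every column, because there is no earlier row to subtract.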
0
votes
1 answer

ModuleNotFoundError: No module named 'clx'

I am attempting to run the first cell in the Jupyter notebook /rapids/notebooks/clx/alert_analysis/Alert_Analysis_with_CLX.ipynb, which results in the following error: ModuleNotFoundError: No module named 'clx' From an IPython shell running on the Rapids…
carnak
  • 1
  • 2