Questions tagged [rapids]

RAPIDS is a framework for accelerated machine learning and data science on GPUs

Questions pertaining to RAPIDS. From https://rapids.ai/ :

The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposes that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

RAPIDS also focuses on common data preparation tasks for analytics and data science. This includes a familiar DataFrame API that integrates with a variety of machine learning algorithms for end-to-end pipeline accelerations without paying typical serialization costs. RAPIDS also includes support for multi-node, multi-GPU deployments, enabling vastly accelerated processing and training on much larger dataset sizes.

195 questions
0
votes
1 answer

Wider points cuxfilter / Datashader scatter?

How can I get wider points to display in this datashader.scatter plot of cuxfilter? Or less whitespace between columns? This is a scatter plot of passenger_count (x) vs tip_amount (y) on some TLC yellow cab trip data. The goal is to have something…
gumdropsteve
  • 70
  • 1
  • 14
0
votes
1 answer

How to do a matrix dot product between two DataFrame in the GPU with rapids.ai

I'm using CUDF it's part of the rapids ML suite from Nvidia. Using this suite how would I do a dot product between two DataFrame? a = cudf.DataFrame([[0.1, 0.2, 0.3, 0.4], [0.1, 0.2, 0.3, 0.4]]) b = cudf.DataFrame([[0.1, 0.2], [0.1,…
MrJasonLi
  • 21
  • 3
0
votes
1 answer

Exception using CuDF apply_chunks - Use of unsupported NumPy function 'numpy.ones_like' or unsupported use of the function

I am trying to use numpy from within jit optimized code of Numba but I am getting errors when I am trying to do standard numpy operations like numpy.ones_like, even though numba documentation mentions that the operation is supported. Documentation…
Strider
  • 1
  • 5
0
votes
1 answer

Sklearn Kernel Density Data Type

I need to specify the dtype (data type) for sklearn's Kernel Density Function within a definition block from nvidia's rapids cudf library. In Python 3.7, I am able to find type information, but for some reason, it is not considered an accepted data…
ash291
  • 1
  • 1
0
votes
2 answers

Rapids CUML Random Forest Regression Model Inference

I am using the Random Forest Regression model from CUML 0.10.0 library on Google Colab and having trouble with obtaining model predictions. After the model training ends successfully, I am using the (.predict) method for inference on a very large…
0
votes
1 answer

Can rapids kneighborsclassifiers and kneighborsregressor be run with rapids?

I want to run kneighborregressor and kneighborclassifier on GPU. However, I cannot find out them in RAPIDS. Are these functions currently implemented or planned to be developed?
0
votes
1 answer

Is it possible to run RAPIDS on Debian?

I wanted to try RAPIDS but I have seen that it is only compatible with Ubuntu 16.04/18.04 and CentOS 7. Since I have already a completely operational Debian 9 system installed on my workstation, and since I know Ubuntu is a Debian derived distro, I…
0
votes
1 answer

Installing CUDA + RAPIDS on Ubuntu - "no kernel image is available"

EDIT: Solved in the comments below. I'm trying to get started with CUDA + RAPIDS. To do this, I've launched a VM on Google Compute using Ubuntu 18.04 and a NVIDIA Tesla K80. Here are the commands I've run in order to get the software installed: wget…
0
votes
1 answer

What to use in place of pandas.Series.filter?

pandas -> cuDF Converting some python written for pandas to run on rapids pandas temp=df_train.copy() temp['buildingqualitytypeid']=temp['buildingqualitytypeid'].fillna(-1) temp=temp.groupby("buildingqualitytypeid").filter(lambda x:…
gumdropsteve
  • 70
  • 1
  • 14
0
votes
2 answers

Equivalent of pd.Series.str.slice() and pd.Series.apply() in cuDF

I am wanting to convert the following code (which runs in pandas) to code that runs in cuDF. Sample data from .head() of Series being manipulated is plugged into OG code in the 3rd code cell down -- should be able to copy/paste run. Original code in…
gumdropsteve
  • 70
  • 1
  • 14
0
votes
3 answers

Replace values in Column C where value in Column A is x

Issue In process of replacing null values so column is boolean, we find null values in fireplace_count column. If fireplaceflag value is False the fireplace_count null value should be replaced with 0 written for…
gumdropsteve
  • 70
  • 1
  • 14
0
votes
1 answer

'nvstrings' object has no attribute 'to_gpu_array'

I'm using cuML for stochastic gradient descent. I used sklearn's train_test_split to generate the splits for train_X, train_y ... from a cuDF dataframe. The following code (I removed the hyperparameters which aren't relevant to this question): from…
Sterls
  • 723
  • 12
  • 22
-1
votes
1 answer

DASK CUDA on multi node EMR cluster is unable to detect nodes

I have setup an AWS EMR cluster using 10 core nodes of type g4dn.xlarge (each machine/node conatins 1 GPU). When I run the following commands on Zeppelin Notebook, I see only 1 worker allotted in my LocalCUDACluster: from dask_cuda import…
Putt
  • 299
  • 4
  • 10
-1
votes
2 answers

Is there a method to find girvan newman using CuGraph?

I have been using the Girvan-Newman algorithm from networkx to find the modularity of a network with 4039 nodes and 88,234 edges. Due to the nature of the algorithm, it was running overnight, and wouldn't complete. Hence I paid for colab pro and I…
-1
votes
1 answer

.data function in cuDF returning none

I am trying to make some operations using nvstrings but .data is returning None import cudf sents = cudf.read_csv("train.csv", quoting=3, skiprows=1, names=['review', 'label']) gstr = sents['review'].data print(gstr) -> None dataset…
Md Kaish Ansari
  • 251
  • 2
  • 7
1 2 3
12
13