Questions tagged [dask-ml]

79 questions
0
votes
1 answer

train_test_split - Cannot operate on Dask array with unknown chunk sizes

I have a text classification dataset where I used dask parquet to save disk space, but run into the problem now when I want to split the dataset into train and test using dask_ml.model_selection.train_test_split. ddf =…
osterburg
  • 447
  • 5
  • 24
0
votes
1 answer

How to run PCA with dask_ml. I am getting an error, "This function (tsqr) supports QR decomposition in the case of tall-and-skinny matrices"?

I want to perform dimensionality reduction over data with around 3000 rows and 6000 columns. Here the number of observations (n_samples) < number of features (n_columns). I am not able to achieve the result using dask-ml whereas the same is possible…
0
votes
1 answer

How to Create a dask dataframe from from a data string seperated by tabs and new line characters

I've my data in form of a string seperated by \ character (for columns) & by new line \n character for rows. ID\Product\quantity\n1\xx\2 Looks like Dask.array.from_array() support only a array as input. Although I can convert the above text to…
-1
votes
2 answers

Strange behaviour of GridSearchCV with hidden_layer_sizes

GridSearchCV (no matter if from sklearn or from dask) seems to to something strange or wrong with the parameters which leads to the MLPRegressor to ignore the parameter. I show the behaviour in terms of a minimal working example. Assume numerical…
Bastian Ebeling
  • 1,138
  • 11
  • 38
1 2 3 4 5
6