Questions tagged [dask-ml]
79 questions
0
votes
1 answer
train_test_split - Cannot operate on Dask array with unknown chunk sizes
I have a text classification dataset where I used dask parquet to save disk space, but run into the problem now when I want to split the dataset into train and test using dask_ml.model_selection.train_test_split.
ddf =…

osterburg
- 447
- 5
- 24
0
votes
1 answer
How to run PCA with dask_ml. I am getting an error, "This function (tsqr) supports QR decomposition in the case of tall-and-skinny matrices"?
I want to perform dimensionality reduction over data with around 3000 rows and 6000 columns. Here the number of observations (n_samples) < number of features (n_columns). I am not able to achieve the result using dask-ml whereas the same is possible…
0
votes
1 answer
How to Create a dask dataframe from from a data string seperated by tabs and new line characters
I've my data in form of a string seperated by \ character (for columns) & by new line \n character for rows.
ID\Product\quantity\n1\xx\2
Looks like Dask.array.from_array() support only a array as input.
Although I can convert the above text to…

naresh chava
- 1
- 1
-1
votes
2 answers
Strange behaviour of GridSearchCV with hidden_layer_sizes
GridSearchCV (no matter if from sklearn or from dask) seems to to something strange or wrong with the parameters which leads to the MLPRegressor to ignore the parameter.
I show the behaviour in terms of a minimal working example.
Assume numerical…

Bastian Ebeling
- 1,138
- 11
- 38