Highest Voted 'mlxtend' Questions

2

votes

2 answers

ColumnTransformer(s) in various parts of a pipeline do not play well

I am using sklearn and mlxtend.regressor.StackingRegressor to build a stacked regression model. For example, say I want the following small pipeline: A Stacking Regressor with two regressors: A pipeline which: Performs data imputation 1-hot…

asked Feb 18 '22 at 09:58

Alberto Santini

6,425
1
26
37

2

votes

2 answers

Why won't Colab import fpgrowth from mlxtend.frequent_patterns?

When I import mlxtend.frequent_patterns, the function fpgrowth and fpmax are not there. However, they are there if I use Jupyter Notebook in Anaconda Navigator. Anyone know why Colab will not import? import pandas as pd from mlxtend.preprocessing…

google-colaboratory mlxtend

asked Oct 03 '21 at 17:09

Pete

21
3

2

votes

2 answers

Convert a list of lists to array type in Python

I have a matrix like this and want to convert it to array for processing. How to do it [[25 3 0 1 0 2 1] [ 1 21 0 0 0 0 0] [ 0 3 18 0 0 0 0] [ 1 0 0 35 2 0 0] [ 0 0 0 4 27 2 0] [ 0 0 0 0 1 27 0] [ 1 1 0 0 0 …

python numpy numpy-ndarray mlxtend

asked Sep 10 '20 at 02:11

user567879

5,139
20
71
105

2

votes

1 answer

What does it mean AttributeError: 'ColumnSelector' object has no attribute 'n_features_in_'?

I am making a grid search for tuning hyperparameters of a stacking estimator(StackingClassifier object from sklearn.ensemble library). I making use of the scikit library for ML, and the RandomizedSearchCV function. In adition to this, the base…

scikit-learn pipeline gridsearchcv imblearn mlxtend

asked Aug 03 '20 at 22:17

Jonathan

23
3

2

votes

2 answers

Is it possible to set the color for the bottom region with `mlxtend.plotting`?

I am trying to reproduce the example in this post, which produces this figure. The colored regions above are plotted by mlxtend.plotting (version '0.14.0'). With the default settings on colab, this code from mlxtend.plotting import…

python mlxtend

asked Sep 17 '19 at 10:43

user11566345

2

votes

0 answers

My StackingCVClassifier Has Lower Accuracy than Base Classifiers Yet Does Very Well on Test Set

I built a simple Stacking Classifier with mlxtend and am trying different base classifiers and I am facing an interesting situation. From all my research it seems to me that stacking classifiers always perform better than their base classifiers. In…

machine-learning scikit-learn ensemble-learning mlxtend

asked Jan 10 '19 at 01:52

Odisseo

747
1
13
32

2

votes

2 answers

market basket analysis in python for large transaction dataset

On applying apriori (support >= 0.01) and association_rules functions using mlxtend package of python on 4.2L+ rows transaction data (in the form of sparse matrix) , generation of frequent item sets and association rules takes too much time. Sample…

python sparse-matrix apriori market-basket-analysis mlxtend

asked Oct 31 '18 at 05:55

Sumesh Iyer

21
1
4

2

votes

0 answers

scikit-learn mlxtend EnsembleVoteClassifier with sample_weights

I am trying to fit an EnsembleVoteClassifier according to mlxtend documentation For normal grid.fit I can use fit_params to set sample_weight, but with the VotingClassifier it does not work. How can this be solved? from sklearn import datasets iris…

machine-learning scikit-learn ensembles mlxtend

asked Mar 29 '18 at 01:18

user670186

2,588
6
37
55

2

votes

1 answer

mlextend plot_decision_regions with model fit on Pandas DataFrame?

I'm a big fan of mlxtend's plot_decision_regions function, (http://rasbt.github.io/mlxtend/#examples , https://stackoverflow.com/a/43298736/1870832) It accepts an X(just two columns at a time), y, and (fitted) classifier clf object, and then…

python pandas machine-learning data-visualization mlxtend

asked Mar 08 '18 at 07:49

Max Power

8,265
13
50
91

1

vote

0 answers

Slurm Cluster Python Script Not Running on Multiple Nodes using SBATCH

We recently setup a Slurm Cluster with 2 Nodes(1 headnode+compute node and 1 compute nodes) for some HPC CFD simulations.Right now i am trying to run some python script which is used for feature selection in one of our Machine learning project which…

python cluster-computing slurm feature-selection mlxtend

asked Jul 27 '23 at 11:32

akhil kumar

1,598
1
13
26

1

vote

1 answer

How to scan the candidate itemset by using the item matrix

I am doing a small data mining project and I encountered a problem that is, to scan the 'item matrix' and count the occurrence of each candidate itemset. This is the what candidate itemsets look like. It is a list of several frozensets. [{'', '',…

python pandas matrix apriori mlxtend

asked Sep 29 '22 at 21:43

Cooper

73
6

1

vote

1 answer

Scaling and data leakage on cross validation and test set

I have more of a best practice question. I am scaling my data and I understand that I should fit_transform on my training set and transform on my test set because of potential data leakage. Now if I want to use both (5 fold) Cross validation on my…

python machine-learning scikit-learn cross-validation mlxtend

asked Jun 29 '22 at 23:39

Birk

59
5

1

vote

0 answers

Issue in calculate variance,bias python using mlxtend

I am using mlxtend lib for bias,variance calculation. The code is, y=df[target] x=df.drop(target,axis=1) x_train, x_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=1) model = LinearRegression() mse, bias, var =…

python scikit-learn mlxtend

asked Aug 31 '20 at 14:33

sundarr

385
2
8

1

vote

1 answer

Find corresponding rows with frequent itemsets

My dataset is an adjacency matrix comparable with customer buying information. An example toy dataset: p = {'A': [0,1,0,1], 'B': [1,1,1,1], 'C': [0,0,1,1], 'D': [1,1,1,0]} df = pd.DataFrame(data=p) df Now I am interested in the frequent itemset so…

python apriori mlxtend

asked Jul 03 '20 at 10:21

Tox

834
2
12
33

1

vote

1 answer

How to interpret results of Mlxtend's association rule

I am using mlxtend to find association rules: Here is the code: df = apriori(dum_data, min_support=0.4, use_colnames=True) rules = association_rules(df, metric="lift", min_threshold=1) rules2=rules[ (rules['lift'] >= 1) & (rules['confidence'] >=…

python-3.x mlxtend fpgrowth

asked Jun 18 '20 at 19:00

MAC

1,345
2
30
60

Questions tagged [mlxtend]