Questions tagged [yellowbrick]

Yellowbrick is a Python visualization library for machine learning. It extends the Scikit-Learn API to provide visual diagnostic tools for classifiers, regressors, clusterers, transformers, pipelines, feature extraction tools and more. This tag should be used to ask questions about how to use visualizers, how to extend or modify visualizations, or how to interpret diagnostics. This tag is commonly used with the scikit-learn and matplotlib tags.

Yellowbrick (sometimes referred to as scikit-yellowbrick) is a Python library that extends the Scikit-Learn API to enhance the machine learning workflow with visual diagnostics with matplotlib. The yellowbrick tag is therefore usually applied in combination with the scikit-learn, python, and matplotlib tags. Good questions for this tag include:

  1. Questions about how to work with or extend existing visualizers
  2. Questions about how to interpret visual results
  3. Questions about how to modify resulting figures or annotate them
  4. Questions about how to create new visualizers

The best questions will include a code example along with the figure being generated by Yellowbrick. In order to allow others to run your code, if the visualization is not data specific (e.g. the questions is not the result of specific input), please use one of the example datasets from the Yellowbrick tutorial or one of Scikit-Learn's dataset generation methods. It is also very helpful if you include the version of Yellowbrick you're using, which can be found with print(yellowbrick.__version__).

For result interpretation questions, please be as general as possible and focused on the content of the visualizer. For example, a good question is "what is the meaning of the macro-average curve on ROC/AUC with more than two classes?" A poor question is "how do I make my model have a higher F1 score?"

Finally, for both folks asking questions and those responding, Yellowbrick contributors and developers take seriously respectful discourse. In addition to be nice, the StackExchange code of conduct, Yellowbrick also follows the Python Software Foundation Code of Conduct

Definitions

Yellowbrick extends the Scikit-Learn API with a new Estimator called a Visualizer. Visualizers are estimators, in that they can be fit with data in order to produce a visualization. Anything that produces a visualization in Yellowbrick is a Visualizer, though there are different types.

A FeatureVisualizer produces a representation of the feature space or data space. These are used to explore the input to models or the relationship of data to the model.

A ModelVisualizer produces a representation of the model space, describing how the model interacts with data or behaves. It does this in two ways, by describing internal parameters of the model, or by describing the relationship to test data with a ScoreVisualizer.

73 questions
0
votes
2 answers

Is there any way to connect to Yellowbrick database in Azure Data Factory

I wanted to connect to Yellowbrick(YB) database (Postgresql) as a source in the Azure Data Factory. The YB instance is on the cloud. And I had set the encryption to 'No Encryption'. In the server name, if I pass the domain name, then the error I get…
0
votes
1 answer

Yellowbrick learningCurve: chage legend

I want to plot the learning curve of train and validation set using yellowbrick learning curve. I am not using cross validation when plotting the learning curve, but a hold out validation set. Anyway, the legend is fixed to "Cross validation…
nopact
  • 195
  • 2
  • 12
0
votes
1 answer

How to change axis labels when using YellowBrick's KElbowVisualizer?

I am using the following code to create a silhouette coefficient plot with KElbowVisualizer: # Import the KElbowVisualizer method # Instantiate a scikit-learn K-Means model model = KMeans(random_state=0) # Instantiate the KElbowVisualizer with…
Thelonious Monk
  • 426
  • 5
  • 19
0
votes
1 answer

Yellowbrick prediction error graph edit lables and legends

I want to plot prediction-error using Yellowbrick visualizer but i am not getting the desired results. The plot is similar to the pp plot or a qq plot which is not correct. Also i am not able to change the labels of the axes and add title nor am i…
0
votes
1 answer

why do running from yellowbrick.target import FeatureCorrelation results in sklearn.metrics.classification error?

My Goal is to avoid the warning message below What I did house_data = pd.read_csv('datasets/house_data_processed.csv') target = house_data['price'] features = house_data.drop('price', axis=1) features.columns from yellowbrick.target import…
George Ng
  • 51
  • 2
  • 7
0
votes
1 answer

'Ridge' is not a CV regularization model; try ManualAlphaSelection instead

I am trying to find the best Alpha for a Ridge model without CV, using Yellowbrick ManualAlphaSelection API. My code is pretty basic and it has been taken from the yellowbrick´s documentation. Even though it does not work: from…
0
votes
1 answer

Set the plot size of KElbowVisualizer

So I have this vector, and I want to cluster them with simple K-Means clustering, but first, I need to look for the optimum k-cluster with the Elbow method. I use the KElbowVisualizer function from the YellowBrick package to find the optimum…
0
votes
1 answer

Is it possible to use pipelines preprocessed X and y in later analysis without a pipeline?

I am preprocessing data via Pipelines, turning categoricals to numeric, encoding etc... and it's very comfortable. But there is instance later in the project, where I want to test out some feature importance and I need to give X and y to the model.…
ValdemarT
  • 77
  • 5
0
votes
1 answer

classification report using YellowBrick

i recently implemented Probabilistic Neural Network on the iris dataset. i was trying to print classification report using YellowBrick classifier but when i run this code i get an error. which is as following. from neupy import algorithms model =…
gendry
  • 67
  • 9
0
votes
1 answer

Need to create a table in yellowbrick database from SAS dataset

We have a SAS dataset. Need to create a table in yellowbrick database with the same structure of SAS dataset without data. Is there a way to create such a table from sas dataset
0
votes
0 answers

how to set colors for yellowbrick residual plot

I want to use yellowbrick Residual plot to show the residuals for of a linear regression model. From the doc's, I can see that Regression_Plot accepts a single color value for the training datasets. train_colorcolor, default: ‘b’ Residuals for…
dubbbdan
  • 2,650
  • 1
  • 25
  • 43
-2
votes
1 answer

How to change xtick of Yellowbrick's Learning Curve visualizer?

I'm trying to change the xtick of Yellowbrick's learning curve figure from number of samples to normalized number(%) of samples. I googled a lot but couldn't find the way.
-2
votes
1 answer

How to remove the title from yellowbrick ROCAUC

I am using yellowbrick to plot the AUCROC. I want to remove the title from the plot, to make it empty without the plot title. model = classifier visualizer = ROCAUC(model, encoder={0: 'class' , 1: 'class2'} visualizer.fit(X_train, y_train) …
user
  • 5
  • 3
1 2 3 4
5