Questions tagged [iris-dataset]

Relates to the Iris flower dataset, published in Ronald Fisher's 1936 paper "The use of multiple measurements in taxonomic problems".

Questions regarding the dataset and its application, for example in statistics and machine learning, are appropriate for this tag.

114 questions
0
votes
0 answers

How to solve a problem with null values in SVM and GA

Unfortunately, I have a problem with my code in R. I am trying to use a GA to tune hyperparameters, but I received null values, so it is impossible to train the SVM. Do you have any idea how to solve the…
mzwk
  • 5
  • 2
0
votes
0 answers

My Perceptron algorithm gives me a single training example prediction on Python for Iris dataset

I'm new to machine learning. My Perceptron code works, but only for the first training example. Also, b_final is an n-dimensional array instead of a scalar. Can you help me with these issues? The X.shape = (150,4) and y.shape = (100,). The…
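For readers landing on this question, a minimal perceptron sketch in NumPy may help illustrate a scalar bias and a shape check. It is not the asker's code: it assumes binary labels in {-1, +1}, the four rows are a few Iris measurements typed in by hand (two setosa, two versicolor), and the names `train_perceptron`, `predict`, `w`, and `b` are hypothetical.

```python
import numpy as np

def train_perceptron(X, y, epochs=20, lr=1.0):
    """Classic perceptron rule; y is assumed to hold -1/+1 labels."""
    if X.shape[0] != y.shape[0]:
        # Mismatched lengths make zip() silently drop the extra rows
        raise ValueError(f"X has {X.shape[0]} rows but y has {y.shape[0]} labels")
    w = np.zeros(X.shape[1])  # one weight per feature
    b = 0.0                   # the bias is a single scalar
    for _ in range(epochs):
        errors = 0
        for xi, yi in zip(X, y):
            # Update only when the example is misclassified (or on the boundary)
            if yi * (np.dot(w, xi) + b) <= 0:
                w += lr * yi * xi
                b += lr * float(yi)
                errors += 1
        if errors == 0:  # every example classified correctly
            break
    return w, b

def predict(X, w, b):
    """Return -1/+1 predictions for every row of X."""
    return np.where(X @ w + b > 0, 1, -1)

# Hand-typed illustrative rows: two setosa (-1) and two versicolor (+1)
X = np.array([
    [5.1, 3.5, 1.4, 0.2],
    [4.9, 3.0, 1.4, 0.2],
    [7.0, 3.2, 4.7, 1.4],
    [6.4, 3.2, 4.5, 1.5],
])
y = np.array([-1, -1, 1, 1])

w, b = train_perceptron(X, y)
print(predict(X, w, b), b)
```

Because `b` is updated with a scalar label rather than a feature vector, it stays zero-dimensional; adding `xi` to the bias by mistake is one way it could turn into an array.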
0
votes
0 answers

Plot decision region does not correspond to my accuracy

I've made some SVM model and I have a problem with some parameters. Here is my code : fig, ax = plt.subplots(3,3, figsize=(15,10)) index = 1 for a,i in zip(ax,C): for b,j in zip(a,G): poly = SVC(kernel='poly', C = i, gamma = j, degree =…
Jonah
  • 3
  • 2
0
votes
0 answers

Feature scaling/normalization on data which has the same unit of measurement(ex. cm)

As I understand feature scaling/normalization, it is needed because features are measured in different units and must be brought to a uniform scale. In my opinion, this only applies to data with different measurement units (ex.…
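One way to examine this is to look at the per-feature spread directly. The sketch below standardizes a handful of Iris rows (all in cm, typed in by hand for illustration) with plain NumPy; the variable names are hypothetical, and whether scaling actually helps depends on the model being used.

```python
import numpy as np

# Columns: sepal length, sepal width, petal length, petal width (all in cm)
X = np.array([
    [5.1, 3.5, 1.4, 0.2],
    [4.9, 3.0, 1.4, 0.2],
    [7.0, 3.2, 4.7, 1.4],
    [6.4, 3.2, 4.5, 1.5],
    [6.3, 3.3, 6.0, 2.5],
    [5.8, 2.7, 5.1, 1.9],
])

mean = X.mean(axis=0)
std = X.std(axis=0)

# Same unit, but the spread still differs from column to column
print("std per feature:", std.round(2))

# Standardize: zero mean and unit variance per column
X_scaled = (X - mean) / std
print("std after scaling:", X_scaled.std(axis=0).round(2))
```

Even with a shared unit, the columns can have quite different ranges, which is what distance-based or gradient-based methods tend to be sensitive to.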
0
votes
0 answers

Binary classification with genetic programming

I'm using the Python DEAP library to build a binary classifier, but I'm having trouble understanding the documentation. Does anyone know of or have a document that explains this library more simply? Any different ideas on how to get to the solution are…
0
votes
1 answer

mini-batch gradient descent, loss doesn't improve and accuracy very low

I’m trying to implement mini-batch gradient descent on the popular iris dataset, but somehow I don’t manage to get the accuracy of the model above 75-80%. Also the loss does not decrease and is rather stuck at around 0.45, even when I set the number…
0
votes
1 answer

Getting this error on Kaggle: "ImportError: cannot import name 'DecisionBoundaryDisplay' from 'sklearn.inspection'"

I have searched for this error on Stack Overflow and people have asked about it, but I'm working in Kaggle, which doesn't require installing or setting up an environment or libraries. Help me out with this. import…
0
votes
1 answer

How to create shinyapp for multiple histogram plots?

I am working with the iris dataset to create histograms for the four numerical variables with individual sliders for specifying a maximum value for the variable, i.e., the slider for “Sepal.Width” could be adjusted from 2.2 to 4.4 and if the user…
Tommy
  • 39
  • 5
0
votes
0 answers

Should we do label encoding to our target variable while doing a classification problem

I am a beginner in machine learning and I want to know if it's okay not to label encode our y variable before applying logistic regression or other classification algorithms to the data. Or should we label encode the target variable before fitting on…
0
votes
0 answers

Why a smaller neural network is mapping all the data to a fixed point

I have trained a small neural network (4, 3, 1) on a small dataset (150 records, 4 features, 1 class). I trained the model with mini_batch_size = 1 for 10 epochs. The trained model now maps all the test data to a fixed value. Could…
0
votes
3 answers

Pandas Concat not appending to end of dataframe

I'm making a decision tree based on the iris dataset. Our professor asked us to select 40 data points from each iris type, which is why I ran train_test_split 3 times, once for each flower type. Then I'm supposed to Kfold the training set. As there are 3…
Watson221
  • 73
  • 7
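The excerpt above is cut off, so the actual cause is unknown. As a general illustration, the sketch below shows two `pd.concat` behaviours that commonly surprise people: it returns a new DataFrame instead of modifying its inputs, and it keeps the original row labels unless `ignore_index=True` is passed. The frames and column names are made up for the example.

```python
import pandas as pd

# Two small per-species frames (illustrative values and column names)
setosa = pd.DataFrame({"sepal_length": [5.1, 4.9], "species": "setosa"})
versicolor = pd.DataFrame({"sepal_length": [7.0, 6.4], "species": "versicolor"})

# Without ignore_index the original labels 0, 1 repeat in the result
kept_labels = pd.concat([setosa, versicolor])
print(list(kept_labels.index))

# With ignore_index=True the rows are renumbered 0..n-1
combined = pd.concat([setosa, versicolor], ignore_index=True)
print(list(combined.index))

# The inputs are untouched; the result has to be assigned to a name
print(len(setosa), len(combined))
```

Duplicate row labels can make later label-based selection (for example with `.loc`) return more rows than expected, which may look as if the concatenation did not append correctly.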
0
votes
0 answers

How to find out range (min & max) value in sepal length column for each flower types

There are three types of flowers in the attached dataset (iris dataset), and I want to find the range of sepal_length for each flower type. How can I do it? The dataset has 5 columns, namely SEPAL_LENGHT, SEPAL_WIDTH, PETAL_LENGTH, PETAL_WIDTH &…
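A per-group minimum and maximum of this kind can be sketched with a pandas `groupby`. The example below is an assumption-laden illustration: the fifth column's name is cut off in the excerpt, so `SPECIES` is a hypothetical stand-in, `SEPAL_LENGHT` is spelled as it appears in the excerpt, and the six rows are typed in by hand rather than loaded from the asker's file.

```python
import pandas as pd

# Hand-typed illustrative rows; "SPECIES" is a hypothetical column name
df = pd.DataFrame({
    "SEPAL_LENGHT": [5.1, 4.9, 7.0, 6.4, 6.3, 5.8],  # spelled as in the excerpt
    "SPECIES": ["setosa", "setosa", "versicolor",
                "versicolor", "virginica", "virginica"],
})

# Minimum and maximum sepal length for each flower type
ranges = df.groupby("SPECIES")["SEPAL_LENGHT"].agg(["min", "max"])

# Width of the interval, if a single number per type is wanted
ranges["range"] = ranges["max"] - ranges["min"]
print(ranges)
```

The result has one row per flower type, indexed by the grouping column, so a single value can be read with `ranges.loc["setosa", "min"]`.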
0
votes
1 answer

Extracting the observations of two species from the iris dataset in R

The iris dataset contains 150 observations of three plant species (setosa, versicolor and virginica), with 50 observations of each species. I would like to create a new dataframe, called "a", containing only the observations of two of these species…
goshawk
  • 73
  • 5
0
votes
1 answer

Adjust color of geom_vline in ggplot based on group

So I am creating density plot for Sepal.Length for two species of the iris-dataset. The graph looks like this: As you can see, I added two measures of central tendency to the graph (Median and Mean). However, right now they are colored depending on…
Maximilian
  • 89
  • 1
  • 7
0
votes
1 answer

Matplotlib is not showing my scatterplot?

When I use plt.show the plot only shows the PCA lines and not a scatterplot of the first 2 iris features import numpy as np import matplotlib.pylab as plt from sklearn import decomposition x = np.load("iris_features.npy")[:, :2] y =…