Questions tagged [classification]

In machine learning and statistics, classification is the problem of identifying which of a set of categories a new observation belongs to, on the basis of a training set of data containing observations whose category membership (label) is known.

In machine learning and statistics, classification refers to the problem of predicting category memberships based on a set of pre-labeled examples. It is thus a type of supervised learning.

Some of the most important classification algorithms are support vector machines svm, logistic regression, naive Bayes, random forest random-forest and artificial neural networks neural-network.

When we wish to associate inputs with continuous values in a supervised framework, the problem is instead known as regression. The unsupervised counterpart to classification is known as clustering (or cluster analysis), and involves grouping data into categories based on some measure of inherent similarity.

7859 questions

votes

0 answers

Tensorflow Estimator Feature Column increase weight

I have a DNNLinearCombinedClassifier to predict if an article get sold or not. I need DNN for feature like description and Linear for features like size, category, price, etc. In general it works, but the weight of the price is too low. The price is…

tensorflow classification feature-engineering

asked Apr 27 '20 at 20:12

NiBurhe

votes

1 answer

Accuracy of model got stuck at 50% while training an Age and Gender detection model

I was working through the Keras implementation of Age and Gender Detection model described in the research paper Age and Gender Classification using Convolutional Neural Networks'. It was originally a Caffe model but I thought to convert it to…

tensorflow keras deep-learning classification conv-neural-network

asked Apr 26 '20 at 10:53

Aditya Gupta

votes

0 answers

Python - Compare similarity / classify images with SIFT descriptors quickly

I understand that this is a popular question on Stack Overflow however, I have not managed to find the best solution yet. Background I am trying to classify an image. I currently have 10,000 unique images that a given image can match with. For each…

python opencv classification knn sift

asked Apr 16 '20 at 13:17

brian4342

1,265
8
33
69

votes

2 answers

Python Fraud Detection Classification Algorithms

I am working on a credit card fraud detection model and have labeled data containing orders for an online store. The columns I am working with is: Customer Full Name, Shipping Address and Billing Address (city, state, zip, street), Order Quantity,…

python machine-learning classification data-science fraud-prevention

asked Apr 06 '20 at 02:54

pali

votes

0 answers

using class weights with sklearn votingClassifier

I have an imbalance dataset for a classification problem. My target variable is binary and has two category. I implemented Random Forest and Logistic Regression by assigning class_weights as parameter. When I fit data to random forest and logistic…

scikit-learn classification ensemble-learning imbalanced-data

asked Apr 02 '20 at 13:14

FA05

votes

2 answers

LightGBM : validation AUC score during model fit differs from manual testing AUC score for same test set

I have a LightGBM Classifier with following parameters: lgbmodel_2_wt = LGBMClassifier(boosting_type='gbdt', num_leaves= 105, max_depth= 11, learning_rate=0.03, …

python machine-learning classification auc lightgbm

asked Apr 01 '20 at 13:05

Nayak S

votes

1 answer

Pipeline and GridSearchCV, and Multi-Class challenge for XGBoost and RandomForest

I am working on workflows using Pipeline and GridSearchCV. MWE for RandomForest, as below, ################################################################# # Libraries ################################################################# import…

python machine-learning classification data-science

asked Apr 01 '20 at 04:53

Saravanan K

votes

0 answers

How to classify people's clothes by Gabor filter?

I'd like to identify person from another using Gabor filter. It is working fine but I don't understand how to classify. Does it need for example to SVM as classifier? I understand from this paper that it don't need SVM OR another classifier The full…

python classification textures gabor-filter

asked Mar 30 '20 at 14:07

Redhwan

votes

1 answer

Finding data points close to the decision boundary of a classifier

Sorry if this is a very simple question. But I'm a newcomer to the field. My specific question is this: I have trained an XGboost classifier in Python. After the training, how can I get the samples in my training data that are closer than a fixed…

python classification xgboost

asked Mar 27 '20 at 17:45

iii

votes

1 answer

How to get multi-class roc_auc in cross validate in sklearn?

I have a classification problem where I want to get the roc_auc value using cross_validate in sklearn. My code is as follows. from sklearn import datasets iris = datasets.load_iris() X = iris.data[:, :2] # we only take the first two features. y =…

python machine-learning scikit-learn classification cross-validation

asked Mar 24 '20 at 10:20

EmJ

4,398
9
44
105

votes

4 answers

How to choose n_estimators in RandomForestClassifier?

I'm building a Random Forest Binary Classsifier in python on a pre-processed dataset with 4898 instances, 60-40 stratified split-ratio and 78% data belonging to one target label and the rest to the other. What value of n_estimators should I choose…

python classification random-forest hyperparameters

asked Mar 20 '20 at 03:05

keenlearner

votes

1 answer

How to combine two LSTM layers with different input sizes in Keras?

I have two types of input sequences where input1 contains 50 values and input2 contains 25 values. I tried to combine these two sequence types using a LSTM model in functional API. However since the length of my two input sequences are different, I…

python keras deep-learning classification lstm

asked Mar 14 '20 at 04:17

EmJ

4,398
9
44
105

votes

1 answer

Sklearn different results with the same random_state across different systems (machines)

I have a python script that generates predictions using sklearn Random Forest and fixed random_state = 0. It produces always deterministic results on the one computer (system) but when I switch to another computer, results are different. Is there a…

python machine-learning scikit-learn classification random-forest

asked Mar 11 '20 at 13:47

EnesZ

votes

2 answers

High precision recall for train data but very poor for test data in classification problem

I'm very new to ML and I'm trying to build a classifier for unbalanced binary class for a real life problem. I've tried various models like Logistic regression, Random Forest, ANN, etc but every time I'm getting very high precision and recall…

machine-learning classification

asked Mar 09 '20 at 05:38

vishnu priya

votes

1 answer

F1 - score with imbalanced data

I am working on a binary classification task. My evaluation data is imbalanced and consists of appr. 20% from class1 and 80% from class2. Even I have good classification accuracy on each class type, as 0.602 on class1, 0.792 on class2 if I calculate…

machine-learning statistics classification precision imbalanced-data

asked Mar 06 '20 at 11:54

metalrt

Prev 1 2 3

…

99 100 Next