Questions tagged [auc]

The area under the ROC curve can be thought of as a single scalar representation of the ROC curve itself. The AUC of a classifier has the property of being equivalent to the probability that the classifier will rank a randomly chosen positive data point higher than a randomly chosen negative data point.

The area under the ROC curve can be thought of as a single scalar representation of the ROC curve itself. Since this value represents part of the area of a 1x1 square, the AUC is a value between 0.0 and 1.0. However, since a classifier should always perform better than random, the realistic domain of the AUC values should be 0.5 to 1.0. The AUC of a classifier has the property of being equivalent to the probability that the classifier will rank a randomly chosen positive data point higher than a randomly chosen negative data point [Fawcett, 2006]. It can be shown that the AUC is related to the Gini coefficient [Hand et al, 2001]. The AUC can be estimated using trapezoidal approximation by considering the interval between consecutive points [Hand et al, 2001]

Fawcett, Tom. 2006. “An Introduction to ROC Analysis.” Pattern Recognition Letters 27 (8) (June): 861–874. doi:10.1016/j.patrec.2005.10.010.

Hand, David J, and Robert J Till. 2001. “A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems.” Machine Learning 45 (2) (January 1): 171–186. doi:10.1023/A:1010920819831.

535 questions
0
votes
1 answer

Area under curve

curve(-12 * log(x) - (415 / x), 25, 50) abline(h = -55, lty = 2) I would like to plot the area between the curve and solid line and shade it, but couldn't get to do it. I tried using the trapz function from pracma package. Any suggestions would be…
0
votes
1 answer

Calculate AUC and GAM and set a scale in R

I have a data form as follows: x y chla sst ssha eke tuna : : : : : : : : : : : : : : I used a GAM model as follows: GAM <- gam(tuna~s(chla), family = binomial, data = nonLinear) By…
ahmad
  • 21
  • 1
  • 5
0
votes
1 answer

R function colAUC in package caTools fails with large samples

For example: require(caTools) colAUC(runif(90000), sample(c(0,1), 90000, replace = TRUE)) [,1] 0 vs. 1 0.5000629 works fine, however colAUC(runif(100000), sample(c(0,1), 100000, replace = TRUE)) gives [,1] 0 vs. 1 …
James King
  • 6,229
  • 3
  • 25
  • 40
0
votes
2 answers

Calculate many AUCs in R

I am fairly new to R. I am using the ROCR package in R to calculate AUC, which I can do for one predictor just fine. What I am looking to do is perform many AUC calculations for 100 different variables. What I have done so far is the…
-1
votes
1 answer

How to find AUC value of Decision Tree?

Decision Tree I have found Misclassification rates for all the leaf nodes. samples = 3635 + 1101 = 4736, class = Cash, misclassification rate = 1101 / 4736 = 0.232. samples = 47436 + 44556 = 91992, class = Cash, misclassification rate = 44556 /…
-1
votes
1 answer

AUC 1, but accuracy <100%

When testing a binary classifier I get an accuracy of 83% (when the threshold is set to 0.5) however when I workout the ROC and AUC I get an AUC value of 1, which I believe is incorrect as in this case I should be getting an accuracy of 100? I have…
user3165683
  • 347
  • 1
  • 9
  • 28
-1
votes
1 answer

How do we interpret AUROC graph ? I have applied 100 fold cross validation. How to interpret multiple line graphs in AUROC?

I have applied Random Forest algorithm on the dataset having 203 classes. I have applied 100 fold cross validation. The metric i used is auroc. what does this auroc graph represents that I got given below?
AkAnKsHa BaLi
  • 37
  • 1
  • 1
  • 12
-1
votes
1 answer

How to calculate ROC curve in python using my own classified dataset?

I have a dataset that contains two columns: i) Column A: Absolute binary values (0 and 1) represent the ground truth. ii) Column B: Values ranging from 0 to 1 (e.g., 0.2, 0.33, 0.82) represent my new new test results. How can I calculate the…
Medic00
  • 11
  • 3
-1
votes
2 answers

Please help to fix it: TypeError: predict_proba() missing 1 required positional argument: 'X'

I was building a binary classifier using the random forest classifier. Before it, I did a feature selection based on the high AUC score. However, when I wanted to get AUC for this model I couldn't. Here is the code below. Sorry for the lack of the…
Shu
  • 67
  • 6
-1
votes
1 answer

Area under curve of a dataframe column

I'm trying to calculate the cumulative AUC of a dataframe values from first row to the current row. Ex: points AUC 0 0 0 1 1 0.5 2 2 1 3 3 4.5 4 4 8 5 5 12.5 6 4 17 7 0 19 8 -2 18 9 -2 16 I can use np.trapz() but I…
-1
votes
1 answer

How to use AUC packet? to calculate AUC

I train and make prediction with the following example : ======================= The outputs are : and : Now I need to calculate AUC using the AUC package, but I can't quite understand how to do it? auc(roc(????, ?????)) Thanks Manel
-1
votes
1 answer

AUC curve didn't even show on the plot

After using logistic regression to predict stuff, this is the confusion matrix I got: True Positives: 3 False Positives: 1309 True Negatives: 12361 False Negatives: 4 The roc_auc_score is here: roc_auc_score(y_test,…
Shane Li
  • 73
  • 1
  • 9
-1
votes
1 answer

How to calculate f1 score using multiprocessing in python?

I've got an array of paired binary labels: y_true, y_pred. My array contains ~50 million elements, and I wish to evaluate success using f1 score preferably, or AUC. However, calculating f1 using sklearn takes relatively long time – about half the…
-1
votes
1 answer

In Classification, what is the difference between the test accuracy and the AUC score?

I am working on a classification-based project, and I am evaluating different ML models based on their training accuracy, testing accuracy, confusion matrix, and the AUC score. I am now stuck in understanding the difference between the scores I get…
Arsik36
  • 277
  • 3
  • 16
-1
votes
1 answer

HOW TO FIND AND PLOT AUC score from TPR and FPR

I have True positive rate and False positive rate for different thresholds. Now I need to compute AUC_ROC curve without using sckit-learn library. TPR and FPR values are below: TPR =…
1 2 3
35
36