Questions tagged [c4.5]

C4.5 is a decision tree algorithm invented by Ross Quinlan in 1993.

33 questions
1
vote
0 answers

Numeric Values in C4.5 algorithm

Threshold value Z: –The training samples are first sorted on the values of the attribute Y being considered. There are only a finite number of these values, so let us denote them in sorted order as {v1, v2, …, vm}. –Any threshold value lying between…
Nick
  • 2,818
  • 5
  • 42
  • 60
1
vote
1 answer

Use significant attributes only, or use full set of attributes to build J48 model after checking information gain?

Weka's J48 allows one to check information gain on a full set of attributes, should I use those significant attributes to build my model? Or should I use the full set of attributes?
Guanhua Lee
  • 156
  • 1
  • 12
1
vote
1 answer

I am looking for specific algorithms in Orange

I am learning data mining in class and I am having trouble with the Orange library. I am coming from Weka to Orange and am trying to find the J4.8 and the C4.5 algorithms, but I can't find them listed. A google search takes me to a page that says…
Marjorie
  • 31
  • 1
1
vote
1 answer

WEKA J48 decision tree with non linearly separable data

Does Weka J48 Decision Tree classifier support classification for a problem with intrinsically non linearly separable data? In short, is J48 either a linear or a non linear classifier?
Johan
  • 3,561
  • 9
  • 29
  • 45
1
vote
1 answer

C4.5 Select the split point (threshold) for a Continuous Attribute

Using the "play golf" or "play ball" data (listed at the bottom), to pick the root node we look at Outlook, Temperature, Humidity, and Wind, to see which has the highest GainRatio. Now, Outlook will be chosen as the attribute with the highest…
GreekFire
  • 359
  • 4
  • 15
1
vote
1 answer

Paralleizing implementation of Decision tree ID3/C4.5 on Hadoop

I want to implement decision tree ID3/C4.5 on Hadoop. Can anyone through idea how to go ahead. I am clear about the algorithms but I need to know how to parallelize them.
wmahendra
  • 29
  • 3
0
votes
0 answers

Need some assistance with Algo C4.5, got a recursion when building a decision tree

I need some help with C4.5 algo. I got one task to write that algo by myself and build a decision tree on a train data and then test it. Using data from UCI Machine Learning Repository. I wrote some code, got recursion child in child and it built…
0
votes
1 answer

ML Decision Tree classifier is only splitting on the same tree / asking about the same attribute

I am currently making a Decision tree classifier using Gini and Information Gain and splitting the tree based on the the best attribute with the most gain each time. However, it is sticking the same attribute every time and simply adjusting the…
0
votes
0 answers

Meaning of confidence factor in J48

I try to use J48 classifier from RWeka library in R (C4.5 algorithm). I can parametrize this classifier with C parameter which means 'confidence factor'. What does this value exactly mean? I know that bigger value means that I believe more my…
happysadek
  • 11
  • 1
  • 5
0
votes
1 answer

c4.5 algorithm missing values

How does the C4.5 algorithm deal with missing values and attribute value on continuous interval? Also, how is a decision tree pruned? Could someone please explain with the help of an example.
0
votes
1 answer

Draw a decision tree with results for an instance in Matlab

I have created a decision tree in Weka. I now want to calculate a prediction (with this model) in matlab and visualize the result nicely in the tree. But I haven't been able to find a good way to draw the tree and the results from the instance on…
dorien
  • 5,265
  • 10
  • 57
  • 116
0
votes
1 answer

Identify application based on its packets

Is it possible to identify a particular application by analysing it's packets ? If yes , which attributes of the packets may help identify an application uniquely ? Any help would be appreciated.
0
votes
0 answers

Pruning nodes in a decision tree

iam certainly working on decision algorithm trees and in my research this article came up http://www.slideshare.net/aorriols/lecture6-c45. If you go to slide 12 top-right corner there is an algorithm which i understand completely. The thing i dont…
Nick
  • 2,818
  • 5
  • 42
  • 60
0
votes
1 answer

dealing with the missing value when using C4.5 technique

I'm trying to build a classifier "model" using some classification techniques. Beginning with the C4.5 technique, faced the problem of missing values so: How to deal with the missing values exist in a data-set ? Should I have to stay on "?" in the…
Ibrahim
  • 77
  • 3
  • 10
0
votes
0 answers

How to generate confusion matrix in c45?

I am trying to implement c45 algorithm on Map Reduce and the code here generates only a rule set given some training data. This class contains the main method. public class DecisionTreec45 extends Configured implements Tool { public static…
Codebeginner
  • 193
  • 4
  • 14