I have a dataset from industry and testing the classification performance using Decision Tree (DT), Random Forest (RF) and ensemble classifiers (EL) such as Bagging, Boosting, etc.
- The issue is that I am getting fairly similar accuracy for all classifiers. Are the RF and EL are dependent on DT?
- Is it fair to draw a performance comparsion between DT, RF and EL in academic papers?
I looked through the existing questions (1), (2) but the objectives are different from my question. Python
from sklearn.tree import DecisionTreeClassifier % Decision Tree
from sklearn.ensemble import RandomForestClassifier % Random forest
from sklearn.ensemble import AdaBoostClassifier % Ensemble learner
MATLAB
Model = fitctree(X,Y) % % Decision Tree
Model = fitensemble(X,Y,'category_encoded','Bag',100,'Tree','Type','classification'); % Random forest
Model = fitcensemble(X,Y) % Ensemble learner