I've just started using scikit-learn after years of data mining with SAS/SPSS products. I'm amazed by the capabilities of scikit-learn and pandas, but there is one thing I can't figure out by myself. Let us assume that my training data is made up of integers, some of which encode categorical values. Is there any way I can control how a decision tree or a tree ensemble (e.g. ExtraTreesClassifier) interprets these variables? Is the proper way to change the variable type from int to object, or is there a common trick I should learn?
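
To make the question concrete, here is a minimal sketch of the setup. The column names and the data are made up for illustration, and the one-hot step via `pandas.get_dummies` is only one workaround I have seen mentioned, not something I know to be the recommended approach:

```python
import pandas as pd
from sklearn.ensemble import ExtraTreesClassifier

# Hypothetical toy data: "color_code" is really categorical
# (say 0=red, 1=green, 2=blue), while "age" is a genuine number.
df = pd.DataFrame({
    "color_code": [0, 1, 2, 0, 1, 2, 0, 1],
    "age": [23, 35, 41, 29, 52, 38, 47, 31],
})
y = [0, 1, 0, 0, 1, 0, 0, 1]

# Fitting on the raw integers: as far as I can tell, the splits
# treat color_code like any other ordered numeric feature.
clf_raw = ExtraTreesClassifier(n_estimators=10, random_state=0)
clf_raw.fit(df, y)

# Possible workaround (an assumption on my part): one-hot encode the
# categorical column so each category becomes its own 0/1 feature.
df_onehot = pd.get_dummies(df, columns=["color_code"])
clf_onehot = ExtraTreesClassifier(n_estimators=10, random_state=0)
clf_onehot.fit(df_onehot, y)

print(df_onehot.columns.tolist())
```

What I would like to know is whether something like the second variant is necessary, or whether the estimator can be told directly which columns are categorical.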
Thanks, dealah