Random Forests with a Customized Loss Function

Question

I am a complete beginner in machine learning. For a project, I need to use a custom loss function with a Random Forest classifier. I have used scikit-learn so far, so suggestions on implementing this in scikit-learn would be most helpful.

To avoid confusion, it should be pointed out that vanilla Decision Trees and Random Forests do not optimise an explicit loss function in the common sense of the word. You do have a choice, however, over the _split criterion_, which is used to determine when and where to split an internal tree node. — ngmir, May 10 '23 at 07:27
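To make the idea of a split criterion concrete, here is a minimal pure-Python sketch of how an impurity measure can decide where to split a single numeric feature. It is not scikit-learn's implementation; the names `gini`, `entropy`, and `best_split` are hypothetical helpers written for this illustration, and candidate thresholds are assumed to be midpoints between adjacent distinct values.

```python
from collections import Counter
from math import log2


def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy (in bits) of the class proportions."""
    n = len(labels)
    return -sum((count / n) * log2(count / n) for count in Counter(labels).values())


def best_split(values, labels, impurity=gini):
    """Return (threshold, impurity_decrease) for the best binary split.

    Candidate thresholds are midpoints between adjacent distinct feature
    values. Returns (None, 0.0) if no split reduces the impurity.
    """
    n = len(labels)
    parent = impurity(labels)
    best = (None, 0.0)
    distinct = sorted(set(values))
    for low, high in zip(distinct, distinct[1:]):
        threshold = (low + high) / 2
        left = [y for x, y in zip(values, labels) if x <= threshold]
        right = [y for x, y in zip(values, labels) if x > threshold]
        # Impurity of the children, weighted by how many samples each receives.
        children = (len(left) * impurity(left) + len(right) * impurity(right)) / n
        decrease = parent - children
        if decrease > best[1]:
            best = (threshold, decrease)
    return best


x = [1.0, 2.0, 3.0, 4.0]
y = [0, 0, 1, 1]
threshold, decrease = best_split(x, y)
entropy_threshold, entropy_decrease = best_split(x, y, impurity=entropy)
```

Swapping the `impurity` argument is the toy equivalent of choosing a different criterion: the tree-growing procedure stays the same, and only the score used to rank candidate splits changes.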

score 3 · Answer 1 · answered Apr 16 '14 at 10:41

The loss functions (Gini impurity and entropy in the case of classification trees) are implemented in the _tree.pyx Cython file in scikit-learn, where they are called criteria. You can start by modifying or adding to these criteria. If you add a custom loss function (criterion) to the Cython file, you also need to expose it in the tree.py Python file (see the CRITERIA_CLF and CRITERIA_REG dictionaries), and then rebuild scikit-learn so that the modified Cython code is compiled.
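Before editing the Cython source, it may help to see how the built-in criteria are selected from Python. The sketch below assumes a reasonably recent scikit-learn install and uses a synthetic dataset from `make_classification`; the dataset sizes and `n_estimators=50` are arbitrary choices for illustration. The `criterion` string is the name under which a criterion is exposed to the estimators.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic binary classification data, used only for illustration.
X, y = make_classification(n_samples=300, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

scores = {}
for criterion in ("gini", "entropy"):
    # The criterion name selects which built-in split criterion each tree uses.
    forest = RandomForestClassifier(
        n_estimators=50, criterion=criterion, random_state=0
    )
    forest.fit(X_train, y_train)
    scores[criterion] = forest.score(X_test, y_test)  # mean test accuracy
```

A custom criterion added to the Cython file and registered alongside the built-in ones would, presumably, be selected the same way, by passing its registered name as `criterion`.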
