I'm fairly new to decision trees
and have a bit of trouble interpreting them when I move down branches. I have a few questions regarding the plot that was made on R. The response variable is Survived
(Yes
/No
), which is to be predited by age, fare, number of siblings, and number of parents I attached a decision tree below using Kaggle's Titanic
data-set.
- What do the different colors of green/blue mean?
- How do I interpret the leaf nodes?
- I understand the very top node inteprets to 38% survived, 62% did not survive, and 100% of the population is in that bucket. If I move to the right...how would I interpret Bucket #3? And if I kept going, Bucket #6? Etc etc...