I am explaining a HistGradientBoostingRegressor
from sci-kit learn. I use the TreeExplainer from the shap
package to get the SHAP values. My model has 48 features, some of which are strongly correlated. The beeswarm summary plot of the model looks like this:
How is it possible that the SHAP values of some features are consistently negative (or positive) and not centered at 0?
I never saw this anywhere else. Intuitively it does not make sense to me, since the model could just add a constant to avoid the shift.