H20 says in the documentation that splitting on a feature for regression gbms is based on the reduction in squared error.
Is this squared error based on the node residuals, i.e., (resid - mean resid)^2 or is it the true response, i.e., (response - mean response)? I'm using gamma/ Poisson distributions.
In the case of gamma/Poisson, the loss is the deviance so why is the squared error used instead?