I am using these two derivatives of GBM in caret. I tried both algorithms on the same dataset, and they return different accuracies and take different amounts of time to run. From the names, I would guess that the first uses a linear function somewhere and the other uses trees. However, I do not understand where it uses the linear function rather than a tree. I know that GBM algorithms use trees as the base predictor; could it be that the first one uses a different structure, for example during training? Where can I find some documentation on this topic?
Thanks