Questions tagged [gbm]

R package gbm, implementing Generalized Boosted Regression Models library.

This package implements extensions to Freund and Schapire’s AdaBoost algorithm and Friedman’s gradient boosting machine.

Includes regression methods for least squares,absolute loss, t-distribution loss, quantile regression,logistic, multinomial logistic, Poisson, Cox proportional hazards partial likelihood, AdaBoost exponential loss, Huberized hinge loss, and Learning to Rank measures (LambdaMart).

Who's using gbm?

The gbm package is used in examples in Software for Data Analysis by John Chambers.

gbm is also used in Elements of Statistical Learning by Hastie, Tibshirani and Friedman.

Richard A. Berk also uses gbm in his book, Statistical Learning from a Regression Perspective.

Source: gradientboostedmodels

330 questions

votes

1 answer

Is Gradient Boosting regression be more accurate (lower MSE) than the random forest?

I just created a Gradient Boosting model whose out-of-sample prediction is worse than the random forest. The MSE of GBM is 10% higher than the random forest. Below is my sample code. I am sure whether there is any wrong with it. gbm1 <-…

r random-forest gbm

asked Jun 09 '15 at 18:40

YYY

votes

0 answers

R: A very big cross-validation error

I have 303 data points in the train set (see the picture). Many of these points are equal to 0 on the Y axis. Now I want to train the GBM model to predict the Y value. Here is my model: train.subset<- data.frame(yval=train$yval, …

r r-caret gbm

asked Mar 23 '15 at 13:50

Klausos Klausos

15,308
51
135
217

votes

1 answer

GBM and Caret package: invalid number of intervals

Though I am defining that target <- factor(train$target, levels = c(0, 1)), the below-given code provides this error: Error in cut.default(y, unique(quantile(y, probs = seq(0, 1, length = cuts))), : invalid number of intervals In addition:…

r r-caret gbm

asked Feb 20 '15 at 19:48

Klausos Klausos

15,308
51
135
217

votes

1 answer

Possible bug in Caret function predict.gb()?

It seems to me that I’ve discovered a bug in the performance of the predict() function for method=gbm in the Caret package in R. I'm curious to know if others agree, or if someone has an explanation for the behavior of this function. 1. Generate…

r predict r-caret gbm

asked Feb 05 '15 at 05:16

Jeff W

votes

1 answer

can't delete y axis label in plot.gbm

Run-on question following this problem setting axis widths in gbm.plot; I'm now using plot.gbm directly and don't seem to be able to remove the y axis label, which seems to be set within the plot.gbm function…

r plot axis-labels gbm

asked Sep 04 '14 at 21:03

dez93_2000

1,730
2
23
34

votes

1 answer

(R) Axis widths in gbm.plot

Hoping for some pointers or some experiences insight as i'm literally losing my mind over this, been trying for 2 full days to set up the right values to have a function spit out clean simple line plots from the gbm.plot function (packages dismo &…

r plot axis-labels par gbm

asked Sep 03 '14 at 00:08

dez93_2000

1,730
2
23
34

votes

1 answer

Grandient Boosting Regressor : "TypeError: cannot perform reduce with flexible type"

I am new in python and I am trying to developp a program with Gradient Boosting Regressor. I have two big sets of data, one training set and one test set in wich I have exactly the same columns. My goal is to predict the SeriousDlqin2yrs column of…

python gbm

asked Mar 28 '14 at 09:14

user3471868

votes

1 answer

Representing a class by negative number for training in GBM in R

I have already generated a huge train file of ~1GB in which the ground truth is one of 4 classes {-1,0,1,2}. Yes, I agree that I could have done it using 3 instead of -1, but for some conventional reasons I used -1. I want to use gbm in multinomial…

r classification supervised-learning gbm

asked Jan 04 '14 at 17:42

Bit Manipulator

-1

votes

1 answer

How to find the optimum number of estimators using "OOB" method in sklearn boosting?

The gbm package in R has a function gbm.perf to find the optimum number of trees for the model using different methods like "Out-of-Bag" or "Cross-Validation" error, which helps to avoid over-fitting. Does Gradientboosting inScikit learn library in…

python scikit-learn hyperparameters gbm

asked Jul 26 '22 at 23:00

lakshman thota

-1

votes

1 answer

can i take best parameters and best model of optuna function and apply this model directly in my notebook?

i esttablished a function of optuna to find out best model of gbm and xgboost for my data but i was wondering if i can take the best model and apply it directly into my notebook(extracting best model as an object to reuse it later) here is my…

python machine-learning gbm optuna

asked Mar 29 '22 at 08:19

Baraa najjar

-1

votes

1 answer

Why does my code take so long to process?

I try to run code from this web site in my computer. I use data set from kaggle competition In my training data 1022 rows and 81 variables. I run this code: hyper_grid <- expand.grid( shrinkage = c(.01, .1, .3), interaction.depth = c(1, 3, 5), …

r gbm

asked Sep 03 '20 at 08:10

Ekaterina

-1

votes

1 answer

Getting probability values for random forest and Gradient Boosting in python

I have been learning about classification techniques and studied about random forest, gradient boosting etc.Based on some help from codes available online,i tried to write code in python3 for random forest and GBM. My objective is to get the…

python machine-learning data-science random-forest gbm

asked Jun 06 '20 at 13:22

Krish Basu

-1

votes

1 answer

Poor h2o GBM Classification Performance in a balanced binomial response

In a fairly balanced binomial classification response problem, I am observing unusual level of error in h2o.gbm classification for determining class 0, on train set itself. It is from a competition which is over, so interest is only towards…

h2o gbm balanced-groups

asked Apr 03 '17 at 11:15

Hari Prasad

1,751
2
15
20

-1

votes

1 answer

GBM cross validation

I'm trying to use R's gbm regression model. I want to compute the coefficient of determination (R squared) between the cross validation predicted response values and the true response values. However, the cv.fitted values of the gbm.object only…

r cross-validation gbm

asked Jun 28 '16 at 21:06

dan

6,048
10
57
125

-3

votes

1 answer

R, Caret, train(), predict(), GBM, Error: Error in model.frame.default(..): Factor has new levels

So I have a pretty good idea of what is happening but I'm wondering how to handle the error, I've seen other posts similar to this but they were not specific to Gradient Boosting Machine models. They all seem to be related to GLMs and the error…

r r-caret gbm

asked Mar 04 '16 at 23:32

Factuary

Prev 1 2 3

…