Questions tagged [xgboost]

XGBoost is a library for constructing boosted tree models in R, Python, Java, Scala, and C++. Use this tag for issues specific to the package (i.e., input/output, installation, functionality).

Before using the XGBoost tag, try to test whether your issue is related specifically to the functionality of XGBoost. Often, problems arise from the surrounding model-building environment (such as R's caret or Python's scikit-learn), the quality of the data being used, or purely statistical concerns that might belong on Cross Validated.

2788 questions
1
vote
0 answers

How Does Everyone Deal with Probabilities from XGBoost Scored Data? Scala

Training xgboost like so then getting back a set of response and probabilities. The probabilities come back as a vector: %scala import ml.dmlc.xgboost4j.scala.spark.{DataUtils, XGBoost} val dataset = sqlContext.table("train_set") val paramMap…
1
vote
0 answers

Number of observations in a node in XGBoost

I understand how the cover is calculated in XGBoost, the sum hessian at that node. For the root node of tree 1 for binary logistic, it becomes n(.5)(1-.5) with base score as 0.5. The cover at root node for tree 1 is sum(p(1-p)) where p is the…
Kingstat
  • 11
  • 3
1
vote
2 answers

R Xgboost validation error as stopping metric

I am using a train and validation dataset on an xgboost binary classification model. params5 <- list(booster = "gbtree", objective = "binary:logistic", eta=0.0001, gamma=0.5, max_depth=15, min_child_weight=1, subsample=0.6, …
1
vote
0 answers

h2o.xgboost is throwing null pointer exception

I am trying to run h2o.xgboost() in R and was able to use that successfully in 3.14.0.3 version. But, I recently updated to 3.18.0.8 version and I am getting below error. I tried lot of things but was not able to find reason. Any help will be…
Rushabh Patel
  • 2,672
  • 13
  • 34
1
vote
0 answers

GridSearch with XGBoost producing Depreciation error on infinite loop

I am trying to do a hyperparameter tuning using GridSearchCV on XGBoost.But, I'm getting the following error. /usr/local/lib/python3.6/dist-packages/sklearn/preprocessing/label.py:151: DeprecationWarning: The truth value of an empty array is…
1
vote
0 answers

load XGBoost model in django framework

I´m trying to load an xgboost model from a FileField of a Django model entity. It is not working. def do(execution): dataFile = execution.dataFile np = genfromtxt(dataFile, delimiter=',') import xgboost as xgb bst =…
1
vote
1 answer

Decision_function for XGBoost in SKLearn wrapper

I get different results for model.predict_proba(X)[:,0] compared to model.decision_function(X)for a regular Grad Boost Decision Tree classifier in SKLearn so I know that that is not the same. I want the scores of the model. To plot ROC curves etc.…
user7867665
  • 852
  • 7
  • 25
1
vote
1 answer

xgb.create.features in xgboost in R returns exception

In trying to apply the xgb.create.features(model, data, ...) function included in the xgboost package in R and covered in the User Manual, I am running into an exception I do not understand. Here is a reproducible…
user8270077
  • 4,621
  • 17
  • 75
  • 140
1
vote
1 answer

How to add weight vector to xgboost when calling via pandas_ml

In xgboost you can add a weight matrix to the data matrix (fourth argument of xgboost.DMatrix): http://xgboost.readthedocs.io/en/latest/python/python_api.html#module-xgboost.sklearn How can I pass this weight matrix when calling xgboost bia…
Bazman
  • 2,058
  • 9
  • 45
  • 65
1
vote
1 answer

Issue with XGboost tuning with Hyperopt- TypeError: 'generator' object has no attribute '__getitem__'

I am testing hyperoptim for parameter tuning of XGboost. I am kind of replicating the code from here- https://www.kaggle.com/eikedehling/tune-and-compare-xgb-lightgbm-rf-with-hyperopt I am using Python 3 and getting the following error for the code…
Lesnar
  • 501
  • 3
  • 16
1
vote
0 answers

How can I use XGboost in foreach and doParallel in R?

I'm having a problem while trying to run xgboost inside a foreach with doParallel, I'm using a AMD Naples with 32 cores and 64 threads. The run gets stuck with this part of code, I would like to have the results of the last line of the block inside…
igormr
  • 21
  • 3
1
vote
0 answers

XGBoost Feature Importance Mismatch

I get different values of the importances when I use model.feature_importances_ versus xgb.plot_importance(model). Moreover, the numpy array feature_importances do not directly correspond to the indexes that are returned from the plot_importance…
NLR
  • 1,714
  • 2
  • 11
  • 21
1
vote
2 answers

Xgboost. obtain the booster from the best iteration

I am using Xgboost.train to train my model; however, i am not sure how to obtain the booster from the best iteration instead of the booster from the last iteration. …
Paul Wu
  • 11
  • 1
  • 3
1
vote
1 answer

XGBOOST install from windows binary fails

guys I'm trying to install gpu enabled version xgboost, and I followed exact step from [this instructions].(http://www.picnet.com.au/blogs/guido/2016/09/22/xgboost-windows-x64-binaries-for-download/) git clone https://github.com/dmlc/xgboost.git…
Dirk Paul
  • 129
  • 13
1
vote
1 answer

Going through the xgboostExplainer package: running into errors from github page

I am currently trying to work with the new xgboostExplainer package. I am following the githib page here https://github.com/AppliedDataSciencePartners/xgboostExplainer/blob/master/R/explainPredictions.R on line 34, the xgboost model is…
user113156
  • 6,761
  • 5
  • 35
  • 81
1 2 3
99
100