Questions tagged [xgboost]

XGBoost is a library for constructing boosted tree models in R, Python, Java, Scala, and C++. Use this tag for issues specific to the package (i.e., input/output, installation, functionality).

Before using the XGBoost tag, try to test whether your issue is related specifically to the functionality of XGBoost. Often, problems arise from the surrounding model-building environment (such as R's caret or Python's scikit-learn), the quality of the data being used, or purely statistical concerns that might belong on Cross Validated.

2788 questions
1
vote
1 answer

Setting the scalePosWeight parameter for the Spark xgBoost model in a CV grid

I am trying to tune my xgBoost model on Spark using Scala. My XGb parameter grid is as follows: val xgbParamGrid = (new ParamGridBuilder() .addGrid(xgb.maxDepth, Array(8, 16)) .addGrid(xgb.minChildWeight, Array(0.5,…
1
vote
1 answer

How to show original feature names in the feature importance plot?

I created XGBoost model as follows: y = XY.DELAY_MIN X = standardized_df train_X, test_X, train_y, test_y = train_test_split(X.as_matrix(), y.as_matrix(), test_size=0.25) my_imputer = preprocessing.Imputer() train_X =…
ScalaBoy
  • 3,254
  • 13
  • 46
  • 84
1
vote
0 answers

Python import leads to [WinError 193] %1 is not a valid Win32 application, though import works in `cmd`

I have installed XGBoost on Windows 10 today, and it works well when I import it in python running in a common cmd prompt. But when I use Jupyter Notebook or the Anaconda Prompt and Python, it shows the error [WinError 193] %1 is not a valid Win32…
coffee
  • 21
  • 3
1
vote
1 answer

error of grid search xgboost in h2o (version 3.20.0.2)

I am relative new to h2o and was trying to use xgboost with grid search. I ran my stuff on edgenode with 40 cores and 26 gb memory with version 3.20.0.2 of h2o package in R and h2o. just cpu as backend. I have run gbm and randomforest without issues…
ASU_TY
  • 617
  • 2
  • 7
  • 12
1
vote
1 answer

speedml not working with py-xgboost / conda installed py-xgboost is not recognized by pip

I have install py-xgboost from anaconda using: conda install py-xgboost where it is shown up when I run conda list conda env export But from pip it doesnt show up: pip freeze Is there a way to make pip able to 'see' xg-boost that is installed…
David Tam
  • 485
  • 5
  • 13
1
vote
1 answer

Why two different AUC scores are produced when evaluated on same data and same algorithm

I am working on a classification problem whose evaluation metric in ROC AUC. So far I have tried using xgb with different parameters. Here is the function which I used to sample the data. And you can find the relevant notebook here (google…
ksai
  • 987
  • 6
  • 18
1
vote
0 answers

most efficient way to convert categorical character data to sparse matrix in R?

I'm trying to train using the xgboost algorithm. This algorithm requires that the data be numerical, and I believe even more specifically, of class dgCMatrix (I could be wrong on this last point). I have data stored in a data frame that is…
Isaac T
  • 31
  • 1
1
vote
3 answers

Improving boosting model ,reducing Root mean square error

Hi i am solving a regression problem.My data set consists of 13 features and 550068 rows.I tried different different models and found that boosting algorithms(i.e xgboost,catboost,lightgbm) are performing well on that big data set.here is the code…
1
vote
1 answer

XgBoost accuracy results differ on each run, with the same parameters. How can I make them constant?

The 'merror' and 'logloss' result from XGB multiclass classification differs by about 0.01 or 0.02 on each run, with the same parameters. Is this normal? I want 'merror' and 'logloss' to be constant when I run XGB with the same parameters so I can…
Ian Dzindo
  • 197
  • 1
  • 5
  • 12
1
vote
2 answers

Xgboost Installation error on windows .OSError: [WinError 127] The specified procedure could not be found

I am trying to install Xgboost on a windows 10 PC, I followed each and every step as per installation guide All the installation processes have been completed, but I'm getting an error while importing the library: OSError: [WinError 127] The…
1
vote
1 answer

Bazel rules_python fails to copy xgboost into runfiles directory

I have a toy python project containing the following files: requirements.txt: numpy==1.14.3 xgboost==0.71 print_numpy_version.py: from __future__ import print_function import numpy print('numpy version: %s' %…
1
vote
0 answers

Use Saved xgboost Sklearn XGBClassifier Model State for prediction of new data

Currently I have a working xgboost XGBClassifier Model trained perfectly with good accuracy. I have stored the model state (instance of the model) for new prediction on new python file by loading the state. I am unable to load the labelencoder from…
Rahul
  • 21
  • 3
1
vote
2 answers

Running xgboost on Skylake CPU is significantly slower

We're compiling xgboost v0.7 from source on a vanilla Ubuntu docker image. This image is being ran on our EC2 instances in a time critical setting. Recently we've tried the new EC2 c5 instance type, that is supposed to be Intel Skylake gen CPUs.…
Re'em
  • 230
  • 2
  • 11
1
vote
0 answers

"out of memory" error on GPU support on XGBoost

[20:44:18] Allocated 994MB on [0] GeForce MX150, 670MB remaining. [20:44:19] Allocated 151MB on [0] GeForce MX150, 517MB remaining. [20:44:19] Allocated 37MB on [0] GeForce MX150, 475MB remaining. Traceback (most recent call last): File…
1
vote
2 answers

Multiple scoring metrics with sklearn xgboost gridsearchcv

How do I run a grid search with sklearn xgboost and get back various metrics, ideally at the F1 threshold value? See my code below...can't find what I'm doing wrong/don't understand errors.. ######################### just making up a dataset…
runningbirds
  • 6,235
  • 13
  • 55
  • 94