Questions tagged [lightgbm]

LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed and efficient with the following advantages: ... Support of parallel and GPU learning. Capable of handling large-scale data.

LightGBM is a high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.

Resources:

676 questions
-1
votes
1 answer

Unexpected keyword argument when running Lightgbm on GPU

When running the code below: import lightgbm as lgb params = {'num_leaves': 38, 'min_data_in_leaf': 50, 'objective': 'regression', 'max_depth': -1, 'learning_rate': 0.1, …
freak11
  • 341
  • 3
  • 16
-1
votes
1 answer

Machine learning - does the independent variable data need to be balanced as well?

I know that we need to have balanced data in y to have a better model. However, I'm wondering whether we need to have balanced data in independent variable as well. In the following dataframe, X3 is a category type independent variable. X1 X2 …
-1
votes
1 answer

When dealing with the absence of libomp Error when import LightGBM, an error occurred on Brew installing

on Mac OS I was trying command as following on Terminal: $brew install libomp **Error**: cmake: /usr/local/Homebrew/Library/Taps/homebrew/homebrew-core/Formula/cmake.rb:1: syntax error, unexpected << <<<<<<<…
xjw
  • 1
-1
votes
1 answer

Saving and Loading lightgbm Dataset

I am trying to save and load lightgbm datasets using the save_binary command. The following seems to work for the saving part: import numpy as import lightgbm as lgb data = lgb.Dataset(np.array([[1,2],[12,2]])) data.save_binary('test.bin') But so…
DvdG
  • 724
  • 1
  • 8
  • 15
-1
votes
1 answer

Signal 11: SIGSEGV Error when loading (or even modeling) LightGBM on Mac

Dearest S/O community, I can't seem to pinpoint the reason my python kernel dies throwing `Process finished with exit code 139 (interrupted by signal 11: SIGSEGV)`, be it when I attempt to load an LGBM model from a pickle file or train…
Subbu
  • 37
  • 5
-1
votes
1 answer

Why does tuple come out from LGBM?

from lightgbm import LGBMClassifier model = LGBMClassifier() model.fit(X_train, y_train) This is my code. I got this error in model.fit(): AttributeError Traceback (most recent call last) …
OZOOOH
  • 31
  • 5
-1
votes
1 answer

Microsoft.ML porting away from .netcore to .net frameworks 4.7.2

After moving the working code for the githubticketclassifier, a decision tree using the microsoft.ml lightGbm decision tree trainer I had great results and made my own dataset. I then converted it to .net 4.7.2 frameworks in a console environment…
-1
votes
2 answers

How to choose the best values for the param_distributions and fit_params of a cross-validation to fit a model?

I'm implementing a LightGBM Classifier (LGBMClassifier) whose hyperparameters are chosen by a RandomizedSearchCV cross-validation (sklearn library). I have used some arbitrary values for either the param_distributions and fit_params, but how should…
-1
votes
1 answer

Probability predictions from a saved LGBMClassifier model

I trained a LGBMClassifier model and saved it to a file thus: clf = lgb.LGBMClassifier( ... ) clf.fit(X_train, y_train, **fit_params) clf.booster_.save_model("model1.txt") Now what I want is to re-use the saved model for probability predictions.…
Sachin Yadav
  • 748
  • 9
  • 28
-1
votes
2 answers

idea behind xgboost/lightgbm/catboost in comparison

I'm trying to decide, which one of the following I will use in practice for regression tasks: xgboost, lightgbm or catboost (python 3). So, what are general idea behind each of them? Why should I choose one, but not another? I'm not interested in…
awqwewrqaq
  • 17
  • 4
-1
votes
1 answer

How random_state works in lightGBM algorithm?

Why is not it a unique process? Specifically, which part has randomness? I only know random forest. random forest uses bootstrap data, so I can understand its randomness.
user3887424
  • 1
  • 1
  • 1
-1
votes
3 answers

binary:logistic like parameter in LightGBM

I want my predictions in probabilities between 0 and 1. I already did that in xgboost but I wanna try out Lightgbm too but its outputting solid predictions(that is in integer only). I could do that in XGBoost by setting 'objective' parameter to…
vishal tewatia
  • 103
  • 1
  • 7
-2
votes
1 answer

cannot import lightgbm - error with pandas

Does anyone have a same problem and know some solutions? I recently used lightgbm(maybe few days ago), but suddenly I cannot import it. It says AttributeError: module 'pandas.core.strings' has no attribute 'StringMethods' I want to resolve this…
ILS
  • 1
  • 1
-2
votes
1 answer

Training a Neural network model on KMeans Clusters

I am classifying a client's clients. However, the data is fluid and the clusters can change every day. Running new clusters daily to update user clusters is difficult because Kmeans is inconsistent in labeling clusters. If we cluster, and then…
acacia
  • 1,375
  • 1
  • 14
  • 40
-5
votes
1 answer

lightgbm how to predict_proba?

if i use lightgbm there is two methods of using lightgbm. first method: - model=lgb.LGBMClassifier() model.fit(X,y) model.predict_proba(values) i can get predict_proba method to predict probabilities. if i use it natively import lightgbm as…
desmond
  • 1,853
  • 4
  • 21
  • 27
1 2 3
44
45