CatBoost is an open-source gradient boosting on decision trees library with categorical features support out of the box for Python, R
Questions tagged [catboost]
344 questions
6
votes
3 answers
Catboost default hyperparameters
How do I return all the hyperparameters of a CatBoost model?
NOTE: I do not think this is a dup of Print CatBoost hyperparameters since that question/answer doesn't address my need.
For example, with sklearn I can do:
rf =…

ADJ
- 4,892
- 10
- 50
- 83
6
votes
1 answer
Bayesian Optimisation applied in CatBoost
This is my attempt at applying BayesSearch in CatBoost:
from catboost import CatBoostClassifier
from skopt import BayesSearchCV
from sklearn.model_selection import StratifiedKFold
# Classifier
bayes_cv_tuner = BayesSearchCV(
estimator =…

prp
- 914
- 1
- 9
- 24
5
votes
3 answers
TypeError: __init__() got an unexpected keyword argument 'early_stopping_rounds' for CatBoost in Python
I get this error when I try to run a CatBoost algorithm. It is one of the parameters so I don't understand why this error occurs when I comment out early_stopping_rounds I get the same error for cat_features parameter
from catboost import…

Mine
- 831
- 1
- 8
- 27
5
votes
4 answers
Why does pip install not work for catboost?
I have to install catboost but can not make it by pip install catboost.
There is not catboost library in Anaconda, so pip in the one way.
The error message is:
Could not find a version that satisfies the requirement catboost
No…

Lumos
- 570
- 1
- 11
- 24
5
votes
3 answers
CatBoost precision imbalanced classes
I use a CatBoostClassifier and my classes are highly imbalanced. I applied a scale_pos_weight parameter to account for that. While training with an evaluation dataset (test) CatBoost shows a high precision on test. However, when I make predictions…

Alexey Nikolaev
- 51
- 1
- 2
5
votes
1 answer
What is the intuition behind symmetric trees in catboost algorithm?
I have been going through the catboost algorithm and it is hard for me to see the point of using symmetric trees. On this regard, i found in their github:
An important part of the algorithm is that it uses symmetric trees and builds them level by…

guillermo barquero
- 51
- 3
5
votes
3 answers
Is there way to save trained model after overfitting occurs in CatBoost?
I am using CatBoostRegressor in Python version of the Catboost library.
According to documentation, it's possible to use overfitting detector, which I am doing, like this:
model = CatBoostRegressor(iterations=iters, learning_rate=0.03, depth=depth,…

Mysterion
- 9,050
- 3
- 30
- 52
5
votes
1 answer
Example how to use catboost with the time series data
In the introduction/promo video (https://www.youtube.com/watch?v=s8Q_orF4tcI) you have mentioned that Catboost can analyse the time series historical data for weather forecasts.
But I was not able to find anything like this in tutorials:…

Alex Zaporozhets
- 193
- 3
- 7
4
votes
1 answer
Fitting linear function at leaves of a CatBoost model
Is there an equivalent to the linear_tree function in LightGBM in the CatBoost library? I would like to use a linear function at the leaves instead of a constant.

user308827
- 21,227
- 87
- 254
- 417
4
votes
1 answer
Difference in model feature importance and SHAP summary plot
I have been playing around the toy dataset to understand more about shap library and usage. I found this issue that the feature importances from the catboost regressor model is different than the features importances from the summary_plot in the…

Regressor
- 1,843
- 4
- 27
- 67
4
votes
0 answers
ImportError: dynamic module does not define module export function (PyInit__catboost)
I am getting the following error:
ImportError: dynamic module does not define module export function (PyInit__catboost)
I used
""../../../ya" make -r -DUSE_ARCADIA_CUDA_HOST_COMPILER=no --host-platform-flag USE_ARCADIA_CUDA_HOST_COMPILER=no…

王普聪
- 41
- 2
4
votes
0 answers
NotFittedError: All estimators failed to fit while using CatBoostClassifier with GridSearchCV
I am trying to use GridSearchCV with CatBoostClassifier for multiclass (3), and am getting error. The code seems to work OK in this Kaggle notebook. The estimator also works successfully without GridSearchCV.
Here is the code and error:
model =…

mparikhdspython
- 41
- 1
4
votes
1 answer
How to improve the catboostregressor?
I am working on a data science regression problem with around 90,000 rows on train set and 8500 on test set. There are 9 categorical columns and no missing data. for this case, I am applied a catboostregressor which given me the pretty good…

Sekhar
- 627
- 4
- 14
- 34
4
votes
1 answer
How to get catboost visualization to show the categories
Consider the following data:
import pandas as pd
y_train = pd.DataFrame({0: {14194: 'Fake', 13891: 'Fake', 13247: 'Fake', 11236: 'Fake', 2716: 'Real', 2705: 'Real', 16133: 'Fake', 7652: 'Real', 7725: 'Real', 16183: 'Fake'}})
X_train =…

Simd
- 19,447
- 42
- 136
- 271
4
votes
1 answer
Catboost plot_tree understanding
When plotting a tree from catboost, it shows val in leaves; what do these values represent?
I can't find the answer in their official tutorial on plotting, nor are there any such questions answered anywhere that I could find. Like:
LightGBM…

Nauman Naeem
- 408
- 3
- 12