Questions tagged [catboost]

34 questions
24
votes
1 answer

LightGBM vs XGBoost vs CatBoost

I've seen that in Kaggle competitions people are using LightGBM where they used to use XGBoost. My question is: when would you prefer XGBoost over LightGBM? And what about CatBoost?
David Masip
  • 6,136
  • 2
  • 28
  • 62
6
votes
1 answer

How to obtain SHAP values for a CatBoost model in R?

I'm asked to create a SHAP analysis in R but I cannot find how to obtain it for a CatBoost model. I can get the SHAP values of an XGBoost model with shap_values <- shap.values(xgb_model = model, X_train = train_X), but not for CatBoost. Here is…
user100740
  • 91
  • 2
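
In the Python catboost package, SHAP values are available through get_feature_importance with type='ShapValues'; the R package appears to expose the analogous catboost.get_feature_importance(model, pool, type = 'ShapValues'). A minimal sketch on toy data, using the Python API:

    from catboost import CatBoostClassifier, Pool
    import numpy as np

    # toy stand-ins for train_X / labels
    X = np.random.rand(100, 5)
    y = np.random.randint(0, 2, 100)

    model = CatBoostClassifier(iterations=50, verbose=False)
    model.fit(X, y)

    # Returns an (n_samples, n_features + 1) array: the last column is the
    # expected value (base prediction), the others are per-feature SHAP values.
    shap_values = model.get_feature_importance(Pool(X, y), type='ShapValues')
    print(shap_values.shape)  # (100, 6)
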
4
votes
1 answer

Are linear models better when dealing with too many features? If so, why?

I had to build a classification model in order to predict what the user rating would be from his/her review. (I was dealing with this dataset: Trip Advisor Hotel Reviews.) After some preprocessing, I compared the results of a Logistic…
3
votes
0 answers

What is the concept behind the categorical encoding used in the CatBoost benchmark problems?

I'm working through CatBoost quality benchmark problems (here). I'm particularly intrigued by the methodology adopted to convert categorical features to numerical values as described in the comparison_description.pdf (here). What is the reasoning…
PPR
  • 171
  • 1
  • 5
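
For context, the scheme described in the CatBoost papers is ordered target statistics: each categorical value is replaced by a smoothed mean of the target computed only over examples that precede it in a random permutation, so an example never sees its own label. A rough sketch of the idea (the prior p and smoothing weight a are illustrative):

    import numpy as np

    def ordered_target_stats(cats, targets, p=0.5, a=1.0, seed=0):
        """Encode cats using only previously visited rows in a random permutation."""
        perm = np.random.default_rng(seed).permutation(len(cats))
        sums, counts = {}, {}
        encoded = np.empty(len(cats))
        for i in perm:
            c = cats[i]
            # smoothed target mean over the rows of this category seen so far
            encoded[i] = (sums.get(c, 0.0) + a * p) / (counts.get(c, 0) + a)
            sums[c] = sums.get(c, 0.0) + targets[i]
            counts[c] = counts.get(c, 0) + 1
        return encoded

    cats = np.array(['a', 'b', 'a', 'a', 'b'])
    y = np.array([1, 0, 1, 0, 1])
    print(ordered_target_stats(cats, y))
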
2
votes
1 answer

CatBoost multiclass classification evaluation metric: Kappa & WKappa

I am working on an unbalanced classification problem and I want to use Kappa as my evaluation metric. Considering the classifier accepts weights (which I have given it), should I still be using weighted kappa or just use the standard kappa? I am not…
Musa
  • 31
  • 2
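
CatBoost accepts both 'Kappa' and 'WKappa' as eval_metric names. Note that class weights enter the training loss, while the metric is computed on the resulting predictions either way, so the two choices are largely independent. A minimal sketch (the weights and data are illustrative):

    from catboost import CatBoostClassifier
    import numpy as np

    X = np.random.rand(300, 4)
    y = np.random.randint(0, 3, 300)          # three classes

    model = CatBoostClassifier(
        loss_function='MultiClass',
        eval_metric='WKappa',                 # or 'Kappa'
        class_weights=[1.0, 2.0, 5.0],        # per-class weights in the training loss
        iterations=100,
        verbose=False,
    )
    model.fit(X, y)
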
2
votes
0 answers

Tuning the learning rate parameter for GBDT models

I've always been taught that decreasing the learning rate parameter in GBDT models such as XGBoost, LightGBM and CatBoost will improve the out-of-sample performance, assuming the number of iterations is increased accordingly and all else…
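
The recipe this refers to is usually: lower the learning rate, raise the iteration budget, and let early stopping on a validation set pick the effective number of trees. A sketch with CatBoost on synthetic data (all parameter values illustrative):

    from catboost import CatBoostRegressor
    from sklearn.model_selection import train_test_split
    import numpy as np

    X = np.random.rand(1000, 5)
    y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + 0.1 * np.random.randn(1000)
    X_tr, X_va, y_tr, y_va = train_test_split(X, y, random_state=0)

    model = CatBoostRegressor(
        learning_rate=0.03,          # smaller step size...
        iterations=5000,             # ...compensated by a larger budget
        early_stopping_rounds=100,   # stop once validation loss stalls
        verbose=False,
    )
    model.fit(X_tr, y_tr, eval_set=(X_va, y_va))
    print(model.get_best_iteration())
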
2
votes
0 answers

How to do grid search for CatBoost with categorical_cols

I know it's easy to do a grid search for a simple CatBoost model, as shown here: https://medium.com/aiplusoau/hyperparameter-tuning-a5fe69d2a6c7, by running something like cbc = CatBoostRegressor() #create the grid grid = {'max_depth':…
Ian
  • 21
  • 3
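
Since cat_features is a constructor argument, it can be fixed on the estimator while the grid sweeps only the hyperparameters. A sketch with hypothetical column names; CatBoost's own model.grid_search method is an alternative route:

    from catboost import CatBoostRegressor
    from sklearn.model_selection import GridSearchCV
    import numpy as np
    import pandas as pd

    df = pd.DataFrame({
        'city': np.random.choice(['NY', 'LA', 'SF'], 200),   # categorical column
        'size': np.random.rand(200),
        'price': 100 * np.random.rand(200),
    })
    X, y = df[['city', 'size']], df['price']

    cbc = CatBoostRegressor(cat_features=['city'], verbose=False)
    grid = {'max_depth': [4, 6, 8], 'learning_rate': [0.03, 0.1]}
    search = GridSearchCV(cbc, grid, cv=3)
    search.fit(X, y)
    print(search.best_params_)
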
2
votes
2 answers

RandomizedSearchCV(n_iter=10) doesn't stop after training 10 models

I am using RandomizedSearchCV for hyperparameter optimization. When I run the model, it shows the scores for each model training. The problem is, it trains far more than 10 models, when in fact I expect it to train just 10 models by specifying…
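
This is expected behaviour: n_iter counts sampled parameter settings, not fits. With cv=5 you get n_iter * 5 fits, plus one final refit on the whole training set. A sketch illustrating the arithmetic (parameter ranges are illustrative):

    from catboost import CatBoostClassifier
    from sklearn.datasets import make_classification
    from sklearn.model_selection import RandomizedSearchCV

    X, y = make_classification(n_samples=300, random_state=0)
    search = RandomizedSearchCV(
        CatBoostClassifier(verbose=False),
        param_distributions={
            'depth': [4, 6, 8],
            'learning_rate': [0.03, 0.1],
            'iterations': [100, 200],
        },
        n_iter=10,   # 10 candidate settings x 5 folds = 50 fits, plus the refit
        cv=5,
        random_state=0,
    )
    search.fit(X, y)
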
2
votes
0 answers

CatBoost: Categorical Feature Encoding

I would like to understand all the methods available in CatBoost for encoding categorical features. Unfortunately, the published articles by Yandex ("CatBoost: gradient boosting with categorical features support" and "CatBoost: unbiased boosting…
calpyte
  • 121
  • 2
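
For reference, the main knobs the library exposes are one_hot_max_size (one-hot for low-cardinality features) and the CTR settings (simple_ctr, combinations_ctr) controlling the target statistics. A sketch with a few of the documented CTR types (the specific choices are illustrative):

    from catboost import CatBoostClassifier
    import numpy as np
    import pandas as pd

    df = pd.DataFrame({
        'color': np.random.choice(['red', 'green', 'blue'], 200),
        'x': np.random.rand(200),
        'y': np.random.randint(0, 2, 200),
    })

    model = CatBoostClassifier(
        one_hot_max_size=10,                 # one-hot if <= 10 distinct values
        simple_ctr=['Borders', 'Counter'],   # CTR types for single features
        combinations_ctr=['Borders'],        # CTR types for feature combinations
        iterations=50,
        verbose=False,
    )
    model.fit(df[['color', 'x']], df['y'], cat_features=['color'])
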
2
votes
1 answer

How do we target-encode categorical features in multiclass classification problems?

Say I have a multiclass problem with a dataset like this:

    user_id | price | target
    --------+-------+--------
          1 |    30 | apple
          1 |    20 | samsung
          2 |    32 | samsung
          2 |    40 | huawei
        ... |   ... | ...

where I have a lot of users, i.e. One Hot…
CutePoison
  • 520
  • 3
  • 10
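
A common approach is one-vs-rest encoding: one new column per class, holding the frequency of that class within the category (in practice smoothed and computed out-of-fold to avoid target leakage). A bare-bones pandas sketch on the data above, without smoothing or fold splitting:

    import pandas as pd

    df = pd.DataFrame({
        'user_id': [1, 1, 2, 2],
        'price':   [30, 20, 32, 40],
        'target':  ['apple', 'samsung', 'samsung', 'huawei'],
    })

    for cls in df['target'].unique():
        indicator = (df['target'] == cls).astype(float)
        # per-user mean of the indicator = P(target == cls | user_id)
        df[f'user_enc_{cls}'] = indicator.groupby(df['user_id']).transform('mean')

    print(df)
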
2
votes
1 answer

How to tell CatBoost which feature is categorical?

I am excited to learn that CatBoost can handle categorical features by itself. One of my features, Department ID, is categorical. However, it looks numeric, since the values are like 1001, 1002, ..., 1218. Those numbers are just IDs of the…
Fred Chang
  • 95
  • 1
  • 2
  • 6
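
The standard fix is to declare the column through cat_features (by name or index); casting the IDs to strings also makes the intent explicit, since CatBoost requires categorical columns to hold integers or strings. A sketch with a hypothetical department_id column:

    from catboost import CatBoostClassifier
    import numpy as np
    import pandas as pd

    df = pd.DataFrame({
        'department_id': np.random.choice([1001, 1002, 1218], 100),
        'hours': np.random.rand(100),
        'label': np.random.randint(0, 2, 100),
    })
    df['department_id'] = df['department_id'].astype(str)  # IDs, not magnitudes

    model = CatBoostClassifier(cat_features=['department_id'], verbose=False)
    model.fit(df[['department_id', 'hours']], df['label'])
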
1
vote
1 answer

Model Dump Parser (like XGBFI) for LightGBM and CatBoost

Currently my employer has multiple GLMs in a live environment. I am interested in identifying new features and interactions to enhance the accuracy of these GLMs; for now I am limited to the GLM structure, so simply deploying a solution which…
bradS
  • 1,695
  • 9
  • 20
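
Of the two libraries, CatBoost covers part of this natively: get_feature_importance(type='Interaction') returns pairwise interaction scores, which overlaps with what XGBFI extracts from XGBoost model dumps. A minimal sketch:

    from catboost import CatBoostRegressor, Pool
    import numpy as np

    X = np.random.rand(200, 4)
    y = X[:, 0] * X[:, 1] + X[:, 2] + 0.01 * np.random.randn(200)

    model = CatBoostRegressor(iterations=100, verbose=False).fit(X, y)

    # rows of (first_feature_index, second_feature_index, score)
    interactions = model.get_feature_importance(Pool(X, y), type='Interaction')
    print(interactions[:5])
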
1
vote
1 answer

Does Gradient Boosting perform n-ary splits where n > 2?

Do algorithms such as GBM, XGBoost, CatBoost, and LightGBM ever perform more than two splits at a node in their decision trees? Can a node be split into 3 or more branches instead of merely binary ones? Can more than one feature be used in…
1
vote
0 answers

Feature Selection before modeling with Boosting Trees

I have read in some papers that the subset of features chosen for a boosted tree algorithm can make a big difference in performance, so I've been trying RFE, Boruta, variable clustering, correlation, WOE & IV, and chi-square. Let's say I have a…
Mamoud
  • 11
  • 2
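
As a concrete instance of one method from that list, a sketch of recursive feature elimination (RFECV) with a boosted-tree estimator supplying the importances; Boruta and WOE/IV live in separate packages:

    from sklearn.datasets import make_classification
    from sklearn.ensemble import GradientBoostingClassifier
    from sklearn.feature_selection import RFECV

    X, y = make_classification(n_samples=300, n_features=20,
                               n_informative=5, random_state=0)

    # drop 2 features per round, keep the cross-validated best subset
    selector = RFECV(GradientBoostingClassifier(random_state=0), step=2, cv=3)
    selector.fit(X, y)
    print(selector.n_features_, selector.support_)
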
1
vote
2 answers

Does gradient boosting error always decrease faster and reach lower values on training data?

I am building another XGBoost model and I'm really trying not to overfit the data. I split my data into train and test sets and fit the model with early stopping based on the test-set error, which results in the following loss plot: I'd say this is…
Xaume
  • 212
  • 3
  • 14
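
A sketch of how such a loss plot is typically produced, assuming a recent xgboost version (>= 1.6, where early_stopping_rounds moved to the constructor): track both train and test error via eval_set; the train curve usually keeps falling after the test curve has flattened.

    import numpy as np
    import xgboost as xgb
    from sklearn.model_selection import train_test_split

    X = np.random.rand(1000, 5)
    y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + 0.1 * np.random.randn(1000)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    model = xgb.XGBRegressor(n_estimators=500, early_stopping_rounds=20,
                             eval_metric='rmse')
    model.fit(X_tr, y_tr, eval_set=[(X_tr, y_tr), (X_te, y_te)], verbose=False)

    history = model.evals_result()   # {'validation_0': {'rmse': [...]}, ...}
    print(model.best_iteration, history['validation_1']['rmse'][-1])
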