The Python API doesn't give much more information other than that the seed=
parameter is passed to numpy.random.seed
:
seed (int) – Seed used to generate the folds (passed to numpy.random.seed).
But what features of xgboost
use numpy.random.seed
?
- Running
xgboost
with all default settings still produces the same performance even when altering the seed. - I have already been able to verify
colsample_bytree
does so; different seeds yield different performance. - I have been told it is also used by
subsample
and the othercolsample_*
features, which seems plausible since any form of sampling requires randomness.
What other features of xgboost
rely on numpy.random.seed
?