Looking at the pbt_example, I see that the actor's accumulated accuracy
is not reset to zero in reset_config
, only the new hyperparameters are set (including changing lr
in particular). While implementing my own actor, which as state has a running average of the current metric that should be minimized by the hyperparameter sweep, I was wondering if I should reset this running average to zero or not in reset_config
? The implications of either resetting this state or not are not clear from the documentation and from browsing the source files of ray.tune
.
Would appreciate some clarification on reuse_actors
and reset_config
, in particular. The documentation is unfortunately a bit vague in this respect. Otherwise great library and easy to get started!