I'm trying to solve a multiclass classification problem with BERT using the ktrain wrapper. Since I have a large amount of data, I would like to know whether it is possible to parallelize the learner.fit_onecycle(3e-4, 5) call or the autofit() function.
I am thinking of something like sklearn, where you can set the number of parallel jobs with the n_jobs parameter.
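To illustrate the kind of option I mean, here is a minimal sketch of the scikit-learn pattern. The synthetic dataset and the random forest are assumptions made up for the example and are not my actual BERT/ktrain setup:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic 3-class dataset, used only for illustration (not my real data).
X, y = make_classification(
    n_samples=200,
    n_features=10,
    n_informative=5,
    n_classes=3,
    random_state=0,
)

# n_jobs=-1 asks scikit-learn to use all available CPU cores
# when fitting the trees and when predicting.
clf = RandomForestClassifier(n_estimators=50, n_jobs=-1, random_state=0)
clf.fit(X, y)

preds = clf.predict(X)
print(preds.shape)
```

I am looking for an equivalent way to spread the ktrain training step across several workers or devices.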
Can you help me? Thanks.