When viewing the top predicted tokens in masked language modelling (MLM), is it possible to use top_k with k=len(vocab)?
So far, I have used the following line of code:
mask_filler("The capital of [MASK] is Paris", top_k=5)
Would it be possible to pass k=len(vocab) in the line of code above, so that I can see the prediction for every token in my vocabulary?
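To make the question concrete, here is a minimal sketch of what I have in mind. It assumes mask_filler is a Hugging Face fill-mask pipeline that exposes its model as mask_filler.model, and that the model's config.vocab_size matches the number of scores per masked position. The helper name predict_over_full_vocab is hypothetical, not part of any library.

```python
def predict_over_full_vocab(mask_filler, text):
    """Ask the fill-mask pipeline for one prediction per vocabulary entry.

    Assumption: mask_filler.model.config.vocab_size is the size of the
    model's output layer, so top_k never exceeds the number of scores.
    """
    vocab_size = mask_filler.model.config.vocab_size
    # top_k is the same keyword argument used in the original call
    return mask_filler(text, top_k=vocab_size)


# Hypothetical usage (needs the transformers package and a downloaded model):
# from transformers import pipeline
# mask_filler = pipeline("fill-mask", model="bert-base-uncased")
# predictions = predict_over_full_vocab(mask_filler, "The capital of [MASK] is Paris")
# print(len(predictions))
```

If this works as I expect, the result would be a list with one entry per vocabulary token, sorted by score.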