For example, if I have the words MKIK or "牛逼" (which is artificially created) how can we tell neural networks (transformer model) to keep the same output?
The problem is with using the transformer model on fairseq.
I found fairseq has --replace-unk
parameters, but it doesn't seem to work on transformer model or it has a bug