I'm trying to understanding what is the -wordNgrams parameter in the fastText.
Let's take the following text as an example:
The quick brown fox jumps over the lazy dog
Now we have the context windows size of 2 at the 'brown' word, then we would have the following samples
- (brown, the)
- (brown, quick)
- (brown, fox)
- (brown, jumps)
In case we set -wordNgrans 2, would we find in our vocabulary the word 'brown_fox' ? And hence, our training samples now would be:
- (brown_fox, the)
- (brown_fox, quick)
- (brown_fox, jumps)
- (brown_fox, over)
Is that correct ?
I didn't find out there any explanation about that.