I would like to add a new language to the Stanford Dependency Parser, but cannot for the life of me figure out how.
In what format should training data be? How do I generate new language files?
I would like to add a new language to the Stanford Dependency Parser, but cannot for the life of me figure out how.
In what format should training data be? How do I generate new language files?
The neural net dependency parser takes in CoNLL-X format data.
There is a description of the format in this paper: