Stanford dependency parser training data format

Question

I would like to add a new language to the Stanford Dependency Parser, but cannot for the life of me figure out how.

In what format should training data be? How do I generate new language files?

score 0 · Accepted Answer · answered Apr 06 '17 at 02:08

0

The neural net dependency parser takes in CoNLL-X format data.

There is a description of the format in this paper:

answered Apr 06 '17 at 02:08

StanfordNLPHelp

Thanks, got it to work after some hacking around. Paper helped beautifully. – player.mdl Apr 30 '17 at 14:55

1 Answers1