I want to ignore annotation characters when parsing text on syntaxnet.
For example, in the case below, I want to ignore <X>
and </X>
annotation characters.
<PERSON>Michael Jordan</PERSON> is a professor at <LOC>Berkeley</LOC>.
So, I expect next output.
_ <PERSON> _ ...
1 Michael _ ...
2 Jordan _ ...
_ </PERSON> _ ...
3 is _ ...
...
Isn't SyntaxNet has such kind of features?