I am evaluating various named entity recognition (NER) libraries, and I'm trying out Polyglot.
Everything seems to be going well, but the instructions tell me to use this line in the command prompt:
!polyglot --lang en tokenize --input testdata/cricket.txt | polyglot --lang en ner | tail -n 20
...which should give (in the example) this output:
, O
which O
was O
equalled O
five O
days O
ago O
by O
South I-LOC
Africa I-LOC
in O
their O
victory O
over O
West I-ORG
Indies I-ORG
in O
Sydney I-LOC
. O
That's exactly the kind of output I need for my project, and it works exactly like I need it to work; however, I need to run that within my PyCharm interface, not the command line, and store the results in a pandas dataframe. How do I translate that command?