I have a .txt file that is, in theory, in CoNLL format. It looks like this:
a O
nivel B-INDC
de O
la O
columna B-ANAT
anterior I-ANAT
del I-ANAT
acetabulo I-ANAT
existiendo O
minimos B-INDC
cambios B-INDC
edematosos B-DISO
en O
la O
medular B-ANAT
(...)
I need to convert it into a list of sentences, but I can't find a way to do it. I tried the parser from the conllu library:
from conllu import parse
sentences = parse("location/train_data.txt")
but it raises this error: ParseException: Invalid line format, line must contain either tabs or two spaces.
How can I get a result like this?
["a nivel de la columna anterior del acetabulo", "existiendo minimos cambios edematosos en la medular", ...]
Thanks
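
One possible approach, as a minimal sketch rather than a definitive answer: since the error says conllu wants tabs or two spaces between columns, and the sample has a single space, the file can be read with plain Python instead. This assumes that sentences in the real file are separated by blank lines, as is conventional for CoNLL data; the excerpt above shows no blank lines, so that boundary is an assumption, as are the function name conll_to_sentences and the UTF-8 encoding in the commented usage.

```python
from typing import Iterable, List


def conll_to_sentences(lines: Iterable[str]) -> List[str]:
    """Join the tokens of 'token tag' lines into one string per sentence.

    Assumption: a blank line marks the end of a sentence.
    """
    sentences: List[str] = []
    current: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            # Blank line: close the current sentence, if any.
            if current:
                sentences.append(" ".join(current))
                current = []
            continue
        # The first whitespace-separated field is the token; the tag is ignored.
        current.append(line.split()[0])
    # Flush the last sentence when the input does not end with a blank line.
    if current:
        sentences.append(" ".join(current))
    return sentences


# Tokens taken from the question, with an assumed blank line between sentences.
sample = """a O
nivel B-INDC
de O
la O
columna B-ANAT
anterior I-ANAT
del I-ANAT
acetabulo I-ANAT

existiendo O
minimos B-INDC
cambios B-INDC
edematosos B-DISO
en O
la O
medular B-ANAT
"""

print(conll_to_sentences(sample.splitlines()))

# Reading from the file in the question (encoding is an assumption):
# with open("location/train_data.txt", encoding="utf-8") as f:
#     sentences = conll_to_sentences(f)
```

If the file has no blank lines at all, this returns a single long sentence, and the boundaries would have to come from some other rule that the data does not show.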