I have been given a set of 80 non-english words in an excel file..the first column contains the resulting word after a crude automatic segmentation has been applied to it and the second column contains the resulting word after being segmented manually. Below is a set of 3 rows of the file
Auto segmentation ......... Manually segmented
- [%D-Ik--(is$) ........... [%D-Ik]--(is$)
- [%D-Ip-t-eR]-(u$) .... [%D-I]-[pt-eR]-(u$)
- [%D-Om-(a$) ........... [%D-Om]-(a$)
My question is: is there a way with which I can train a model with this set of examples in order to segment new words (that start from d) automatically?