I am an NLP enthusiast and I plan to write some basic NLP models for my language (Azerbaijani), which does not have good support in spaCy or NLTK. Could you suggest a roadmap for achieving this goal?
First, I am going to write a POS tagger using a Hidden Markov Model (HMM) trained on approximately 1000 tagged texts, and then use this model as a tagger when training further models. After that, I want to expose the typical attributes such as token.pos_ and token.tag_.
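To make the first step concrete, this is roughly the kind of HMM tagger I have in mind. It is only a minimal sketch in plain Python: the function names (train_hmm, viterbi) are my own placeholders, the two sentences and the PRON/NOUN/VERB tags are toy data rather than my real corpus, and I am assuming add-one smoothing for unseen words and transitions, which may not be the best choice.

```python
import math
from collections import Counter, defaultdict

START = "<s>"  # hypothetical sentence-start marker


def train_hmm(tagged_sentences):
    """Count tag transitions and word emissions from [(word, tag), ...] sentences."""
    transitions = defaultdict(Counter)  # previous tag -> counts of next tag
    emissions = defaultdict(Counter)    # tag -> counts of words
    vocab = set()
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            transitions[prev][tag] += 1
            emissions[tag][word] += 1
            vocab.add(word)
            prev = tag
    return transitions, emissions, vocab


def _log_prob(counter, key, n_outcomes):
    # Add-one (Laplace) smoothing so unseen events keep a non-zero probability.
    return math.log((counter[key] + 1) / (sum(counter.values()) + n_outcomes))


def viterbi(words, transitions, emissions, vocab):
    """Return the most probable tag sequence for a list of words."""
    if not words:
        return []
    tags = list(emissions)
    n_tags = len(tags)
    n_words = len(vocab) + 1  # one extra slot for unknown words

    # Each column maps tag -> (best log score ending in that tag, previous tag).
    columns = [{
        t: (_log_prob(transitions[START], t, n_tags)
            + _log_prob(emissions[t], words[0], n_words), None)
        for t in tags
    }]
    for word in words[1:]:
        column = {}
        for t in tags:
            score, prev = max(
                (columns[-1][p][0] + _log_prob(transitions[p], t, n_tags), p)
                for p in tags
            )
            column[t] = (score + _log_prob(emissions[t], word, n_words), prev)
        columns.append(column)

    # Backtrack from the best final tag.
    tag = max(tags, key=lambda t: columns[-1][t][0])
    path = [tag]
    for column in reversed(columns[1:]):
        tag = column[tag][1]
        path.append(tag)
    return path[::-1]


# Toy training data (illustrative only).
toy_corpus = [
    [("Mən", "PRON"), ("kitab", "NOUN"), ("oxuyuram", "VERB")],
    [("O", "PRON"), ("məktub", "NOUN"), ("yazır", "VERB")],
]
model = train_hmm(toy_corpus)
predicted = viterbi(["Mən", "məktub", "oxuyuram"], *model)
print(predicted)
```

The idea is that token.pos_ would later just return the tag that viterbi assigns to that token's position in the sentence.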
After the POS tagger, I want to write a lemmatize() function, but I have no idea how to approach it. Could you please suggest some steps for achieving this goal?