I have to process hundreds of thousands of texts. I have found that the step taking the longest is the following:
from spacy.lang.en import English
from spacy.pipeline import EntityRuler

nlp = English()
ruler = EntityRuler(nlp)
patterns = [...]
ruler.add_patterns(patterns)
nlp.add_pipe(ruler)
...
# This line takes longer than I would like
doc = nlp(whole_chat)
Granted, I have many patterns. But is there a way to speed this up? I only have the entity ruler pipe, no others.
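
One direction that might be worth trying, sketched here under assumptions rather than as a confirmed fix: build the pipeline once and stream the texts through `nlp.pipe` in batches instead of calling `nlp(...)` on each text. The patterns, the texts, and the `batch_size` value below are hypothetical placeholders, and the version check only exists because the `add_pipe` call differs between spaCy 2.x and later releases. Any actual speedup would need to be measured on the real data.

```python
import spacy
from spacy.lang.en import English

# Hypothetical patterns, for illustration only (not the real pattern list).
patterns = [
    {"label": "ORG", "pattern": "Acme Corp"},  # string -> phrase pattern
    {"label": "GPE", "pattern": [{"LOWER": "san"}, {"LOWER": "francisco"}]},  # token pattern
]

# Hypothetical stand-ins for the chat texts.
texts = [
    "Acme Corp opened an office in San Francisco.",
    "Nothing to see here.",
]

# Build the pipeline once, outside any per-text loop.
nlp = English()
if spacy.__version__.startswith("2."):
    # spaCy 2.x: components are added as objects, as in the question.
    from spacy.pipeline import EntityRuler
    ruler = EntityRuler(nlp)
    nlp.add_pipe(ruler)
else:
    # spaCy 3.x and later: components are added by their registered name.
    ruler = nlp.add_pipe("entity_ruler")
ruler.add_patterns(patterns)

# Stream the texts in batches; batch_size=1000 is an arbitrary starting point.
# Only the extracted (text, label) pairs are kept, not the Doc objects.
results = [
    [(ent.text, ent.label_) for ent in doc.ents]
    for doc in nlp.pipe(texts, batch_size=1000)
]

for text, ents in zip(texts, results):
    print(text, ents)
```

Keeping only the extracted entities, rather than every `Doc`, should also keep memory use manageable across hundreds of thousands of texts. Separately, patterns given as plain strings are matched as phrase patterns while lists of dicts are matched as token patterns, so it may be worth checking which kind dominates a large pattern list.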