Is there any option to add custom punctuation marks, which aren't included in the default punctuation rules? (https://github.com/explosion/spaCy/blob/develop/spacy/lang/de/punctuation.py)
I am using spaCy's Matcher class (https://spacy.io/usage/rule-based-matching) and the attribute "IS_PUNCT" to remove punctuation from my text.
from spacy.matcher import Matcher
# instantiate Matcher
matcher = Matcher(nlp.vocab)
# define pattern
pattern = [{"IS_PUNCT": False}]
# add pattern to matcher
matcher.add("Cleaning", None, pattern)
I would like to customize the punctuation rules to be able to remove "|" from my texts with the Matcher.