I am working with the python pandas_dedupe package, specifically with pandas_dedupe.dedupe_dataframe.
I have trained the dedupe_dataframe module via the interactive prompts. But now I need to retrain the dedupe_dataframe module. How can I erase the training set and start from scratch?
I have tried deleting the dedupe_dataframe_learned_settings and dedupe_dataframe_training.json files, but then then the python script throws an error.
I work with PyCharm as my IDE.
Any hint would be much appreciated. Thanks!