I am working for some string matching problems and use fuzzywuzzy module to get score.
My targeted data is around 67K and reference data is almost 4M, I created loop and one iteration is taking around +- 19minutes. Is there any way to make my loop run faster?
%%timeit
df11['NEW'] = ""
for i in range(0, 4):
df11['NEW'] = process.extractOne(df11['Desc 1'][i], df['Description 2'])
df11.head()