Questions tagged [fuzzywuzzy]

FuzzyWuzzy is a Python package to perform fuzzy string matching.

522 questions
0
votes
1 answer

Get the list of matching tokens from FuzzyWuzzy

I am using fuzzywuzzy token_set_ratio to match 2 strings. I want to know which tokens matched. Is there a function in fuzzywuzzy to do so? String1="this is a banana tree" String2="there is banana tree next to my house" the token_set_ratio in…
Sid
  • 552
  • 6
  • 21
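
A minimal sketch for the question above, assuming that "matching tokens" means the words the two strings share. It uses plain set intersection rather than any fuzzywuzzy call, and the helper name `common_tokens` is hypothetical. The lowercase-and-split tokenization only approximates the preprocessing fuzzywuzzy applies before scoring.

```python
def common_tokens(s1, s2):
    """Return the sorted tokens that appear in both strings."""
    # Lowercase and split on whitespace; this only approximates the
    # preprocessing fuzzywuzzy applies before scoring (assumption).
    tokens1 = set(s1.lower().split())
    tokens2 = set(s2.lower().split())
    return sorted(tokens1 & tokens2)


string1 = "this is a banana tree"
string2 = "there is banana tree next to my house"
print(common_tokens(string1, string2))  # ['banana', 'is', 'tree']
```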
0
votes
0 answers

Does anybody know if FuzzyWuzzy works inside IronPython?

Does the FuzzyWuzzy module work in IronPython? I am trying to use it inside another application that implements IronPython.
sjmurphy84
  • 33
  • 4
0
votes
1 answer

TypeError while using fuzz.ratio

I use the following simple code to calculate the fuzz.ratio() value for the values of two lists and get the error: TypeError: object of type 'float' has no len() (on the line: if (fuzz.ratio(i, j) >= 85):). The code is below: from fuzzywuzzy import…
gunel
  • 161
  • 13
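
One possible cause of this error, offered as an assumption since the full code is not shown, is that one of the lists contains a missing value stored as a float NaN. The sketch below guards against non-string values before scoring. The helper `safe_ratio` and the sample lists are hypothetical, and `difflib.SequenceMatcher` from the standard library stands in for `fuzz.ratio` so the example runs without extra packages.

```python
from difflib import SequenceMatcher


def safe_ratio(a, b):
    """Score two values from 0 to 100, returning 0 if either is not a string."""
    # Missing values often arrive as float NaN, which has no len().
    if not isinstance(a, str) or not isinstance(b, str):
        return 0
    # SequenceMatcher is a standard-library stand-in for fuzz.ratio here.
    return int(round(100 * SequenceMatcher(None, a, b).ratio()))


# Hypothetical sample data with one missing value.
list_a = ["apple", float("nan"), "banana"]
list_b = ["apple", "bananas"]
matches = [(i, j) for i in list_a for j in list_b if safe_ratio(i, j) >= 85]
print(matches)
```

The same guard can wrap a real `fuzz.ratio(i, j)` call; the point is only that non-string values are filtered out before the comparison.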
0
votes
1 answer

Is there any way to run a loop faster?

I am working on some string matching problems and use the fuzzywuzzy module to get scores. My target data is around 67K and the reference data is almost 4M. I created a loop, and one iteration takes about 19 minutes. Is there any way to make my loop…
0
votes
0 answers

FuzzyWuzzy returns 'ratio' not defined in PyCharm only

Why might I be getting a 'NameError: name 'ratio' is not defined' error when I attempt to use fuzzywuzzy in PyCharm? I have no issues using it in IDLE or Python's 32-bit app. I've reviewed similar topics of "works in idle but not pycharm"; however,…
Rhelm
  • 11
  • 7
0
votes
1 answer

FuzzyWuzzy's fuzz.ratio not behaving as expected

I am using fuzzywuzzy to do fuzzy matching and expect fuzz.ratio to work the same/yield the same results when it's used independently vs. when it is used as the 'scorer' parameter in the process module. However, it does not. I have tried testing all…
formicaman
  • 1,317
  • 3
  • 16
  • 32
0
votes
0 answers

How to optimize a for loop that matches two strings using fuzzywuzzy

I am getting the probability of a string being similar to another string in Python using the fuzzywuzzy library. Currently, I am doing this with a for loop, and the search is time consuming. Below is working code: from fuzzywuzzy import fuzz with…
0
votes
1 answer

Fuzzy String Matching With Pandas and FuzzyWuzzy, data matching: TypeError: cannot use a string pattern on a bytes-like object

I have a data file which looks like this - And I have another data file which has all the correct country names. To match the two files, I am using the code below: import pandas as pd names_array=[] ratio_array=[] def…
0
votes
1 answer

Fuzzy String Matching With Pandas and FuzzyWuzzy: KeyError: 'name'

I have a data file which looks like this - And I have another data file which has all the correct country names. To match the two files, I am using the code below: from fuzzywuzzy import process import pandas as…
0
votes
0 answers

Fuzzysearch: Looping over strings and keeping the most similar

I have a number of correctly spelled names in a list: L_name = ['Julia', 'John', 'James', 'Jay', 'Jordan']. And I have a list of results from a form where people entered their name: L_entries = ['Julie', 'John', 'Jo', 'Jamie', 'Jamy', 'James', 'Jay',…
user9092346
  • 292
  • 2
  • 11
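
A minimal sketch of one way to keep the most similar name for each entry, assuming a simple character-based similarity is acceptable. It uses `difflib.SequenceMatcher` from the standard library instead of fuzzywuzzy, the helper `best_match` is hypothetical, and only a few of the entries visible in the question are used.

```python
from difflib import SequenceMatcher

# Names taken from the question; the entries are a subset of those shown.
L_name = ["Julia", "John", "James", "Jay", "Jordan"]
L_entries = ["Julie", "John", "Jamie"]


def best_match(entry, candidates):
    """Return (candidate, score) for the candidate most similar to entry."""
    scored = [
        (name, int(round(100 * SequenceMatcher(None, entry, name).ratio())))
        for name in candidates
    ]
    # max() keeps the first candidate when several share the top score.
    return max(scored, key=lambda pair: pair[1])


for entry in L_entries:
    print(entry, "->", best_match(entry, L_name))
```

A score threshold could be added on top of this so that entries with no close candidate are left unmatched rather than forced onto the nearest name.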
0
votes
1 answer

Merge similar strings together in pandas column

I have a pandas crosstab dataframe which looks like this: This is a small sample of the whole dataframe. As you can see, sku1_entity has some strings like 4 Cheese W Verm, 4 Cheese w Verm, 4Cheese w Verm and similarly there are more such cases in the…
Harshit Singh
  • 625
  • 5
  • 17
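
For the three variants quoted in the question, the differences are only in letter case and spacing, so a normalized key may be enough to group them. The sketch below assumes exactly that; the helper `normalize` and the `canonical` mapping are hypothetical, and variants that differ in spelling would still need a fuzzy comparison.

```python
from collections import defaultdict


def normalize(label):
    """Build a grouping key by lowercasing and removing all whitespace."""
    return "".join(label.lower().split())


skus = ["4 Cheese W Verm", "4 Cheese w Verm", "4Cheese w Verm"]

groups = defaultdict(list)
for sku in skus:
    groups[normalize(sku)].append(sku)

# Map every variant to the first spelling seen in its group.
canonical = {sku: variants[0] for variants in groups.values() for sku in variants}
print(canonical)
```

In pandas, a mapping like `canonical` could then be applied to the column with `Series.map` before the crosstab is rebuilt.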
0
votes
0 answers

How can I fuzzy match between two columns in my pandas data frame?

I tried to perform a fuzzy match between two columns in my pandas data frame but could not get the right result. Could someone please help me with this? What I have tried so far:
0
votes
1 answer

Fuzzy Wuzzy returning "RuntimeError:"

I am working on a Python script to convert semi-structured transcripts into structured XML documents. A document typically looks like this: (63019, 'Antero Midstream Partners LP (NYSE:AM) Q1 2017 Earnings Conference Call May 9, 2017 12:00 PM…
user1029296
  • 609
  • 8
  • 17
0
votes
0 answers

How to do fuzzy string matching between dictionary and dataframe

I have a dictionary like this:
Station                   Latitude     Longitude
London Bridge             51.50467435  -0.086005598
Kings Cross               51.53079819  -0.122756421
Kings Cross St. Pancras   51.53031246  …
shiadncvds
  • 27
  • 3
0
votes
2 answers

Combining values from Similar Strings in CSV File

So I have a CSV file full of transactions, with the vendor name in one column and the transaction amount in another. The goal is to find the top vendors in terms of total number of transactions. That part is pretty simple, and I have code like…