Questions tagged [fuzzywuzzy]

FuzzyWuzzy is a Python package to perform fuzzy string matching.

FuzzyWuzzy is a Python package to perform fuzzy string matching.

Useful links

522 questions
-1
votes
1 answer

Rearrange words in array to matching position calculating Levenshtein distance Php

Rearrange words in Array based on position of the first array. In my code there are two array my first array is the base array from which i am going to compare it with second array and make the position same as first array. Consider 2 input by…
user10655999
-1
votes
1 answer

Try, Except / If Statement Combination - Missing results

I am comparing one list of universities with 12 other lists, finding fuzzy string matches and writing all results to a csv. I am not doing the fuzzy string match to one big list as I need to know what list the match came from. Example of the…
Uralan
  • 79
  • 1
  • 9
-1
votes
1 answer

python - fuzzy matching, looping through a data set to find corresponding items in the reference set

I am trying to learn and implement fuzzy matching in python. I have two data sets which I load as data frames into pandas. Set 1 is the reference set. Set two is the set containing data to match with the reference names. I loop through the set_1…
Chris
  • 767
  • 1
  • 8
  • 23
-2
votes
1 answer

why do i get a key error from output when i do a merge

hi please help me I am trying to fuzzy merge using pandas and fuzzywuzzy on two datasets using two columns from each, but I get a traceback at the line before the print function that says KeyError: ('name', 'lasntname'), I do not know if I am…
Lamo
  • 11
  • 3
-2
votes
1 answer

How to compare all the first element of tuple with corresponding second element in the same tuple

i have a list of tuples as following: terms = [('cat', 'cat'), ('cat', 'bat'), ('cat', 'cat'), ('cat', 'cat'), ('cat', 'bat'), ('cat', 'No Data'), ('cat', 'bat'), ('cat', 'No Data'), ('bat', 'cat'), ('bat', 'bat'), ('bat', 'cat'), ('bat', 'cat'),…
Vas
  • 918
  • 1
  • 6
  • 19
-3
votes
1 answer

Find All Variation Of A Word In A Column

How to identify all the variation of a word in a column_one, and then fill a value in other column, , columns_two, whenever a variation of that word is found? E.g. Fill column value with P, whenever a variation of "PHIADELPHIA" is found, and fill…
-3
votes
1 answer

Standardizing company names

I have a list of company names, but these have misspelling and variations. How best can I fix this so every company has the consistent naming convention (for later groupby, sort_value, etc.)? pd.DataFrame({'Company': ['Disney','Dinsey', 'Walt…
denpy
  • 279
  • 2
  • 10
-3
votes
1 answer

FuzzyWuzzy Wratio score bug

I understand how basic fuzzy-wuzzy and its scores work. However, I came across a scenario where fuzzy-wuzzy gives a high score for WRatio even though the two strings do not seem to have similarities of any sort. (Image below for reference). Can…
Shreyesh Desai
  • 569
  • 4
  • 19
-3
votes
3 answers

How to count how many sentences are similar?

I have a dataset made by 2 columns, one for users and one for texts: `User` `Text` 49 there is a cat under the table 21 the sun is hot 431 could you please close the window? 65 there is a cat under the table 21 …
user12907213
-3
votes
1 answer

String matching algorithms in python

I am looking for some suggestions on the algorithms which could be used for string matching which also supports non-english languages too. Previously tried algorithm: I have tried Levenshtein distance (Fuzzy matching) with token_sort_ratio…
-3
votes
1 answer

Find the similarity between a string input and a string column of a Data Frame

I have a pandas data frame with two columns containing strings, like below: Col-1 Col-2 Animal have an apple Fruit tiger safari Veg Vegetable Market Flower Garden From…
ssp
  • 71
  • 1
  • 1
  • 5
1 2 3
34
35