Questions tagged [soundex]

Soundex is an phonetic algorithm for indexing names based on their pronunciation in spoken English.

Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.

Soundex is the most widely known of all phonetic algorithms mainly because it is a standard feature of popular database software (such as MySQL, MS SQL Server and Oracle) and some programming languages (such as PHP).

Soundex was developed by Robert C. Russell and Margaret K. Odell and patented in 1918 and 1922

Articles

159 questions
0
votes
1 answer

Soundex returns 0 for every character

I have this function which converts a word to a soundex value, but it is returning all letters as 0. My last name, Smith, should by S530, but returns S000, for example. Link for code (since it's too long to put on here):…
kjsmita6
  • 458
  • 1
  • 6
  • 21
0
votes
1 answer

determining soundex conversion

when converting the name 'Lukasieicz' to soundex (LETTER,DIGIT,DIGIT,DIGIT,DIGIT), I come up with L2222. However, I am being told by my lecture slides that the actual answer is supposed to be L2220. Please explain why my answer is incorrect, or if…
0
votes
0 answers

Searching List data

I'm trying to work out the best way possible to search a List based on say merchant name and amount. For example consider following Transaction class or goals : public class Transaction { public string MerchantName; public double Amount …
Mike
  • 780
  • 1
  • 5
  • 26
0
votes
1 answer

Is there a way to generate words that sound similar to a given dictionary word without using a corpus?

I am trying to use phonetic algorithms like Soundex and/or Metaphone to generate words that sound similar to a given dictionary word. Do I have to have a corpus of all dictionary words for doing that? Is there another way to generate words that…
0
votes
1 answer

Fuzzy autocomplete

In my application, I have a users table, with first_name and last_name. I currently have a third column full_name (automatically generated) like this : first_name + last_name + first_name (without special chars). "Etienne", "De Crécy", "Etienne De…
pierallard
  • 3,326
  • 3
  • 21
  • 48
0
votes
1 answer

Increase fault tolerance for SQL Soundex for fuzzy string matching

Is there a way to increase the fault tolerance of the SQL SOUNDEX function when used for fuzzy string matching? I am using this to search for supplier names. What I have is: WHERE SOUNDEX(@SearchTerm) = SOUNDEX(s.Name) This works to some…
Carel
  • 2,063
  • 8
  • 39
  • 65
0
votes
0 answers

Delete duplicate rows with soundex?

I have two tables, one has foreign keys to the other. I want to delete duplicates from Table 1 at the same time updating the keys on Table 2. I.e count the duplicates on Table 1 keep 1 key from the duplicates and query the rest of the duplicate…
ebolton
  • 1,126
  • 3
  • 14
  • 20
0
votes
1 answer

Using Soundex and CriteriaBuilder API from EclipseLink

Currently in the process of creating a object search using CriteriaBuilder, Predicates, JPA 2.0 with EclipseLink as provider. My challenge is accessing the soundex capabilities and applying it to a dynamically built criteria. CriteriaBuilder…
rdonovan
  • 21
  • 3
0
votes
0 answers

unexpected end of pattern scikit-learn

I'm trying to use soundex to convert every word of a line to a hashed version and then using scikit-learn to perform some machine learning on it. The code goes: train = [] for line in text: a = ' ' sound = [] for word in line.split(): …
user3666471
  • 907
  • 11
  • 24
0
votes
0 answers

Soundex against individual values in CSV column

I suspect this may not be doable, but I figured I'd try anyway. In a MySQL database, one of the columns related is a comma-separated list of values: bob,sally,james,rick. For a given row, the number of items in this column is variable. Now, if I…
Philip Kahn
  • 614
  • 1
  • 5
  • 22
0
votes
2 answers

Double-metaphone errors

I'm using Lawrence Philips Double-Metaphone algorithm with great success, but I have found the odd "unexpected result" for some combinations. Does anyone else have additions or changes to the algorithm for other parts of it they wouldn't mind…
gbjbaanb
  • 51,617
  • 12
  • 104
  • 148
0
votes
0 answers

Get similar sounding records

I want to find all duplicate names from contacts table whose name matches by sound. Eg : Rita or Reeta , Microsoft or Microsift , Mukherjee or Mukherji. I have used below query : select contacts.id from contacts INNER JOIN (SELECT…
user3286692
  • 383
  • 1
  • 5
  • 23
0
votes
1 answer

trying to use SOUNDEX in query but getting an error in terms of database columns in mysql

Trying to figure out what went wrong, must be a silly syntax. $objDatabase = QApplication::$Database[1]; $strQuery = 'UPDATE `account` SET `sndx`=SOUNDEX("'.$objAccount->Name.'") WHERE `Id`='.$aid; $objDbResult = $objDatabase->Query($strQuery); The…
Satchel
  • 16,414
  • 23
  • 106
  • 192
0
votes
0 answers

Soundex search doen't give correct results

I've got a table product and the user needs to be able to search through products using an autocomplete function on a text field. But I want also when people make little mistakes in the name, it gets found aswell. What is needed All my titles are…
Rickert
  • 1,677
  • 1
  • 16
  • 23
0
votes
1 answer

Replacing word with same soundex

i have a list of sentences with wrongly spelled prepositions. i have a list of correctly spelled preps: ref_data = ['near','opposite','off','towards','behind','ahead','below','above','under','over','in','inside','outside'] i need to compute the…
Hypothetical Ninja
  • 3,920
  • 13
  • 49
  • 75