Questions tagged [soundex]

Soundex is an phonetic algorithm for indexing names based on their pronunciation in spoken English.

Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.

Soundex is the most widely known of all phonetic algorithms mainly because it is a standard feature of popular database software (such as MySQL, MS SQL Server and Oracle) and some programming languages (such as PHP).

Soundex was developed by Robert C. Russell and Margaret K. Odell and patented in 1918 and 1922

Articles

159 questions
8
votes
4 answers

MetaPhone Functions (like SoundEx) functions and use in R?

I am wanting to use MetaPhone, Double Metaphone, Caverphone, MetaPhone3, SoundEx, and if anyone has done it yet NameX functions within 'R' so I can categorize and summarize like values to minimize data cleansing operations prior to analysis. I am…
CRSouser
  • 658
  • 9
  • 25
7
votes
3 answers

Some changes on Soundex Algorithm

This algorithm is set to run over the first word or till it fills the four encoded strings. For instance, the result of the input "Horrible Great" is: H612. It neglects the second word, or in other words it takes only the first letter from the…
user979014
  • 63
  • 1
  • 5
7
votes
3 answers

Is there a soundex function for python?

Is there a soundex function for python and if not how would you go about making a soundex code? Soundex Code Letters 1 B, F, P, V 2 C, G, J, K, Q, S, X, Z 3 D, T 4 L 5 M, N 6 R SKIP A, E, H, I, O, U, W, Y, H, W,…
Josh C
  • 153
  • 1
  • 2
  • 6
7
votes
4 answers

DotNet Soundex Function

I have a database table that has a column of SQLServer Soundex encoded last name + first name. In my C# program I would like to convert a string using soundex for use in my query. Is there either a standard string function for soundex in the dotnet…
automatic
  • 2,727
  • 3
  • 34
  • 31
6
votes
3 answers

Solr search using contains, sound like

Problem: I have a movie information in solr. Two string fields define the movie title and director name. A copy field define another field which solr search for default. I would like to have google like search with limited scope as follows. How to…
Sriwantha Attanayake
  • 7,694
  • 5
  • 42
  • 44
6
votes
5 answers

Use SOUNDEX() word by word on SQL Server

Here is my problem. For example I have a table Products that contains a field, Name: Products ID | Name | .. 1 | "USB Key 10Go" 2 | "Intel computer" 3 | "12 inches laptop computer" ... I'm currently implementing a simple search engine (SQL…
Charles
  • 65
  • 1
  • 1
  • 4
6
votes
1 answer

MySQL Full-text search and SOUNDEX

I am trying to implement a first and last name search using full-text search and SOUNDEX (in case if the name is misspelled). I was trying to do something like SELECT * FROM employees WHERE MATCH SOUNDEX(first_name, last_name) AGAINST…
user1751343
  • 149
  • 1
  • 8
6
votes
5 answers

Java: how to find the most probable string in a list of strings?

I have a list of strings in Java containing first name of a person with dissimilar spellings (not entirely different). For example, John may be spelled as Jon, Jawn, Jaun etc. How should I retrieve the most appropriate string in this list. If anyone…
jigsawmnc
  • 444
  • 5
  • 15
5
votes
0 answers

Compare Names using Levenshtein distance

In my application i need to identify a person by searching for their lastname and firstname. One requirement is to accept spelling errors to a certain degree. My attempts to identify a person given a firstname and lastname were: sql query using…
surfmuggle
  • 5,527
  • 7
  • 48
  • 77
5
votes
4 answers

How to query soundex() in mysql

What is the proper structure for searching within MySql using soundex()? I know how to produce a soundex(): select soundex('str'); just not sure how to include this in my query.
jimmytiler
  • 61
  • 1
  • 1
  • 7
5
votes
2 answers

How to group by column having spelling mistakes

While working with some legacy data, I want to group the data on a column ignoring spelling mistakes. I think SOUNDEX() could do the job to achieve the desired result. Here is what I tried: SELECT soundex(AREA) FROM MASTER GROUP BY…
Khadim Ali
  • 2,548
  • 3
  • 33
  • 61
5
votes
1 answer

Speech to Text on Android with custom unusual word matching

I would like to be able to use Android's Speech-To-Text engine to recognize a variety of unusual words in sentences. To give an example, the word "electroencephalograph" comes out of STT as "electronics supply graph". When I use Soundex or…
Buns of Aluminum
  • 2,439
  • 3
  • 26
  • 44
4
votes
3 answers

"sounds-like", "did you mean THAT" functionality using full text search in SQL Server 2005

I have implemented full text search over SQL Server 2005 database using CONTAINSTABLE keyword. I was wondering is there a way to add a "sounds like" or google's "did you mean THAT" functionality if the original query yields no results.
AlexWendy
4
votes
1 answer

Why is SQL Server's SOUNDEX giving me inconsistent results?

Can someone provide some insight on why the SOUNDEX is giving different results when certain characters are capitalized vs lower case? From what I read online, SOUNDEX ignores case but I am getting different results and can't wrap my head around…
CandleWax
  • 2,159
  • 2
  • 28
  • 46
4
votes
0 answers

Questions in soundex algorithm

I'm a beginner in C++ and I'm trying to understand a Soundex algorithm I found somewhere on the internet. I understand most of it but this was not explained just posted somewhere so there are a few lines of code that I don't quite get. The algorithm…
Cantaff0rd
  • 705
  • 1
  • 6
  • 14
1
2
3
10 11