We are on version TD 14 and I come from Netezza / Postgre(Redshift) background. I have been asked to extract a login data from audit logs to find out records/transactions where the same ip is submitting similar looking usernames with small changes. e.g Samir --> Samr --> Amir etc To capture phishing activity. In POstgres we have fuzzy string functions like '%' e.g ColA % ColB (where % operator is equivalent to Similar) Soundex, Metaphone, levenshtein etc. In Teradata however I have just encountered or I have been able to find just Soundex. Is there any such in built function/method capability with Teradata version 14 to achieve the above string approximation.
Asked
Active
Viewed 3,342 times
1 Answers
0
Teradata 14.x supports the Damerau-Levenshtein Distance algorithm via the EDITDISTANCE()
function and n-gram pattern matching via the NGRAM()
function.
You can find information about the EDITDISTANCE function here and the NGRAM() function here.

Rob Paller
- 7,736
- 29
- 26