I am looking for some dedupe software that is compatible with MS SQL Server. I have a rather large and messy table that contains addresses from all over the world, in many different languages. The table is set up to handle dupes as parent/child records, so the software needs some way to act on a match (i.e., link the dupe to its parent rather than just delete it).
Edit: Here's the structure
ParentID | MasterID | PropertyName | Address1 | Address2 | PostalCode | City | StateProvinceCode | CountryCode | PhoneNumber
MasterID is unique for each record. ParentID contains the MasterID of the parent record for each entry, and the parent record is the one where MasterID = ParentID. CountryCode is the two-letter ISO country code (not the telephone code).
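To make the parent/child convention concrete, here is a minimal sketch. It uses Python's built-in sqlite3 module as a stand-in for SQL Server, so the column types are only approximations. The table name Properties and the sample rows are made up for illustration, and only a few of the columns are included.

```python
import sqlite3

# In-memory SQLite database standing in for the real SQL Server table.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE Properties (          -- hypothetical table name
        ParentID     INTEGER NOT NULL,
        MasterID     INTEGER PRIMARY KEY,
        PropertyName TEXT,
        City         TEXT,
        CountryCode  TEXT
    )
""")

# Invented sample rows: record 2 is a dupe (child) of record 1,
# record 3 has no dupes and is therefore its own parent.
rows = [
    (1, 1, "Example Hotel", "Berlin", "DE"),
    (1, 2, "Hotel Example", "Berlin", "DE"),
    (3, 3, "Sample Inn", "Paris", "FR"),
]
conn.executemany("INSERT INTO Properties VALUES (?, ?, ?, ?, ?)", rows)

# Parent records are the ones where MasterID = ParentID.
parents = conn.execute(
    "SELECT MasterID FROM Properties "
    "WHERE MasterID = ParentID ORDER BY MasterID"
).fetchall()

# Child records point at their parent through ParentID (self-join).
children = conn.execute("""
    SELECT c.MasterID, p.MasterID
    FROM Properties AS c
    JOIN Properties AS p ON p.MasterID = c.ParentID
    WHERE c.MasterID <> c.ParentID
    ORDER BY c.MasterID
""").fetchall()

print("parents:", parents)
print("children (child, parent):", children)
```

So when a match is found, the tool would need to set the dupe's ParentID to the surviving record's MasterID instead of deleting the row.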