We have lots of duplicates in a master table. See the below example, Customer ABC Corp exists 3 times in the master table and all 3 CustIDs are referenced in Orders table.
**Customers Table**
<table>
<tr><td><strong>CustID</strong></td><td><strong>CustName</strong></td></tr>
<tr><td>1001</td><td>ABC Corp.</td></tr>
<tr><td>1002</td><td>XYZ Corp.</td></tr>
<tr><td>1003</td><td>ABC Corp Ltd.</td></tr>
<tr><td>1002</td><td>ABC Corporation Limited.</td></tr>
</table>
**Orders Table**
<table>
<tr><td><strong>OrderID</strong></td><td><strong>CustID</strong></td></tr>
<tr><td>23425</td><td>1001</td></tr>
<tr><td>23466</td><td>1003</td></tr>
<tr><td>23488</td><td>1003</td></tr>
<tr><td>43877</td><td>1004</td></tr>
</table>
How can I use Data Quality Services to Remove duplicate Customers from Customers table and ALSO to update Orders table to reflect the changes.
i.e. CustIDs 1003 and 1004 merged into 1001 and 1003 and 1004 from Orders table need to be updated to 1001 too.
What I have done so far. Created a knowledge base for Customers with Synonyms for CustName and defined a leading value. And then created data cleansing project in DQS and analysed the data and then made corrections to master data and imported those corrected values to the Customers Knowledge base. I understood that DQS Cleansing Transformation in SSIS does this automatically by connecting to the source table and DQS KB and gives matching and non matching rows, But I still don't understand what to do from here. How the data in Customers table corrected and Orders table updated accordingly. Please suggest. The examples I have seen so far are using pretty basic stuff with just a table and a simple excel sheet.