To make things simple lets say I have a Client
table with Fields:
- ClientID
- PCode
- Region
I have a look up Region
table with Fields:
- ID
- PostCode
- Region
The Client
table has one row :
123, 3075, THOMASTOWN
The Region has 2 rows:
1,3074,THOMASTOWN
2,3075,LALOR
I am trying to cleanse some data using SSIS fuzzy lookup.
I use the Client as source and the lookup as Reference. When the Max number of matches to output per lookup is 1 the result is 123,3075,THOMASTOWN, 1,3074,THOMASTOWN
. In this case SSIS prefers the value of region over the value of pcode. But when I increase this option to a higher (2, 4, or 100) all the result rows are the same as the previous one.
I expect when I increase the number the other row of the lookup table shows up as one of the matches because the row has the same region code as the Client row.
To my surprise when for the first time I increased the option from 2 to 4 I saw the expected result (the other lookup record as a match) in 2 out of 4 rows of the fuzzy lookup output but since then it has never happened again and I always see the exactly same records as per different values of Max number of matches option.
Can anyone explain to me what's happening here and if I am doing something wrong?