So I have a table where I have identified fuzzy matches and an amount. I want to be able to summarize the amount by this common key.
My Data looks like this:
Name | Match1 | Match2 | Amount |
---|---|---|---|
Jame | James | Jim | 1 |
Jame | James | Jim | 2 |
Mike | Mikes | Miike | 3 |
James | Jame | Jim | 4 |
Jim | Jame | James | 5 |
Mikes | Mike | Miike | 6 |
Miike | Mike | Mikes | 7 |
data = {'Name': {0: 'Jame',
1: 'Jame',
2: 'Mike',
3: 'James',
4: 'Jim',
5: 'Mikes',
6: 'Miike'},
'Match1': {0: 'James',
1: 'James',
2: 'Mikes',
3: 'Jame',
4: 'Jame',
5: 'Mike',
6: 'Mike'},
'Match2': {0: 'Jim',
1: 'Jim',
2: 'Miike',
3: 'Jim',
4: 'James',
5: 'Miike',
6: 'Mikes'},
'Amount': {0: 1, 1: 2, 2: 3, 3: 4, 4: 5, 5: 6, 6: 7}}
df = pd.DataFrame.from_dict(data)
What I'd like my resulting table to look like:
Key | Name | Match1 | Match2 | Amount |
---|---|---|---|---|
Jame | Jame | James | Jim | 1 |
Jame | Jame | James | Jim | 2 |
Mike | Mike | Mikes | Miike | 3 |
Jame | James | Jame | Jim | 4 |
Jame | Jim | Jame | James | 5 |
Mike | Mikes | Mike | Miike | 6 |
Mike | Miike | Mike | Mikes | 7 |
So I can summarize my data like this:
Name | Amount |
---|---|
Jame | 12 |
Mike | 16 |