0

I am using Open Refine to clean up some addresses in a sheet (.csv). How do I cluster exact match in a column? It seems that the clustering algorithm will cluster addresses that are almost similar which only differs in the number. For example:

56, LORONG RENGAS, SELANGOR
76, LORONG RENGAS, SELANGOR
90, LORONG RENGAS, SELANGOR
76, lorong rengas, selangor
56, LORONG RENGAS, SELANGOR

How do I use OpenRefine to cluster these addresses according to exact match? Question sounds silly but I'm losing my head over this thing.

Thanks

antsemot
  • 61
  • 1
  • 1
  • 6

1 Answers1

1

Did you try using the facet function? Facet group records based on an extact match. You can watch those video on faceting and data profiling.

magdmartin
  • 1,712
  • 3
  • 20
  • 43