I have a dataset in which values are same except the number of semicolons in it resulting to different records.
For example if in a column one records has a;b;c and another record has a;;b;c, this is disabling the use of distinct function in my code. I want this to be treated as duplicate record for which ;; needs to be replaced with ;
How can we replace multiple ; with single ; in strings in a dataset in hive?