As a newbie I'm struggling to solve the following issue with a big dataset. I would want to look for the value in the "CHROM" column comparing two columns of CLONEID (the second column contains duplicates). The corresponding "CHROM" value would be attributed to each "CLONEID" and same value would be attributed to its duplicate. I write this table as an example:
CLONEID | CHROM
976803 | 2A
976877 | 5B
976952 | 6B
976961 | 3B
976975 | 1A
977084 | 7B
977228 | 4A
977241 | 3A
I would like to obtain the following output:
CLONEID | CHROM
976803 |2A
976877 |5B
976952 |6B
976961 |3B
976975 |1A
977084 |7B
977084_1 |7B
977228 |4A
977228_1 |4A
977228_2 |4A
977228_3 |4A
977241 |3A