I have a MongoDB database (100 GB), there are some duplicates in there which I wish to remove. What would be an easy way to go about this ?
I am using MongoDB 3.0 and pymongo to interact with it. The collection looks like :
{"kmer":"AATACAATT","species":"Sachromyces",..........}