I have a 56 GB wordlist and I would like to remove the duplicates from it. I tried to do this in Java, but I run out of space on my laptop after 2.5M words. So I'm looking for an (online) program or algorithm that would let me remove all duplicates.
Thanks in advance, Sir Troll
Edit: What I did in Java was put the words in a TreeSet, so that they would be sorted and the duplicates removed.
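
For reference, the TreeSet approach described in the edit looks roughly like the sketch below. This is not the original code: the class name `TreeSetDedup`, the method `dedup`, and the one-word-per-line input format are assumptions made for illustration, and a small in-memory string stands in for the real file.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.Set;
import java.util.TreeSet;

public class TreeSetDedup {
    // Reads one word per line (assumed format) and returns the distinct
    // words in sorted order. Every distinct word is held in memory at once.
    static Set<String> dedup(BufferedReader reader) throws IOException {
        Set<String> words = new TreeSet<>();
        String line;
        while ((line = reader.readLine()) != null) {
            words.add(line); // add() ignores a word that is already present
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // Tiny stand-in input; the real wordlist would be read from disk.
        String sample = "pear\napple\npear\nfig\napple\n";
        try (BufferedReader reader = new BufferedReader(new StringReader(sample))) {
            System.out.println(dedup(reader)); // prints [apple, fig, pear]
        }
    }
}
```

Because the set keeps every distinct word on the heap, its memory use grows with the number of unique words, which is presumably why this stops working long before the whole 56 GB file has been read.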