Removing descrepancies from my list

Question

Hi there. I was hoping I could ask for some advice in regards to a problem I am struggling with.

I have a List with more than a thousand values and there are some duplicates, not exact duplicates but discrepencies based on upper and lower case. so for example I would have

Training and training in the same list or Vision and Values and Vision and values.

So there are various instances where there are minor discrepies based on Case difference.

How could I go about in removing thsese 'excess' values?

check http://stackoverflow.com/questions/283063/linq-distinct-operator-ignore-case — spajce, Feb 11 '13 at 10:11

score 2 · Answer 1 · answered Feb 11 '13 at 10:11

2

Use Linq:

var listWithDups = new List<string>() = {"blah","Blah","etc","etc."};
var listWithoutDups = listWithDups.Distinct(StringComparer.CurrentCultureIgnoreCase).ToList();

answered Feb 11 '13 at 10:11

Paul Grimshaw

19,894
6
40
59

mortb · Answer 2 · 2013-02-11T10:17:21.740

1

I would add all entries to a Hashset

A Hashset is a collection that stores maximum one of every item added to it. You'd write a "ignore case" equity comparer that you'd pass into the Hashset construcor. Like:

     var set = new Hashset( yourListWithDuplicates, (x,y) => x.Equals(y, 
StringComparison.CurrentCultureIgnoreCase));

edited Feb 11 '13 at 10:17

answered Feb 11 '13 at 10:11

mortb

9,361
3
26
44

score 1 · Accepted Answer · answered Feb 11 '13 at 10:19

1

Tried this in LinqPad:

var list = new List<String> { "Hello", "World", "HELLO", "beautiful", "WORLD" };

var l = list.Distinct(StringComparer.CurrentCultureIgnoreCase).ToList();

Console.WriteLine(l);

answered Feb 11 '13 at 10:19

David Brabant

41,623
16
83
111

Removing descrepancies from my list

3 Answers3