Throwing out elements in a python list of pairs

Question

I've searched around for pointers on this question but couldn't find any. Suppose I have a list in Python:

list = set([((3, 2), (2, 1)),
            ((3, 2), (3, 1)),
            ((3, 1), (2, 1)), 
            ((2, 1), (1,3), (2, 3))])

I want to refine this list so that entries of the list containing pairs with the same first element are thrown out. So for example, the output for the list above should be

set([((3, 2), (2, 1)),
     ((3, 1), (2, 1))])

Because ((3, 2), (3, 1)) and ((2, 1), (1,3), (2, 3)) are elements in which at least two of the coordinate pairs have the same first entry. Is there a fast and easy way to do this?

As it stands, I am thinking of doing something like

[x for x in list if ... ]

where I loop over the list by fixing x[k][0] and going through and comparing each x[i][0] with varying i to x[k][0], then looping over all such k's. I feel there has to be a better way to do this. Hope I was clear enough in this question, and I greatly appreciate your help.

Dont use `list` as a variable name, and you dont have a list -- you have a set — the wolf, Dec 26 '12 at 20:45
Good point. Sorry for the misnomer-- my code is actually different, but I renamed the variables in my question. — user1693728, Dec 26 '12 at 21:19

Stuart · Answer 1 · 2012-12-26T21:18:20.877

3

You could use

def throw_out_elements(iterable):
    for x in iterable:
       if len(set(y for y, _ in x)) == len(x):
            yield x

Then to use this:

S = set([((3, 2), (2, 1)),
            ((3, 2), (3, 1)),
            ((3, 1), (2, 1)), 
            ((2, 1), (1,3), (2, 3))])
print list(throw_out_elements(S))

output: [((3, 2), (2, 1)), ((3, 1), (2, 1))]

edited Dec 26 '12 at 21:18

answered Dec 26 '12 at 20:53

Stuart

9,597
1
21
30

Note that checking the length of the set means that all elements must be consumed into the set (it's not lazy). – Gareth Latty Dec 26 '12 at 21:26

score 3 · Accepted Answer · edited May 23 '17 at 10:24

3

This can be done quite easily with a simple set comprehension and a simple function:

def no_duplicates(x):
    seen = set()
    return not any(i in seen or seen.add(i) for i in x)

data = {((3, 2), (2, 1)),
        ((3, 2), (3, 1)),
        ((3, 1), (2, 1)),
        ((2, 1), (1,3), (2, 3))}

print({item for item in data if no_duplicates(first for first, _ in item)})

Producing:

{((3, 2), (2, 1)), 
 ((3, 1), (2, 1))}

We take each item if the first element of each pair in the item is unique. We use the simple no_duplicates() function (pulled from this great answer) to do this, which does what it says on the tin.

edited May 23 '17 at 10:24

Community

1
1

answered Dec 26 '12 at 20:55

Gareth Latty

86,389
17
178
183

This is the best solution so far. +1 tomorrow when I have more votes. – Justin Lewis Dec 26 '12 at 20:56
Lots of good answers here! I think this is the nicest thus far, but my thanks to everyone. – user1693728 Dec 27 '12 at 18:52

jeffknupp · Answer 3 · 2012-12-26T21:17:13.730

1

If you're dead set on a single list comprehension, the following would work.

my_list = set([((3, 2), (2, 1)),
        ((3, 2), (3, 1)),
        ((3, 1), (2, 1)),
        ((2, 1), (1,3), (2, 3))])

[x for x in my_list if len(set([y[0] for y in x])) == len(x)]

Edit: First answer was wrong as I misread the question.

edited Dec 26 '12 at 21:17

answered Dec 26 '12 at 20:58

jeffknupp

5,966
3
28
29

This is a duplicate of my answer, except with the function inlined. Edit: As @Stuart points out below, inlined incorrectly, so this is wrong. – Gareth Latty Dec 26 '12 at 21:00
But unlike @Lattyware's answer I don't think it will do what the questioner is asking for... read it carefully – Stuart Dec 26 '12 at 21:00
Note that checking the length of the set means that all elements must be consumed into the set (it's not lazy). – Gareth Latty Dec 26 '12 at 21:25
1

It's not meant to be. If the length of pairs is significantly greater than the example given, this may of course be slightly less efficient than creating a function that breaks on the first duplicate (though I'd guess that for a reasonably high value of len, function call overhead would dominate the time spent). – jeffknupp Dec 26 '12 at 21:34

Throwing out elements in a python list of pairs

3 Answers3