How to remove groups of observations with dplyr::filter()

Question

For the following data

library(dplyr)  # provides %>% and the dplyr verbs used below

ds <- read.table(header = TRUE, text = "
id year attend
1 2007      1
1 2008      1
1 2009      1
1 2010      1
1 2011      1
8 2007      3
8 2008     NA
8 2009      3
8 2010     NA
8 2011      3
9 2007      2
9 2008      3
9 2009      3
9 2010      5
9 2011      5
10 2007     4
10 2008     4
10 2009     2
10 2010    NA
10 2011    NA
")
ds <- ds %>% dplyr::mutate(time = year - 2000)
print(ds)

How would I write a dplyr::filter() command to keep only the ids that don't have a single NA? So only subjects with ids 1 and 9 should stay after the filter.

score 29 · Answer 1 · answered Jul 05 '14 at 06:11

Or you could use:

ds %>%
  group_by(id) %>%
  # keep a group only if none of its attend values is NA
  filter(all(!is.na(attend)))
#Source: local data frame [10 x 3]
#Groups: id

#  id year attend
#1   1 2007      1
#2   1 2008      1
#3   1 2009      1
#4   1 2010      1
#5   1 2011      1
#6   9 2007      2
#7   9 2008      3
#8   9 2009      3
#9   9 2010      5
#10  9 2011      5

answered Jul 05 '14 at 06:11 by akrun

I like this one better, because it stays within dplyr and is shorter. Thanks! – andrey Jul 05 '14 at 07:00
And also `filter(!anyNA(attend))`. – Joe Jul 23 '19 at 14:13
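For reference, here is a minimal sketch of the `anyNA()` variant from the comment above. It assumes dplyr is installed; `ds_small` and `kept` are hypothetical names, and the data is a cut-down stand-in for the question's `ds`, not the full table.

```r
library(dplyr)

# Cut-down stand-in for the question's data (hypothetical object name)
ds_small <- data.frame(
  id     = c(1, 1, 8, 8, 9, 9),
  attend = c(1, 1, 3, NA, 2, 5)
)

kept <- ds_small %>%
  group_by(id) %>%
  # anyNA() is TRUE if the group has at least one NA, so negate it
  filter(!anyNA(attend)) %>%
  ungroup()

unique(kept$id)
# 1 9
```

Inside a grouped `filter()`, a condition of length one is applied to the whole group, which is why a single `TRUE`/`FALSE` per id keeps or drops all of that id's rows.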

score 8 · Accepted Answer · edited Mar 18 '16 at 05:39

Use filter in conjunction with base::ave, which applies a function within each group and returns a vector as long as its input:

ds %>% dplyr::filter(ave(!is.na(attend), id, FUN = all))

To obtain

    id year attend
 1   1 2007      1
 2   1 2008      1
 3   1 2009      1
 4   1 2010      1
 5   1 2011      1
 6   9 2007      2
 7   9 2008      3
 8   9 2009      3
 9   9 2010      5
 10  9 2011      5
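To see why this works, it may help to look at what `ave()` returns on its own. This is a small sketch using base R only; the vectors `id` and `attend` below are a hypothetical cut-down version of the question's columns, and `keep` is an illustrative name.

```r
# Hypothetical cut-down versions of the question's columns
id     <- c(1, 1, 8, 8, 9, 9)
attend <- c(1, 1, 3, NA, 2, 5)

# For each id, all() is computed over that id's non-NA flags,
# and the single result is recycled back to every row of the id
keep <- ave(!is.na(attend), id, FUN = all)
keep
# TRUE TRUE FALSE FALSE TRUE TRUE
```

Because the result has one value per row, it can be passed straight to `filter()` without a `group_by()` step.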

edited Mar 18 '16 at 05:39 by alexwhan
answered Jul 05 '14 at 05:09 by Robert Krzyzanowski

Yes, 1 and 9; I already corrected it. Thanks, @Robert Krzyzanowski, this is exactly what I needed. I had never seen the ave() function used before. I'm glad I asked; I learned something new. – andrey Jul 05 '14 at 05:13
I was waiting for 2 mins to pass to accept it :) thanks again! – andrey Jul 05 '14 at 05:17
