Filter to remove all rows before the first time a particular value in a specific column appears

Question

I would like to remove all rows before the first appearance of a particular value in a specific column. For example, in the data frame below, I would like to remove all rows before bob first appears in column a. Note that bob appears a second time; I only want to remove the rows before its first appearance.

(dat <- data.frame(a = c("pete", "mike", "bob", "bart", "bob"), b = c(1, 2, 3, 4, 5), c = c("home", "away", "home", "away", "gone")))
     a b    c
1 pete 1 home
2 mike 2 away
3  bob 3 home
4 bart 4 away
5  bob 5 gone

I want the resulting data frame to look like the following:

     a b    c
1  bob 3 home
2 bart 4 away
3  bob 5 gone

score 10 · Accepted Answer · answered Apr 11 '19 at 07:24

A dplyr way, using slice:

library(dplyr)
dat %>% slice(which.max(a == "bob") : n())

#     a b    c
#1  bob 3 home
#2 bart 4 away
#3  bob 5 gone

In base R, the equivalent is:

dat[which.max(dat$a == "bob") : nrow(dat), ]
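One caveat with the approach above: `which.max` returns 1 when the logical vector contains no `TRUE`, so if the value never appears, both versions return the whole data frame. Below is a minimal sketch of a guard, assuming you would rather get zero rows in that case. The helper name `rows_from_first` is made up for illustration and is not part of base R or dplyr.

```r
dat <- data.frame(a = c("pete", "mike", "bob", "bart", "bob"),
                  b = c(1, 2, 3, 4, 5),
                  c = c("home", "away", "home", "away", "gone"))

# Hypothetical helper: keep rows from the first match of `value` in `col` onward.
rows_from_first <- function(df, col, value) {
  first <- match(value, df[[col]])   # position of first match, NA if absent
  if (is.na(first)) {
    return(df[0, , drop = FALSE])    # assumption: no match means zero rows
  }
  df[first:nrow(df), , drop = FALSE]
}

res  <- rows_from_first(dat, "a", "bob")  # rows 3 to 5 of dat
none <- rows_from_first(dat, "a", "zed")  # zero rows, same columns
```

Whether an absent value should yield zero rows or all rows depends on your use case, so treat the `is.na` branch as a design choice rather than the only sensible behaviour.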

Ronak Shah

`dat[match(TRUE, dat$a == "bob")[1]:nrow(dat), ]` – Pablo Rod Apr 11 '19 at 08:02

score 5 · Answer 2 · answered Apr 11 '19 at 07:17

cumsum is usually a good candidate for such tasks:

dat[cumsum(dat$a == "bob") >= 1, ]
#     a b    c
#3  bob 3 home
#4 bart 4 away
#5  bob 5 gone
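To see why this works, it may help to look at the intermediate vectors. The sketch below just splits the one-liner into steps; the variable names `hit`, `running`, and `keep` are introduced here for illustration only.

```r
dat <- data.frame(a = c("pete", "mike", "bob", "bart", "bob"),
                  b = c(1, 2, 3, 4, 5),
                  c = c("home", "away", "home", "away", "gone"))

hit     <- dat$a == "bob"   # FALSE FALSE  TRUE FALSE  TRUE
running <- cumsum(hit)      # 0 0 1 1 2: number of matches seen so far
keep    <- running >= 1     # FALSE FALSE  TRUE  TRUE  TRUE

res <- dat[keep, ]          # rows from the first "bob" onward
```

Because the running count stays at 0 until the first match, a value that never appears gives an all-`FALSE` mask and therefore zero rows.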

markus

score 3 · Answer 3 · answered Apr 11 '19 at 12:45

We can use cummax:

library(dplyr)
dat %>%
     filter(cummax(a == "bob") > 0)
#     a b    c
#1  bob 3 home
#2 bart 4 away
#3  bob 5 gone
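For readers without dplyr, the same idea can be sketched in base R. Unlike a running count, the running maximum of the logical test only ever takes the values 0 and 1, switching to 1 at the first match and staying there. The name `flag` is illustrative.

```r
dat <- data.frame(a = c("pete", "mike", "bob", "bart", "bob"),
                  b = c(1, 2, 3, 4, 5),
                  c = c("home", "away", "home", "away", "gone"))

flag <- cummax(dat$a == "bob")  # 0 0 1 1 1: becomes 1 at the first match
res  <- dat[flag > 0, ]         # keep rows where a match has been seen
```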

akrun
