Remove all rows following a row that contains a value

Question

I'm attempting to remove all rows that come after a value (or values) but am running into some trouble.

I want to do the opposite of this: Filter to remove all rows before the first time a particular value in a specific column appears

Using the example dataframe from the above question:

(dat<-data.frame(a= c("pete", "mike", "bob", "bart", "bob"), b=c(1,2,3,4,5), c=c("home", "away", "home", "away", "gone")))

         a b    c
    1 pete 1 home
    2 mike 2 away
    3  bob 3 home
    4 bart 4 away
    5  bob 5 gone

I want my result to look like this:

     a b    c
1 pete 1 home
2 mike 2 away
3  bob 3 home

Here is what I've tried so far:

dat %>% slice(which.min(a == "bob") : n())

But unlike which.max which removed everything before bob this doesn't remove anything after it.

Try `dat %>% slice(1 : which.max(a == "bob"))`. – dcarlson Apr 10 '21 at 00:14 — dcarlson, Apr 10 '21 at 00:14

akrun · Accepted Answer · 2021-04-10T00:21:57.537

3

We can use

library(dplyr)
dat %>% 
     slice(seq(which.max(a == 'bob')))

Or with cumsum

dat %>% 
    filter(lag(cumsum(a == 'bob'), default = 0) < 1)

Or in base R

dat[seq_len(match('bob', dat$a)),]

edited Apr 10 '21 at 00:21

answered Apr 10 '21 at 00:14

akrun

874,273
37
540
662

score 3 · Answer 2 · answered Apr 10 '21 at 01:47

3

Using row_number() :

library(dplyr)
dat %>% filter(row_number() <= match('bob', a))

#     a b    c
#1 pete 1 home
#2 mike 2 away
#3  bob 3 home

answered Apr 10 '21 at 01:47

Ronak Shah

377,200
20
156
213

Remove all rows following a row that contains a value

2 Answers2