Remove part of a string based on another column in R

Question

I have a large dataset that looks like this. I want to remove a certain number of strings from the fruits columns indicated by the remove_strings column.

library(tidyverse)

df <- tibble(fruits=c("apple","banana","ananas"), 
             remove_strings=c(1,4,2))

df
#> # A tibble: 3 × 2
#>   fruits remove_strings
#>   <chr>           <dbl>
#> 1 apple               1
#> 2 banana              4
#> 3 ananas              2

^{Created on 2022-03-09 by the reprex package (v2.0.1)}

From apple I want to remove the first string, from banana the first 4 and ananas the first 2. I want my data to look like this:


#>   fruits remove_strings   new_fruits
#>   <chr>           <dbl>
#> 1 apple               1      pple
#> 2 banana              4        na
#> 3 ananas              2       anas

Maël · Accepted Answer · 2022-03-09T13:52:35.677

3

Using substr:

with(df, substr(fruits, remove_strings + 1, nchar(fruits)))
# [1] "pple" "na"   "anas"

Or, using str_sub:

library(stringr)
df %>% 
  mutate(removed = str_sub(fruits, remove_strings + 1))

# A tibble: 3 x 3
  fruits remove_strings removed
  <chr>           <dbl> <chr>  
1 apple               1 pple   
2 banana              4 na     
3 ananas              2 anas

edited Mar 09 '22 at 13:52

answered Mar 09 '22 at 13:45

Maël

45,206
3
29
67

Nad Pat · Answer 2 · 2022-03-09T13:57:42.377

2

df$new_fruits = substring(df$fruits, df$remove_strings + 1)
[1] "pple" "na"   "ana

edited Mar 09 '22 at 13:57

answered Mar 09 '22 at 13:50

Nad Pat

3,129
3
10
20

Carpa · Answer 3 · 2022-03-10T06:20:12.087

0

substr(fruits, remove_strings+1, nchar(fruits))

(I would like to say that I solved the problem independently of Maëls solution. I cannot prove that but it's the first time that this happens in any of my posts.)

edited Mar 10 '22 at 06:20

answered Mar 09 '22 at 13:49

Carpa

412
4
16

Remove part of a string based on another column in R

3 Answers3