1

I have a list of proteins in a column but they all end with _l how do I remove _l from proteins ? example:

nr2_l , nr1_l , pkca_l , p_lr1_l , p_ln_l

separate(protein_exp, c("protein", "l"), "_l")

but this then separates protein **p_lr1_l ** to :

protein l
p r1

I want it to be : | protein | l | | --------| -------- | | p_lr1 | _l |

Thanks

Drake
  • 47
  • 4

1 Answers1

1

String Remove from stringr library can do this.

library(tidyverse)
str_remove("test_l", pattern="_l$")
#or
str_remove(protein_exp, pattern="_l$")

The extra $ dollar sign means "only match _l at the end of the string".

If your data is in a dataframe called df, you can use mutate()

df %>% mutate(protein = str_remove(protein_exp, pattern="_l$"))
M.Viking
  • 5,067
  • 4
  • 17
  • 33