Remove double quote \" symbol from string

Question

I need to remove \" from a vector. This is my data:

data <- c("\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1803224&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Flinux-linux-security-masterclass-3-in-1%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1848638&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Fmastering-kali-linux%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1426684&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Finformation-gathering-with-kali-linux%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1628300&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Flinux-switchblade%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1615700&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Fadministrador-de-sistemas-junior-en-windows-server-y-linux%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.809770&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Flearn-bash-shell-in-linux-for-beginners-lite%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.574388&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Fhow-to-install-linux-ubuntu-server%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1436610&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Fcentos-and-ubuntu-managing-packages%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1771266&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Flinux-foundation-certified-system-administrator-exam%2F", 
"\"https://click.linksynergy.com/link?id=RUxZriH*PWc&offerid=323058.1734052&type=2&murl=https%3A%2F%2Fwww.udemy.com%2Flinux-server-security%2F"
)

As you can see, every object starts with \". How can I specifically remove these characters and leave the links?

milan · Accepted Answer · 2018-08-15T13:10:07.860

16

You can try this. Note that what you actually want is to remove \", not "\ (as proposed in the unedited version of your question). The first " you need to represent each element in the character.

gsub('[\"]', '', data)

edited Aug 15 '18 at 13:10

answered Aug 15 '18 at 12:52

milan

4,782
2
21
39

score 12 · Answer 2 · answered Aug 15 '18 at 12:58

12

Or we can just use '"' on the pattern

gsub('"', "", data)

answered Aug 15 '18 at 12:58

akrun

874,273
37
540
662

zx8754 · Answer 3 · 2018-08-15T13:33:23.277

If it is always 1st character then just use substring:

substring(data, 2)

This should be faster than any regex solution.

data <- rep(data, 1000)

microbenchmark::microbenchmark(
  a = substring(data, 2),  
  b = gsub("\"", "", data, fixed = TRUE),
  c = gsub('"', "", data),
  d = gsub('[\"]', '', data),
  e = stringr::str_replace(data, '[\"]', ''),
  f = gsub("^.","",data)
  )
# Unit: milliseconds
# expr       min        lq      mean    median        uq       max neval
#    a  2.835013  2.849838  2.933796  2.857393  2.900301  4.446956   100
#    b  4.728632  4.739751  4.788882  4.754861  4.795203  5.200185   100
#    c  7.388025  7.413684  7.503427  7.458444  7.555520  8.160925   100
#    d  7.390876  7.412686  7.530044  7.454453  7.533568  8.535544   100
#    e 12.019154 12.205608 12.430870 12.316084 12.581081 13.917336   100
#    f 15.712882 15.735975 15.875353 15.770043 15.861275 18.906262   100

score 3 · Answer 4 · answered Aug 15 '18 at 12:53

3

Use fixed = TRUE to match the pattern as a string:

gsub("\"", "", data, fixed = TRUE)

answered Aug 15 '18 at 12:53

neilfws

32,751
5
50
63

allanvc · Answer 5 · 2018-08-15T13:05:04.837

3

@milan was faster : )

An approach with stringr would be

library(stringr)
str_replace(data, '[\"]', '')

edited Aug 15 '18 at 13:05

answered Aug 15 '18 at 12:55

allanvc

1,096
10
23

score 3 · Answer 6 · answered Aug 15 '18 at 13:01

3

This works as well:

gsub("\"", "", data)

answered Aug 15 '18 at 13:01

Lennyy

5,932
2
10
23

score 2 · Answer 7 · answered Aug 15 '18 at 13:02

2

You could also remove the first character, skipping over the backslash headache:

gsub("^.","",data)

answered Aug 15 '18 at 13:02

P1storius

917
5
12

score 0 · Answer 8 · answered Jun 25 '20 at 14:17

0

I use combination of gsub() and noquote()

for (i in data){
   print(gsub('"','',(noquote(i))))
}

answered Jun 25 '20 at 14:17

Purnomo Setyawendha

98
6

Remove double quote \" symbol from string

8 Answers8