generating a vector of difference between two vectors

Question

I have two csv files, and each of which consists of one column of data

For instance, vecA.csv is like

id
1
2

vecB.csv is like

id
3
2

I read the data set as follows:

vectorA<-read.table("vecA.csv",sep=",",header=T)
vectorB<-read.table("vecB.csv",sep=",",header=T)

I want to generate a vector consisting of elements belonging to B only.

score 81 · Accepted Answer · edited May 23 '17 at 12:03

81

You are looking for the function setdiff

setdiff(vectorB$id, vectorA$id)

If you did not want this reduced to unique values, you could create a not in function

(kudos to @joran here Match with negation)

'%nin%' <- Negate('%in%')

vectorB$id[vectorB$id %nin% vectorA$id]

edited May 23 '17 at 12:03

Community

1
1

answered Feb 19 '13 at 04:15

mnel

113,303
27
265
254

+111 for `%nin%` !! I think I already have some uses for that one. – N8TRO Feb 19 '13 at 07:44
In Frank Harrell's Hmisc package, there's %nin%. – swihart Oct 24 '14 at 23:26
if a and b are vectors, a[!a %in% b] – Cybernetic Oct 04 '18 at 21:30

score 13 · Answer 2 · edited Oct 29 '14 at 14:26

13

If your vector's are instead data.tables, then all you need are five characters:

B[!A]

library(data.table)

# read in your data, wrap in data.table(..., key="id") 
A <- data.table(read.table("vecA.csv",sep=",",header=T), key="id")
B <- data.table(read.table("vecB.csv",sep=",",header=T), key="id")

# Then this is all you need
B[!A]

[Matthew] And in v1.8.7 it's simpler and faster to read the file as well :

A <- setkey(fread("vecA.csv"), id)
B <- setkey(fread("vecB.csv"), id)
B[!A]

edited Oct 29 '14 at 14:26

swihart

2,648
2
18
42

answered Feb 19 '13 at 07:36

Ricardo Saporta

54,400
17
144
178

4

Very slick. data.table rocks! And it's blazing fast. – N8TRO Feb 19 '13 at 07:40

generating a vector of difference between two vectors

2 Answers2

Linked

Related