R cumunique like cumsum

Question

I would like a function that works like cumsum but, rather than adding values up, counts the number of unique values seen so far. I could write a loop for each potential set, but that seems like it could get time-consuming, as my dataset has millions of observations.

Example:

a <- c(1,3,2,4,1,5,2,3)
f(a)
[1] 1 2 3 4 4 5 5 5

score 10 · Accepted Answer · answered Feb 26 '16 at 06:51


You can try:

cumsum(!duplicated(a))
#[1] 1 2 3 4 4 5 5 5
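
To see why this one-liner works, here is a minimal sketch that spells out the intermediate step. The wrapper name `cumunique` and the grouping vector `g` are hypothetical and are not part of the original answer; they only illustrate how the same idea might be reused per group with base R's `ave`.

```r
a <- c(1, 3, 2, 4, 1, 5, 2, 3)

# duplicated() flags every element that already appeared earlier in the vector,
# so its negation is TRUE exactly at the first occurrence of each value
first_seen <- !duplicated(a)
first_seen
#[1]  TRUE  TRUE  TRUE  TRUE FALSE  TRUE FALSE FALSE

# cumsum() treats TRUE as 1 and FALSE as 0, giving a running count of distinct values
cumunique <- function(x) cumsum(!duplicated(x))  # hypothetical helper name
cumunique(a)
#[1] 1 2 3 4 4 5 5 5

# Assumed use case: a separate running count within each group
g <- c("x", "x", "y", "y", "x", "y", "x", "y")   # hypothetical grouping vector
by_group <- ave(a, g, FUN = cumunique)
by_group
#[1] 1 2 1 2 2 3 3 4
```

Both `duplicated` and `cumsum` are vectorised, so this avoids an explicit R-level loop over the observations.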


nicola


akrun · Answer 2 · 2016-02-26T07:28:43.737

score 2

We can try

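library(zoo)
b <- a                                    # work on a copy so `a` stays intact for the options below
b[duplicated(b)] <- NA                    # blank out repeated values
b[!is.na(b)] <- seq_along(b[!is.na(b)])   # number the first occurrences 1, 2, 3, ...
na.locf(b)                                # carry the last count forward over the NAs
#[1] 1 2 3 4 4 5 5 5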

Or another option is

cumsum(ave(a, a, FUN=seq_along)==1)
#[1] 1 2 3 4 4 5 5 5
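
As a rough illustration of what the `ave` call is doing, the sketch below breaks it into two steps. The variable name `occurrence` is introduced here only for illustration.

```r
a <- c(1, 3, 2, 4, 1, 5, 2, 3)

# Grouping `a` by its own values, seq_along() numbers each value's occurrences:
# 1 for the first time a value is seen, 2 for the second, and so on
occurrence <- ave(a, a, FUN = seq_along)
occurrence
#[1] 1 1 1 1 2 1 2 2

# Counting the positions where the occurrence number is 1 gives the
# running number of distinct values
cumsum(occurrence == 1)
#[1] 1 2 3 4 4 5 5 5
```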

Or a compact option would be

library(splitstackshape)
getanID(a)[, cumsum(.id==1)]
#[1] 1 2 3 4 4 5 5 5

edited Feb 26 '16 at 07:28

answered Feb 26 '16 at 06:52
