I can't seem to count the frequency of my itemsets

Question

I have transaction data and I'm trying to get a count of all the possible combinations. The problem is that it seems to overcount my combinations. For example, given the following item sets:

A {1,2,3}

B {1,2,3,4}

If I want to count the number of times that {1,2,3} occurs together, the result is a count of 2, not 1 as I want.

I created the dummy data below as an example:

library(arules)  # provides apriori() and inspect()

t1 <- data.frame(ID = c("A", "A", "A", "B", "B", "B", "B"), num = c(1, 2, 3, 1, 2, 3, 4))
# one vector of items per transaction ID
transactions <- split(t1[, "num"], t1[, "ID"])
test <- apriori(transactions, parameter = list(support = .0000000001, minlen = 3, maxlen = 3, target = 'frequent'))
inspect(test)

For this example, I'm expecting {1,2,3} to have a count of 1 (the number of times that just 1, 2, 3 are purchased together), but I'm not sure why it's giving me all the other numbers.

score 0 · Answer 1 · answered Sep 10 '19 at 23:57

Here is one way using base R: collapse the num column into one comma-separated string per ID, then count how often each string occurs using table().

df1 <- aggregate(num~ID, t1, function(x) toString(unique(x)))
table(df1$num)

#   1, 2, 3 1, 2, 3, 4 
#         1          1

We can also use dplyr to do this:

library(dplyr)

t1 %>%
  group_by(ID) %>%
  summarise(num = toString(unique(num))) %>%
  count(num)
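
One caveat with both snippets: toString(unique(x)) keeps the items in the order they appear, so the same set recorded in a different order would be counted as a separate string. A minimal sketch, assuming the order of items within a transaction carries no meaning, sorts the items before collapsing them. The t2 data and the key and exact_counts names are made up for illustration and are not part of the original question.

```r
# Hypothetical data: A and C hold the same items in a different order
t2 <- data.frame(ID  = c("A", "A", "A", "B", "B", "B", "B", "C", "C", "C"),
                 num = c(1, 2, 3, 1, 2, 3, 4, 3, 1, 2))

# Sort the unique items per ID so that each set maps to one canonical string
key <- tapply(t2$num, t2$ID, function(x) toString(sort(unique(x))))

# Count how many IDs share each exact set
exact_counts <- table(key)
exact_counts
```

With this data, "1, 2, 3" should be counted for both A and C, while "1, 2, 3, 4" is counted once for B.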

score 0 · Answer 2 · answered Sep 11 '19 at 14:19

Frequent itemset mining counts subsets. {1,2,3} is a subset of both transactions, and that is why you get a count of 2. It seems like you want to count exact matches instead, which is a different task.
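
The difference between the two kinds of count can be sketched in base R without arules. This is only an illustration using the question's data; the target, subset_count and exact_count names are assumptions introduced here.

```r
# Transactions from the question, as a named list of item vectors
t1 <- data.frame(ID  = c("A", "A", "A", "B", "B", "B", "B"),
                 num = c(1, 2, 3, 1, 2, 3, 4))
transactions <- split(t1$num, t1$ID)
target <- c(1, 2, 3)

# Support-style count: transactions that CONTAIN the itemset as a subset
subset_count <- sum(sapply(transactions, function(tr) all(target %in% tr)))

# Exact count: transactions that consist of exactly these items
exact_count <- sum(sapply(transactions, function(tr) setequal(tr, target)))

c(subset = subset_count, exact = exact_count)
```

Here the subset count is 2 (both A and B contain 1, 2 and 3) and the exact count is 1 (only A is exactly that set). The minlen and maxlen parameters restrict the size of the itemsets that are reported, not the size of the transactions they are counted in.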

Michael Hahsler

I was expecting to only count the sets that have exactly 3 items since I set my min and max to 3. Is that not how this is designed to work? – semidevil Sep 11 '19 at 18:20
No. If you just want to count, then you can do what Ronak suggests. – Michael Hahsler Oct 07 '19 at 20:08
