0

Hey mates I am trying to implement an association rule mining in a vertica db: So far I got the heavy lifting done by finding the frequent itemsets however I still need to generate the rules. Here is an example:

Initial table:

+--------+-------+-------+
| item1  | item2 | item3 |
+--------+-------+-------+
| A      | B     | C     |
+--------+-------+-------+

With 3 items I can generate 6 rules, a rule has a rule body and a rule head. This would result in the following table:

+---------+---------+--------+--------+
| ItemB1  | ItemB2  | ItemH1 | ItemH2 |
+---------+---------+--------+--------+
| A       | Null    | B      | C      |
| B       | Null    | A      | C      |
| C       | Null    | A      | B      |
| A       | B       | C      | Null   |
| A       | C       | B      | Null   |
| B       | C       | A      | Null   |
+---------+---------+--------+--------+

This is a very abstract example, in reality there is more than one item set and more than 3 items in one item set. I found some papers that discuss association rule mining and claim that this part is trivial. Guess not so trivial to me.

Thanks

valenzio
  • 773
  • 2
  • 9
  • 21

0 Answers0