Questions tagged [pattern-mining]
31 questions
0
votes
0 answers
Frequent pattern mining on a large dataset with respect to columns (data dimension= 23 x 305269) always result in a dead kernel error
I'm trying to apply frequent pattern mining(FPM) algorithms on a biological data,where rows represent samples, and columns represent SNPs(location,position), i'm working on jupyter notebook:
first i imported necessary packages:
import pandas as…

Phoebe Magdy
- 1
- 1
0
votes
1 answer
Sequential Pattern Mining Using Python
I have a dataset
Name Subset Type System
A00 9-IU00-A OP A
A00 IT00 PP A
B01 IT-01A PP B
B01 IU OP B
B03 IM-09-B LP A
B03 IM03A OP B
B03 IT-09 OP A
D09…

spd
- 334
- 1
- 12
0
votes
1 answer
Can association mining deal with non-linear relationships?
For an association rule x->y, When the lift is 1, x and y are independent. If lift is lesser than 1, then both are negatively correlated else they are positively corrrelated. What if they have non-linear correlation? Would the association…

priya
- 375
- 5
- 22
0
votes
1 answer
PRXMATCH not working with PRXPARSE function in SAS
I have comments with multiple ids which I need to pull from comments. Each I’d in separate column is required.
Input data has 2 columns- comment_id & Comment(it has 1 or more IDs)
Desired output should have 2 columns: comment_id & ID
I am using…

Aman
- 1
- 4
0
votes
1 answer
Recursion in FP-Growth Algorithm
I am trying to implement FP-Growth (frequent pattern mining) algorithm in Java. I have built the tree, but have difficulties with conditional FP tree construction; I do not understand what recursive function should do. Given a list of frequent items…

Helen Grey
- 439
- 6
- 16
0
votes
1 answer
Candidate set generation in Apriori algorithm
I am trying to implement Apriori algorithm in Java, and have problems with generating Candidate itemsets. To create candidates for k-itemset I use all combinations of k-1 and 1-itemsets. For example, for
Frequent 1-itemset:
bread:9, milk:9,…

Helen Grey
- 439
- 6
- 16
0
votes
1 answer
Is there a way to use found sequential patterns as input for a clustering algorithm
I'm doing a project on categorizing users based on their surfing patterns on a site.
For this I need to find patterns in the data and then cluster them, but the clustering is a problem, since the clustering algorithms I tried (k-means, agglomerative…

Ben Blanc
- 63
- 7
0
votes
1 answer
sequential pattern or itemset fp tree
FP-growth algorithms are used for Itemset Mining. Is there a way to use these algorithms for Sequential Pattern Mining instead of Itemset Mining?

Sweet
- 11
0
votes
0 answers
Frequent Itemset Mining
I have a questionary result of a population
each person provide an answer to each question
each person has some attributes like age, sex, race, birthplace, etc
how can I cluster persons based on their answers and make learning methods for that
for…

Illidin
- 11
0
votes
1 answer
PrefixSpan sequence extraction misunderstanding
I have a set of tuples of size three in a list that represent windowed sequences.
What I need is using pyspask to be able to get (given the two first parts of the tuple) the third one.
So I need it to create sequences of three elements based on…

Kratos
- 1,064
- 4
- 20
- 39
0
votes
1 answer
Spark Mllib - Frequent Pattern Mining - Association Rules - Not getting the expected results
I've the following dataset:
[A,D]
[C,A,B]
[A]
[A,E,D]
[B,D]
And I am trying to extract some association rules using Frequent Pattern Mining using Spark Mllib. For that I've the following code:
val transactions =…

John_Rodgers
- 181
- 1
- 11
0
votes
1 answer
Pattern recognition in numbers?
Consider the input file be
25 27 29 25 27 29 25 27 29 25 27 29 25 27 28
I want to extract the pattern 25 27 29. What are the algorithms that I can use to mine the pattern in a sequence like this? I'm willing to implement it using unsupervised…

ThunderPunch
- 483
- 1
- 4
- 16
0
votes
1 answer
Is possible to read collection of strings and return a Regexp?
I have a collection of files from multiple sources.
Each file contains strings like:
File 1: A) B) C) D) E)
File 2: a) b) c) d) e)
File 3: a. b. c. d. e.
File 4: a- b- c- d- e-
(...)
I know I could code all possible patterns beforehand, but I'd…

Victor Ribeiro
- 577
- 7
- 20
0
votes
0 answers
maximum number of states in TraMineR?
I have a data set with the text users wrote in a text field on a website. Due to the nature of the website most users wrote multiple times in the field. Now I want to look if there is a pattern. For instance, users who wrote at some time "A" will in…

Stoeptegel
- 3
- 3
-1
votes
2 answers
How can I delete rows which have two or more words (after each other) in a sequence?
I want to remove the rows which have the same two or more words after each other, like a sequence. This is to do a sequential pattern mining analysis.
I already tried the distinct() and duplicated() function, but this only removes the
whole…

berendpsv
- 3
- 4