I'm implementing patterns mining algorithm, and usually input data are file with the following format
item1 item2 item3
item0 item3 item10
....
item30 item40 item30
where usually itemx
is a String
. To be efficient, I used to read the file with ByteString
which is faster than the default String
. Since the great task in patterns mining algorithms is comparison between items sets. I wonder How faster or slower my program will be if I change the input file format in order to make comparison between Int
instead of comparison between ByteString
. Here is the novel format :
1 2 3
0 3 10
....
30 40 30
thanks !