I have structure data like that ( web visitors )
List(p1,p1,p1,p2,p3,p3,p4,p4,p5...)
one visitor can visit 1 --> many times
data volumes: about 100 milions / day
How about or which db i can store unique visitors to fast access ( near real time ) like that
2014-11-15 | p1 | p2 | p3 | ...| pn
I try to work around by using Cassandra by using table like that :
CREATE TABLE uniqueVisitor (
key text,
p text,
PRIMARY KEY (key, data)
)
I think this store pattern is not work very well because :
Because of partitioning data of this table , All data of a key will store in only one server ( with replicate factor =1 ) == > too many write requests can blow out the server stored this key.
Please suggest me a solution (storage pattern )