0

Pardon me if this is a silly question.

I have a cloudera manager installed in a single node.

I am trying to use Hbase and Hadoop for logging request and response in my web application.

I am trying to list latest user activity using the log.

Rows are added using the below table structure.

1 Column Family, RowId, 11 columns. I store every value as string. Fairly simple & similar to a mysql table.

RowId
entry:addedTime
entry:value
entry:ip
entry:accessToken
entry:identifier
entry:userId
entry:productId
entry:object
entry:requestHeader
entry:completeDate
entry:tag

Now, in order to get rows from my Hbase, I use

SingleColumnValueFilter("entry", "userId", "=", binary:"25", true, true)

Now, I am struggling to order this by

entry:completeDate DESCENDING

and limit by 25 rows for pagination or infinite scroll.

My question,

Is Hbase the only real time querying database available in Hadoop ecosystem?

Am I using Hbase for wrong reasons? Is my table structure correct?

I work in a startup and these are our baby steps to moving to BigData. Though BigData created lot of hype, the Hadoop is poorly supported for latest linux and looks too complicated.

Any help or suggestions would be appreciated.

Many thanks,

Karthik

JensG
  • 13,148
  • 4
  • 45
  • 55
Karthik
  • 1,091
  • 1
  • 13
  • 36

0 Answers0