5

i have written an crawler in c++ and i am looking for an distributed data store i found 2 1) hbase 2) hypertable

both are the implementation of Big table and i just reviewed hypertable.. because it was implemented in c++ and..hbase is most widely used my question is way we need hadoop...is there is any benifits if i run hypertable on top of hadoop?

ygaradon
  • 2,198
  • 2
  • 21
  • 27
ramya
  • 51
  • 2
  • i think your main question is: In terms of performance where hypertable function better(on plain file system or on hadoop)? – ygaradon Jun 04 '12 at 20:36

2 Answers2

4

It's not required to use Hadoop. Hypertable only requires to be ran on top of distributed filesystem. So, you can choose between Hadoop DFS, KosmosFS, GlusterFS, Ceph, and maybe some others (like MooseFS or Lustre).

Common benefits are determined by application requirements and existent infrastructure. From a troubleshooting point of view, community (especially Hadoop's) size is also the great factor.

Lev Khomich
  • 2,247
  • 14
  • 18
0

You can also look into cassandra (http://cassandra.apache.org/) another implementation somewhat similar to Big Table.

Community
  • 1
  • 1
Thejaswi R
  • 1,707
  • 1
  • 10
  • 8