0

I'm New to Big data and currently struggling to integrate janusgraph to an existing hadoop cluster. Can you please tell me how to do this or please provide a link to a resource.

D.Rana
  • 1

1 Answers1

2

Since you're just getting started with JanusGraph, you should start with its official documentation located at http://docs.janusgraph.org/latest/

JanusGraph requires a storage backend. There are several supported storage backends. Apache Hadoop, by itself, is not a storage backend for JanusGraph. The Apache HBase storage backend is most likely what you might have available.

Jason Plurad
  • 6,682
  • 2
  • 18
  • 37
  • Hi jason Thank you for the answer. I went through the documentation and i couldn't find a way to form a janusgraph cluster. Does a janusgraph cluster means just a janusgraph instance deployed with access to a hbase or cassandra cluster? Or can i configure a cluster of janusgraph/gremlin servers? – D.Rana Sep 19 '17 at 11:21
  • The storage and index backends are clustered. You can configure multiple Gremlin Servers to connect to the same backends, but they would not be clustered, that is the Gremlin Servers would not be aware of each other. – Jason Plurad Sep 19 '17 at 14:40
  • Thank you Jason. But for the graphs each node should have a unique id and it is generated by janusgraph. So if I run multiple instances of janusgraph pointing to the same backends, won't a data consistency issue arise because each server has no idea what the other server is doing. – D.Rana Sep 20 '17 at 15:42