I'm working with graphs and jung seems to work fine so far in my test environment(its a few thousand nodes), but the real data will create several million nodes every day(its a directed graph so we may be able to sperate the nodes by days so we don't have to download the entire graph). We will add a billion+ nodes every month and plan to use hbase/hadoop as our backend but not sure how I can get jung(or any graph library) to connect to it.
Is there a way to have jung use a database to read/write to so I can have some type of persistent storage that I can selectively download from?