I have a very large data set, close to 500 million edges in which almost all edges need to be traversed. I'm trying to parallelize these traversals by trying to paginate on IDS. My strategy was to try and paginate by ID which is an MD5 hash. I tried queries like the following:
g.E().hasLabel('foo').has(id, TextP.startingWith('AAA'))
for page 1
g.E().hasLabel('foo').has(id, TextP.startingWith('AAB'))
for page 2
But each query seems to be doing a full scan and not just a subset. How do you recommend I go about pagination?