I'm new to the geospatial domain. I've managed to add geomesa-spark-jts
to the project, which lets me use geospatial functions.
I need to go through millions of geocoded events (eventRdd) and, based on custom criteria, check whether each one is within a certain distance of a road segment linestring (roadSegmentRdd).
Currently, for each event I have to scan the entire roadSegmentRdd and check whether the criteria are satisfied, which is not optimal at all.
How can I use GeoMesa and its indexes to make this query faster? What are the minimum dependencies needed?