Aggregations With Geomesa

Question

Does geomesa support aggregation functions?. Like 1.

Calculating matching feature COUNT for the given query.
Max/Min of an attribute for specified query collections.
Avg of an attribute value for specified query collections.

I tried with geotools aggregate functions but they are not getting updated for new records being inserted. I also tried using offset for the feature but It's not working(as was suggested by one of the developers that geomesa doesn't support offsets). What can be done to achieve such operations?

score 1 · Accepted Answer · answered May 15 '17 at 11:11

1

You could use the GeoMesa Spark formats, then use Spark built-in summary statisitics. See these spark mllib doc.

Also for version 1.3.2 it looks like more summary statistics are being added server-side. See this commit. That will be more efficient than Spark.

answered May 15 '17 at 11:11

vpipkt

1,710
14
17

Ok, I am new to spark, but would like to know why you mentioned summary statistics will be more efficient than spark. – Suresh Prajapati May 19 '17 at 07:28
Because they are accumulo iterators they execute on the data in place in the tablet servers to the greatest possible extent. A well managed spark job should have good localization too, but spark doesn't guarantee that. A lot will depend on how spark is deployed and configured. So more efficient from a Dev/devops perspective to use the stats iterators. – vpipkt May 19 '17 at 09:42

Aggregations With Geomesa

1 Answers1