BigQuery says that Approximate aggregate functions are scalable in terms of memory usage and time, but produce approximate results instead of exact results.
I can't find any function like this in drill or hive. With the cluster computing,we can easily get the exact results,why and when we should use this APPROX_FUNC? I also hope someone can tell me what algorithm is used in APPROX_TOP_COUNT?