approximate algorithm for top-k query in hive?

Asked Aug 21 '12 at 07:28

Active Aug 21 '12 at 07:29

Viewed 376 times

everyone,in hive ,we use

select word,count(*) as cnt from table group by word order by cnt limit N

for top-N query.
As we kown the speed is not fast,i learn about some approximate algorithm for top-k query ,such as countsketch algorithm or another algorithm.
Could we add approximate algorithm to hive for speed up top-k query?

edited Aug 21 '12 at 07:29

amit

175,853
27
231
333

asked Aug 21 '12 at 07:28

user1562158

approximate algorithm for top-k query in hive?

0 Answers0