I noticed that when MR is involved, even for simply queries like:
select * from table where condition_a and condition_b limit 10;
Hive will scan all the data (sure), but it will only return when all the MR jobs finish, even we only need 10 results.
Is there a way to tell hive just return the first 10 rows, then skip the result of the job?
Note, I am taking about when MR job is involved, not for the simple case where no actual MR job is created.
Thanks!