I have a Hadoop+Hive+Tez setup from scratch (meaning I deployed it component by component). Hive is set up using Tez as execution engine.
In its current status, Hive can access table on HDFS, but it can not access table stored on MinIO (using s3a
filesystem implementation).
As shows the following screenshot,
when executing
SELECT COUNT(*) FROM s3_table
,
- Tez execution stuck forever
Map 1
always inINITIALIZING
stateMap 1
always has a total count of-1
and pending count of-1
. (why-1
?)
Things already checked:
- Hadoop can access MinIO/S3 without problem. For example,
hdfs dfs -ls s3a://bucketname
works well. - Hive-on-Tez can compute against tables on HDFS, with mappers and reducers generated successfully and quickly.
- Hive-on-MR can compute against tables on MinIO/S3 without problem.
What could be the possible causes for this problem?
Version informations:
- Hadoop 3.2.1
- Hive 3.1.2
- Tez 0.9.2
- MinIO RELEASE.2020-01-25T02-50-51Z