Questions tagged [tez]

The Apache Tez™ project is aimed at building an application framework which allows for a complex directed-acyclic-graph of tasks for processing data. It is currently built atop Apache Hadoop YARN.

The two main design themes for Tez are:

Empowering end users by:
  • Expressive dataflow definition APIs
  • Flexible Input-Processor-Output runtime model
  • Data type agnostic
  • Simplifying deployment

Execution performance:
  • Performance gains over Map Reduce
  • Optimal resource management
  • Plan reconfiguration at runtime
  • Dynamic physical data flow decisions

For more details, visit https://tez.apache.org/
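
As a rough illustration of the directed-acyclic-graph idea in the description above, the sketch below orders a small graph of tasks so that each one runs only after its inputs. It uses Python's standard `graphlib` module, not any Tez API, and the vertex names are hypothetical, chosen only for the example.

```python
from graphlib import TopologicalSorter

# Hypothetical vertices; each one maps to the set of vertices it reads from.
dag = {
    "scan_orders": set(),
    "scan_users": set(),
    "join": {"scan_orders", "scan_users"},
    "aggregate": {"join"},
}


def run_order(graph):
    """Return one valid execution order: every vertex comes after all of its inputs."""
    return list(TopologicalSorter(graph).static_order())


order = run_order(dag)
print(order)
```

The two scans have no dependency on each other, so their relative order is not fixed; only the constraint that "join" follows both and "aggregate" follows "join" is guaranteed.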

94 questions
1 vote, 0 answers

Using Hive to distribute over Reducers?

The most frustrating part about this problem is that the obvious answer is "fix the source table!", which unfortunately I cannot do (it is managed and maintained by another team at work that refuses to help). So I'm looking for a technical…
Josh
1 vote, 1 answer

Can't execute MapReduce Job from Hive over Tez

I'm facing an issue with Hive over Tez. I can select from a table that exists in Hive without any issue: SELECT * FROM Transactions; But when I try to use aggregate functions on this table, or count (*), like: SELECT COUNT(*) FROM Transactions; I'm facing…
Mohamed El-Touny
1 vote, 1 answer

How does container reuse work in Apache Tez? What data is stored in the shared location during reuse?

When Apache Tez reuses containers, what process takes place? Can anyone explain it clearly?
Ramprakash
1 vote, 0 answers

Are Apache Phoenix JARs causing error in Tez?

I've tried implementing the Phoenix Storage Handler in Hive according to the documentation here, which as a first step involves adding JAR files to HIVE_AUX_JARS_PATH. I added phoenix-hive.jar and phoenix-client.jar to the path and the…
TheElysian
1 vote, 0 answers

Error while running a simple count(*) using Hive2

I get Exception in thread "main" java.sql.SQLException: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.tez.TezTask at…
Saurab
1 vote, 1 answer

Parquet Warning Filling up Logs in Hive MapReduce on Amazon EMR

I am running a custom UDAF on a table stored as Parquet in Hive on Tez. Our Hive jobs run on YARN, all set up in Amazon EMR. However, because the Parquet data we have was generated with an older version of Parquet (1.5), I am…
Shruf
1 vote, 1 answer

Hive2 with Tez gives Execution Error

I am using Hive2 with Tez. When I run the query, it gives the execution error shown below. FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.tez.TezTask ERROR [432a4475-d246-4596-ad4c-54de6fea86c8 main] exec.Task: Failed…
Sahil Desai
1 vote, 1 answer

Can we insert into an external table?

I am debugging Big Data code in my company's production environment. Hive returns the following error: Exception: org.apache.hadoop.hive.ql.lockmgr.LockException: No record of lock could be found, may have timed out Killing DAG... Execution has…
mttb12
1 vote, 1 answer

Configure Apache Hive for LLAP without using slider

There's a new feature in Hive called LLAP. While investigating it, I found that LLAP is quite difficult to configure, so there's a component called Apache Slider that will configure it. Still, I couldn't find any documentation for manual…
Vlad Gudikov
1 vote, 0 answers

Hive 2.1.0 Issue org.apache.hadoop.hive.ql.metadata.HiveException: Unable to move source

I am using Hadoop 2.7.2 and Hive 2.1.0. I had enabled Tez but reverted to MR as the execution engine, and I am facing the error below while trying to run an INSERT ... SELECT query on ORC and Parquet tables: org.apache.hadoop.hive.ql.metadata.HiveException:…
1 vote, 2 answers

Hive partitioning not working with dynamic variable

If I run set hivevar:a = 1; select * from t1 where partition_variable=${a}; Hive only pulls in the records from the appropriate partition. Alternatively, if I run set hivevar:b = 6; set hivevar:c = 5; set hivevar:a = ${b}-${c}; select * from t1 where…
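
One way to reason about the second case is to assume that variable substitution is plain text replacement with no arithmetic, so that `a` holds the text `6-5` rather than the number 1. The sketch below models only that assumption in Python; `substitute` is a hypothetical helper, not Hive code, and it reuses the query from the first case because the second one is truncated in the excerpt.

```python
import re


def substitute(text, variables):
    """Replace each ${name} with the stored text of that variable, without evaluating it."""
    return re.sub(r"\$\{(\w+)\}", lambda m: variables[m.group(1)], text)


variables = {"b": "6", "c": "5"}

# Models: set hivevar:a = ${b}-${c};  (assumed to store the text, not the result)
variables["a"] = substitute("${b}-${c}", variables)

# Query taken from the first case in the excerpt.
query = substitute("select * from t1 where partition_variable=${a}", variables)
print(query)
```

Under this assumption the filter reaches the planner as an expression instead of a literal, which may be why the partition is not selected the same way; that part is a guess and would need to be confirmed against the query plan.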
1 vote, 1 answer

Hive : Save query result in HDFS and load into a new table

I have a Hive table with a three-part partition (dt, service_type, pv). I want to create a new table in a different database with filtered data from this original table (filtered by the columns used for partitioning). So I query data using SELECT as follows…
Ashika Umanga Umagiliya
1 vote, 0 answers

Hive on Tez engine

Currently in our production environment we use Hive on Tez instead of the MapReduce engine, so I wanted to ask: will all the Hive optimizations for joins be relevant for Tez as well? For example, for multi-table joins it was mentioned that if join key…
Abid
1 vote, 1 answer

Bucketing not working in hive

I have bucketed columns, and even after setting all the parameters I am not getting any performance benefit. Below are the query I am using and the buckets I have created; I have added the explain plan results as well. select count(*) from bigtable_main a…
1 vote, 2 answers

How to set queue name for Pig on Tez?

How do I set a queue name from the command line when running Pig on Tez? I would like to run a Pig script from the command line such as: pig -useHCatalog -p INPUT=input_dir \ -p 'OUT_FILE=out_file \ …
verkter