Questions tagged [apache-hive]

Apache Hive supports analysis of large datasets stored in Hadoop's HDFS and compatible file systems such as Amazon S3 filesystem. It provides an SQL-like language called HiveQL with schema on read and transparently converts queries to map/reduce, Apache Tez[7] and Spark jobs. All three execution engines can run in Hadoop YARN. To accelerate queries, it provides indexes, including bitmap indexes.

Few features:-

1.Indexing to provide acceleration, index type including compaction and Bitmap index as of 0.10, more index types are planned. 2.Different storage types such as plain text, RCFile, HBase, ORC, and others. 3.Metadata storage in an RDBMS, significantly reducing the time to perform semantic checks during query execution. 4.Operating on compressed data stored into the Hadoop ecosystem using algorithms including DEFLATE, BWT, snappy, etc. 5.Built-in user defined functions (UDFs) to manipulate dates, strings, and other data-mining tools. Hive supports extending the UDF set to handle use-cases not supported by built-in functions. 6.SQL-like queries (HiveQL), which are implicitly converted into MapReduce or Tez, or Spark jobs.

96 questions
0
votes
1 answer

Sqoop incremental loading into partitioned hive table

How to load incremental data into a partitioned hive table I have table "users" with the following columns, I have created hive partition based on created_on field id bigint, name string, created_on string(yyyy-MM-dd), updated_on string I have…
0
votes
0 answers

scope of a variable of bag in PIG

I am beginner in PIG stuck on this problem I have loaded a hive table using HCat storer in pig , which has a field current_hour: long delta = LOAD 'alpha.table1' USING org.apache.hive.hcatalog.pig.HCatLoader(); now for some other calulation on a…
Vishal
  • 1,442
  • 3
  • 29
  • 48
0
votes
2 answers

Config hive with sentry: NullPointerException privilegeObj is null for

I've followed the guide step by step. But error occured when I tried to grant privilege to the role: 0: jdbc:hive2://localhost:10000/> GRANT ALL ON SERVER server1 TO ROLE admin_role; Error: Error while compiling statement: FAILED:…
kiwi
  • 487
  • 1
  • 4
  • 16
0
votes
2 answers

Hive Execution Error

I am new to avro and hive and while learning it i got some confusion. Using tblproperties('avro.schema.url'='somewhereinHDFS/categories.avsc'). If I run this create command like create table categories (id Int , dep_Id Int , name String) stored…
Anaadih.pradeep
  • 2,453
  • 4
  • 18
  • 25
0
votes
1 answer

unable to see the databases when using the hive as a metastore

In order to access the hive through the Apache Drill, I have started the metastore as a service. In order to use it as a metastore in the hive-site.xml, I have set the hive.metastore.uris parameter to thrift://localhost:9083. After the metastore…
Avinash
  • 127
  • 2
  • 13
0
votes
1 answer

Adding file to hive partition

Until hive 0.13 version I was able to add a file to a particular partition of a hive table using the below command : alter table table add partition (year=2016,month=07,day=25,file_part=1) location…
K S Nidhin
  • 2,622
  • 2
  • 22
  • 44
0
votes
0 answers

Hive GenericUDF Java code template required for function accepting a string and returning Map

I am trying to write GenericUDF for Hive. When I add JAR and try to create a temporary function pointing to the class, I get an error, so function creation does not succeed. Can someone provide Java code template for GenericUDF function that accepts…
Dhiraj
  • 3,396
  • 4
  • 41
  • 80
0
votes
1 answer

from string to map object in Hive

My input is a string that can contain any characters from A to Z (no duplicates, so maximum 26 characters it may have). For example:- set Input='ATK'; The characters within the string can appear in any order. Now I want to create a map object out…
Dhiraj
  • 3,396
  • 4
  • 41
  • 80
0
votes
0 answers

Casting string to bigint in hive returns incomplete data

How do I get the full value in hive? i.e. cast('0024' as bigint) will give only 24 but I need the complete 0024 as my output. Any idea how to do this?
0
votes
1 answer

Insert data in a case statement in Hive

Is it possible to insert a new row of data in a case statement in Hive. I have a basic 'team' table, with the following fields (team_id,fname,lname). This is what I am trying to run, SELECT team_id,fname,lname, CASE WHEN team_id = 2 THEN insert…
Frostie_the_snowman
  • 629
  • 3
  • 9
  • 17
0
votes
1 answer

What is the limit of BINARY data types in Hive 1.2?

I did not find much about BINARY data types in apache docs: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Types I created a table with BINARY column using- create table table1(col1 binary); After fetching metadata via JDBC I…
Dev
  • 13,492
  • 19
  • 81
  • 174
0
votes
1 answer

external tables in Hive

I added a CSV file in HDFS using R script. I update this CSV with new CSV/append data to it Created table using hue in Hive over this CSV. Altered it to be an external table. Now, if when data is changed in the hdfs location, would data be…
systemdebt
  • 4,589
  • 10
  • 55
  • 116
0
votes
0 answers

Hive: Execuation error while execuating a query on master node in distributed mode

hive> select * from twitter_data; Query ID = hdumer_20160523203816_0c2b2640-f711-41b9-81fd-5ee5d2af900a Total jobs = 1 Launching Job 1 out of 1 Number of reduce tasks is set to 0 since there's no reduce operator Job running…
umer safeer
  • 69
  • 2
  • 16
0
votes
2 answers

Hive Joins query

I have two tables in hive: Table 1: 1,Nail,maher,24,6.2 2,finn,egan,23,5.9 3,Hadm,Sha,28,6.0 4,bob,hope,55,7.2 Table 2…
Frostie_the_snowman
  • 629
  • 3
  • 9
  • 17
0
votes
1 answer

Apache hive Windowing functions lag and lead error

I am trying to select lead and lag value from a column using LAG and LEAD functions. Below are table and data. hive> select * from foo; OK a 1 2 3 4 5 6 7 8 9 Time taken: 0.923 seconds When I…
Wanderer
  • 447
  • 3
  • 11
  • 20