Questions tagged [apache-hive]

Apache Hive supports analysis of large datasets stored in Hadoop's HDFS and compatible file systems such as Amazon S3 filesystem. It provides an SQL-like language called HiveQL with schema on read and transparently converts queries to map/reduce, Apache Tez[7] and Spark jobs. All three execution engines can run in Hadoop YARN. To accelerate queries, it provides indexes, including bitmap indexes.

Few features:-

1.Indexing to provide acceleration, index type including compaction and Bitmap index as of 0.10, more index types are planned. 2.Different storage types such as plain text, RCFile, HBase, ORC, and others. 3.Metadata storage in an RDBMS, significantly reducing the time to perform semantic checks during query execution. 4.Operating on compressed data stored into the Hadoop ecosystem using algorithms including DEFLATE, BWT, snappy, etc. 5.Built-in user defined functions (UDFs) to manipulate dates, strings, and other data-mining tools. Hive supports extending the UDF set to handle use-cases not supported by built-in functions. 6.SQL-like queries (HiveQL), which are implicitly converted into MapReduce or Tez, or Spark jobs.

96 questions

votes

1 answer

Apache Hive: Transferring data from existing table to another table

I have multiple existing tables stored in hdfs. I would like to create new tables from the existing external tables so that I can bucket, sort, and compress the data. What is the proper way to create a table from the existing table? I could export…

hadoop apache-hive

asked Jul 13 '15 at 21:43

kira_codes

1,457
13
38

votes

0 answers

Not able to run hive job as a oozie workflow in hue

I am getting the following error when I have configured the simple hive job as a oozie workflow in hue My Hive script file: CREATE EXTERNAL TABLE test (a INT) STORED AS TEXTFILE LOCATION '/user/mapradm/input-data/table'; INSERT OVERWRITE DIRECTORY…

oozie hue apache-hive

asked Jun 05 '15 at 14:35

Rajsekhar Rao Barnala

votes

0 answers

Not able to read hdfs files through pig on pseudo node cluster

I have this very basic test (immediately after installation of both hadoop 2.7 and pig 0.14) the file exists in hdfs - hdfs://master:50070/user/raghav/family 32 hdfs://master:50070/user/raghav/nsedata however, when i run the following,…

hadoop apache-pig hdfs apache-hive

asked May 19 '15 at 20:40

Raghav

2,128
5
27
46

votes

0 answers

Hive Server2 is dead and pid file exists [FAILED]

I've a simple 2 node cloudera CDH5 cluster and in slave machine I've setup Hive & Hue. I'm able to work in hive through CLI viz. creating databases and tables, I cannot configure hive/beeswax in hue and may be the reason is Hive Server2 is dead.…

hadoop hive cloudera cloudera-cdh apache-hive

asked Apr 17 '15 at 03:16

Vitthal

votes

1 answer

Hive Metastore Sql server :thrift.transport.TTransportException: No keytab specified

I am trying to run Hive metastore. But it is throwing error below. org.apache.thrift.transport.TTransportException: No keytab specified** at…

hadoop hive kerberos apache-hive

asked Nov 22 '14 at 15:05

user3858193

1,320
5
18
50

-1

votes

2 answers

Save data as text file from spark to hdfs

I processed data using pySpark and sqlContext using the following query: (sqlContext.sql("select LastUpdate,Count(1) as Count" from temp_t) .rdd.coalesce(1).saveAsTextFile("/apps/hive/warehouse/Count")) It is stored in the following…

apache-spark pyspark apache-spark-sql apache-hive

asked Jul 28 '16 at 05:03

Prasad

Prev 1 2 3 4 5 6