Questions tagged [spark-hive]

Use for questions about the spark-hive module or HiveContext

Apache Spark Hive is a module for "Hive and structured data processing" on Spark, a fast and general-purpose cluster computing system. It is a superset of Spark SQL and is used to create a HiveContext, analogous to SQLContext.
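For orientation, a minimal sketch of creating a HiveContext in Spark 1.x (the app name and query are illustrative):

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.hive.HiveContext

val sc = new SparkContext(new SparkConf().setAppName("spark-hive-example"))
// HiveContext extends SQLContext, adding HiveQL, Hive UDFs, and metastore access.
val hiveContext = new HiveContext(sc)
hiveContext.sql("SHOW TABLES").show()
```

In Spark 2.x and later, the same capability is reached through SparkSession.builder().enableHiveSupport().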

76 questions
3
votes
2 answers

Running Hive Query in Spark through Oozie 4.1.0.3

Getting a "table not found" exception while running a Hive query in Spark through Oozie 4.1.0.3 as a Java action. Copied hive-site.xml and hive-default.xml from the HDFS path; workflow.xml used:
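A likely cause in this setup is the Spark action not seeing hive-site.xml, so the HiveContext falls back to a local Derby metastore where the table does not exist. A minimal sketch of the job's entry point, with all names hypothetical:

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.hive.HiveContext

object OozieHiveQuery {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("oozie-hive-query"))
    // hive-site.xml must be on the driver classpath (e.g. shipped via a
    // <file> element in the Oozie action) so the real metastore URI is used.
    val hiveContext = new HiveContext(sc)
    hiveContext.sql("SELECT COUNT(*) FROM some_db.some_table").show()
  }
}
```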
2
votes
1 answer

Dbeaver Exception: Data Source was invalid

I am trying to work with DBeaver, processing data via Spark Hive. The connection is stable, as the following command works: select * from database.table limit 100. However, as soon as I deviate from that simple fetch query I get an exception.…
Lazloo Xp
  • 858
  • 1
  • 11
  • 36
2
votes
1 answer

Spark Streaming + Hive

We are in the process of building an application that takes data from a source system through Flume and then, via the Kafka messaging system, into Spark Streaming for in-memory processing; after shaping the data into a data frame we will put it into Hive… (a sketch of this pipeline follows this entry)
Owais Ajaz
  • 244
  • 5
  • 20
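A minimal sketch of the Kafka-to-Hive leg of such a pipeline, assuming Spark 1.6-era APIs (the spark-streaming-kafka module), a hypothetical events topic, and an existing Hive table of the same name:

```scala
import kafka.serializer.StringDecoder
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SaveMode
import org.apache.spark.sql.hive.HiveContext
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka.KafkaUtils

val sc  = new SparkContext(new SparkConf().setAppName("kafka-to-hive"))
val ssc = new StreamingContext(sc, Seconds(30))

val stream = KafkaUtils.createDirectStream[String, String, StringDecoder, StringDecoder](
  ssc, Map("metadata.broker.list" -> "broker:9092"), Set("events"))

stream.foreachRDD { rdd =>
  // In production you would reuse a singleton context rather than rebuild it per batch.
  val hiveContext = new HiveContext(rdd.sparkContext)
  import hiveContext.implicits._
  val df = rdd.map(_._2).toDF("raw_event") // Kafka message values as one string column
  df.write.mode(SaveMode.Append).insertInto("events")
}

ssc.start()
ssc.awaitTermination()
```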
2
votes
1 answer

Errors during maven install when adding spark-hive_2.10 dependency in maven

I am using Scala IDE 4.6.0 and created a Maven project using an archetype I got from the book Spark in Action. I have to use Scala 2.10.4 and Spark 1.6.2. I created a basic project using this archetype and added the spark-hive dependency to the… (a sample dependency block follows this entry)
jam_ab
  • 71
  • 1
  • 3
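For reference, the Maven coordinates for that Scala/Spark combination would look like this; the provided scope is an assumption (usual when the cluster supplies the Spark jars):

```xml
<dependency>
  <groupId>org.apache.spark</groupId>
  <artifactId>spark-hive_2.10</artifactId>
  <version>1.6.2</version>
  <scope>provided</scope>
</dependency>
```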
2
votes
1 answer

using HiveContext in spark sql throws an exception

I have to use HiveContext instead of SQLContext because I use some window functions that are available only through HiveContext. I have added the following lines to my pom.xml: org.apache.spark… (a sketch follows this entry)
A.B.
  • 51
  • 2
  • 10
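In Spark 1.x, window functions do require a HiveContext rather than a plain SQLContext. A minimal sketch, with the table and column names hypothetical:

```scala
import org.apache.spark.sql.expressions.Window
import org.apache.spark.sql.functions.row_number
import org.apache.spark.sql.hive.HiveContext

val hiveContext = new HiveContext(sc) // sc: an existing SparkContext
val df = hiveContext.table("employees")

// row_number() over a window only resolves with Hive support in Spark 1.x.
val ranked = df.withColumn(
  "rank", row_number().over(Window.partitionBy("dept").orderBy("salary")))
ranked.show()
```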
1
vote
0 answers

Does REFRESH TABLE update the cache entries of all tables?

I am looking for an approach to update all the table metadata cache entries just before the write operation. I have found spark.catalog.refreshTable(table); however, I am not sure whether it will update every table's metadata store… (a sketch follows this entry)
izhad
  • 19
  • 3
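refreshTable invalidates the cached metadata and data for the one table named, not for the whole catalog; covering every table means iterating the catalog yourself. A sketch assuming Spark 2.x+ and a hypothetical database name:

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().enableHiveSupport().getOrCreate()

// Refreshes the cache entry for a single table only.
spark.catalog.refreshTable("mydb.events")

// Hypothetical loop to refresh every table in one database before writing:
spark.catalog.listTables("mydb").collect().foreach { t =>
  spark.catalog.refreshTable(s"${t.database}.${t.name}")
}
```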
1
vote
0 answers

pyspark Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient

New to Spark; I tried other solutions from Stack Overflow but no luck. I have installed Spark 3.1.2 and did some configuration (in spark/conf/spark-defaults.conf) to point to AWS RDS MySQL as a remote metastore: spark.jars.packages…
user1531248
  • 521
  • 1
  • 5
  • 17
1
vote
1 answer

Exception in Connecting Dataproc Hive Server using Java and Spark Eclipse

I am trying to access the Hive server present in GCP Dataproc from my local machine (Eclipse) using Java and Spark, but I am getting the below error while starting the application. I tried to find the problem but was unable to solve it. Exception in…
1
vote
1 answer

Spark saveAsTable with location at s3 bucket's root cause NullPointerException

I am working with Spark 3.0.1 and my partitioned table is stored in S3. Please find here the description of the issue. Create Table: Create table root_table_test_spark_3_0_1 ( id string, name string ) USING PARQUET PARTITIONED BY… (a sketch follows this entry)
Michael
  • 33
  • 5
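A sketch of the write pattern the title describes, assuming Spark 3.0.1, a hypothetical data frame df with id and name columns, and a bucket root as the table location:

```scala
// Writing a partitioned table whose location is the bare bucket root
// ("s3a://my-bucket/", no key prefix), per the question title.
df.write
  .partitionBy("name")
  .option("path", "s3a://my-bucket/")
  .saveAsTable("root_table_test_spark_3_0_1")
```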
1
vote
2 answers

Result-set inconsistency between hive and hive-llap

We are using Hive 3.1.x clusters on HDI 4.0, one being LLAP and the other just Hive. We've created managed tables on both clusters with the row count being 272409. Before the merge on both…
Vinay K L
  • 45
  • 1
  • 10
1
vote
0 answers

Spark stand-alone v 2.3.2 Failing test

I have built Spark v2.3.2 on a big-endian platform using AdoptOpenJDK 1.8. The build is successful, but we encounter test case failures in the following module. I wanted some information related to this failing test: how severely would this…
1
vote
1 answer

External table is empty when ORC data is saved

I want to write ORC data into an external Hive table from a Spark data frame. When I save the data frame as a table, the data is sent to the existing external table; however, when I try to save the data in ORC format into the directory and then read… (a sketch follows this entry)
Cassie
  • 2,941
  • 8
  • 44
  • 92
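A minimal sketch of the two approaches being contrasted, assuming Spark 2.x with Hive support and hypothetical paths and names:

```scala
// Approach 1: append directly into the existing external table (works per the question).
df.write.mode("append").format("orc").saveAsTable("ext_orc_table")

// Approach 2: write ORC files under the table's location, then tell the
// metastore to look again; without a refresh the table can appear empty.
df.write.mode("overwrite").orc("hdfs:///warehouse/ext_orc_table")
spark.sql("REFRESH TABLE ext_orc_table")
// For a partitioned external table, new partitions also need registering:
// spark.sql("MSCK REPAIR TABLE ext_orc_table")
```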
1
vote
0 answers

Not able to read Hive table using sparkR submit

Here is my code: sc <- sparkR.init(master = "local[*]", sparkEnvir = list(spark.driver.memory="8g")) hiveContext <- sparkRHive.init(sc) sqlQuery <- "SELECT * from table ABC" joinSQL <- sql(hiveContext, sqlQuery) This gives the error…
Manoj
  • 11
  • 3
1
vote
0 answers

How to read snappy compressed sequence File in spark

We have huge legacy files sitting in our Hadoop cluster in compressed sequence file format. The sequence files were created by a Hive ETL. Let's say I have a table in Hive created using the following DDL: CREATE TABLE sequence_table( col1…
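A sketch of reading such files directly with the core API, assuming the usual layout of a Hive SEQUENCEFILE table (the key is ignorable and the delimited row text sits in the value); the path is hypothetical:

```scala
import org.apache.hadoop.io.{BytesWritable, Text}

// Snappy decompression is handled transparently by the Hadoop input format.
val rows = sc.sequenceFile[BytesWritable, Text]("hdfs:///warehouse/sequence_table")
  .map { case (_, value) => value.toString.split('\u0001') } // default Hive field delimiter
```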
1
vote
1 answer

How can I use an SQL subquery within Spark 1.6

How can I convert the following query to be compatible with Spark 1.6, which does not support subqueries: SELECT ne.device_id, sp.device_hostname FROM `table1` ne INNER JOIN `table2` sp ON sp.device_hostname = (SELECT… (a sketch of the general rewrite follows this entry)
user6666914
  • 31
  • 1
  • 6
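Since the scalar subquery in the excerpt is cut off, only the general shape of the rewrite can be shown: Spark 1.6 cannot evaluate a subquery inside a join condition, so it is pre-computed as a derived table and joined on equality (the inner SELECT and the join key below are placeholders):

```scala
val result = hiveContext.sql("""
  SELECT ne.device_id, sp.device_hostname
  FROM table1 ne
  INNER JOIN (
    SELECT t2.device_hostname
    FROM table2 t2
    -- conditions from the original scalar subquery would go here
  ) sp
    ON sp.device_hostname = ne.device_id
""")
```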