Questions tagged [pyhive]
107 questions
0
votes
0 answers
How to read non-ACSII characters into jupyterhub from the Hadoop database
I want to read the data(from the Hadoop database) which has characters other than ACSII characters. I am trying to read the data using .py file. I have used
#!/usr/bin/env python
# -*- coding: utf-8 -*-
to specify the encoding.
I have used below…

Raghavendra S
- 47
- 10
0
votes
1 answer
Pyhive Kerberos connection issue
I'm getting following error while connecting to hive database configured with Kerberos using Python pyhive module.
kerberos_service_name should be set if and only if in KERBEROS mode
My Connection code:
import hive from pyhive
con =…

Suraj
- 3
- 4
0
votes
1 answer
Connection Issue While reading Hive table In HDInsight using Python
All. I wanted to connect to Hive Database in HDInsight using Python I followed multiple blog and few Stackoverflow blogs also .but No luck . Below are my tries using pyhive and JayDeBeApi library .
Using JayDeBeApi
I have added hive-jdbc-1.2.1,…

HimanshuSPaul
- 278
- 1
- 4
- 19
0
votes
1 answer
pyspark write to external hive cluster from databricks running on azure cloud
I have pyspark notebooks running in databricks.
I connect to an external hive cluster using 'hive.Connection' from pyhive.
I have my data in spark dataframes.
My question is how do I write this data from dataframes in a new table in Hive which…

user1860447
- 1,316
- 8
- 25
- 46
0
votes
0 answers
Getting row counts throws "Premature EOF from inputStream" Error
I'm having some trouble getting a row count from a temporary hive table. I'm not sure what is actually causing this error because when I run the identical set of queries against smaller test clusters, I get back the expected results. I only see…

Lucian Thorr
- 1,997
- 1
- 21
- 29
0
votes
0 answers
Slow performance reading large Hive table with pyhive in comparison with RJDBC
I'm trying to read a large table from Hive in python using pyhive, the table has about 16 millions of rows. But it is taking about to 33 minutes.
When I read the same table in R with RJDBC it takes about 13 minutes to read the whole table.
Here is…

C.Lopez
- 1
- 1
- 3
0
votes
1 answer
how to use pyhive in lambda function?
I've wrote a function that is using pyhive to read from Hive. Running it locally it works fine. However when trying to use lambda function I got the error:
"Could not start SASL: b'Error in sasl_client_start (-4) SASL(-4): no mechanism available:…

Nir99
- 185
- 3
- 15
0
votes
1 answer
HIVE not connecting with Python?
I have installed Hadoop and HIVE on windows 10 by following tutorials,
https://exitcondition.com/install-hadoop-windows/ & https://www.youtube.com/watch?v=npyRXkMhrgk respectively.
Both Hadoop and HIVE are running on my machine, I have been able to…

Ayaz49
- 325
- 2
- 4
- 18
0
votes
1 answer
SASL error when trying to connect to hive(hue) by python from my PC - Windows10
Need your help!
I read all the documentation that I found on the internet (StackOverflow, Github, etc.), but nothing helped.
I am trying to connect to hive(hue) by python from my PC, my script is:
When I run this code I got this error message:
…

Benzi
- 398
- 6
- 14
0
votes
1 answer
Pyhive Presto insert select * from not running
I can us PYHIVE to connect to PRESTO and select data back just fine. I am trying to use PYHIVE to run "insert into x select from y" on presto and it is not running. I am sure I am missing something simple.
from pyhive import presto
import…

Clark
- 1
- 1
0
votes
0 answers
How to connect to HIVE using python?
I'm using a CDH cluster which is kerberous enabled and I'd like to use pyhive to connect to HIVE and read HIVE tables. Here is the code I have
from pyhive import hive
from TCLIService.ttypes import TOperationState
cursor = hive.connect(host =…

HHH
- 6,085
- 20
- 92
- 164
0
votes
1 answer
python - presto - timestamps and decimal(38,18) returned as strings?
Why are presto timestamp/decimal(38,18) data types returned a string (enclosed in u'') instead of python datetime/numeric types?
presto jdbc:
select typeof(col1),typeof(col2),typeof(col3),typeof(col4),typeof(col5),typeof(col6) from hive.x.y
result…

tooptoop4
- 234
- 3
- 15
- 45
0
votes
1 answer
Encoding impala data while reading from pandas.read_sql
When I am reading impala data using pyhive library and pandas.read_sql I am getting an error UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe2 in position 3071: unexpected end of data
Reason for this error might be that data might be…

Shankar Pandala
- 969
- 2
- 8
- 28
0
votes
2 answers
How to use pyhive sqlalchemy to connect to Databricks clusters?
I'm using following code to connect to Databricks clusters. But got thrift.transport.TTransport.TTransportException: TSocket read 0 bytes
. I think there is some error of the my_url in create_engine.
from sqlalchemy.engine import…

dlwlrma
- 808
- 2
- 10
- 21
0
votes
1 answer
unable to connect to hive using pyhive
I am getting "EOFError()" error while connecting to remove hiveserver2. My pyhive version is 0.6.1
hiveserver2 is using http as the transport mode
This is the exception
ile "", line 1, in
File…

Nipun
- 4,119
- 5
- 47
- 83