Questions tagged [pyhive]

107 questions
0
votes
1 answer

load pipe separated csv to hive

I am trying to load pipe separated csv file in hive table using python without success. Please assist. Full code: from pyhive import hive host_name = "192.168.220.135" port = 10000 user = "cloudera" password = "cloudera" database = "default"…
waleed khalid
  • 11
  • 1
  • 2
0
votes
1 answer

PyHive ignoring Hive config

I'm intermittently getting the error message DAG did not succeed due to VERTEX_FAILURE. when running Hive queries via PyHive. Hive is running on an EMR cluster where hive.vectorized.execution.enabled is set to false in the hive-site.xml file for…
Meitsrik
  • 91
  • 1
  • 7
0
votes
4 answers

Inserting a Python Dataframe into Hive from an external server

I'm currently using PyHive (Python3.6) to read data to a server that exists outside the Hive cluster and then use Python to perform analysis. After performing analysis I would like to write data back to the Hive server. In searching for a…
RKB
  • 73
  • 1
  • 11
0
votes
1 answer

"TSocket read 0 bytes" during a long running Hive insert query

I'm running a long-ish insert query in Hive using PyHive 0.6.1 and it fails with thrift.transport.TTransport.TTransportException: TSocket read 0 bytes after about 5 minutes running. On the server side the query keeps running until finishing…
gseva
  • 373
  • 4
  • 10
0
votes
1 answer

Python Hive-PyHive Connection reset issue

I am able to connect to hive from Python using PyHive library. While fetching data from hive using PyHive I am facing issue Connection reset by peer. After connection established I am fetching data from hive using PyHive from Python. Connection is…
Bond
  • 1
  • 3
0
votes
0 answers

python sasl C1083 error

I am trying to install sasl-0.1.3 for python on windows 10 but am getting a C1083 fatal error, it is referencing Visual Studio C++ and states failed to build wheel for sasl in the error message. I was wondering if there are any solutions that do…
M-M
  • 71
  • 1
  • 7
0
votes
0 answers

Error when I connect hive through pyhive: Segmentation fault (core dumped)

The error occurred like this: I use pyhive in vitual env of anaconda3. I can connect hive successfully several days ago. It crashed suddenly today without any changes. I tried to update the conda and pyhive to the latest version, but it doesn't…
Hannah
  • 51
  • 1
  • 1
  • 7
0
votes
0 answers

How to get recently created partitions in a Hive table?

I have a table called EMPLOYEE with columns ID, NAME, DESIGNATION, CITY, COUNTRY, CONTINENT. With 3 level partitioning on CONTINENT, COUNTRY, CITY. Now I need to know recently created partitions say after a specific timestamp. Note : Assuming access…
Shashank V C
  • 153
  • 1
  • 1
  • 9
0
votes
1 answer

Error while compiling statement: FAILED: ParseException line 1:84 missing EOF at

Using Cloudera Data Science Workbench, python 2 session i am calling a .sql file : Changing tables names for security: DROP TABLE IF EXISTS database1.table1 ; CREATE TABLE IF NOT EXISTS database1.table1 ; AS Select tb.column1 FROM database2.table2…
Brian DS
  • 51
  • 9
0
votes
1 answer

How to get the hive server side error msg using PyHive sqlalchemy?

I have a sql like select * from log where concat_ws('-',year,month,day) between 2017-09-13 and 2017-09-19 which is wrong because of not having 2017-09-13 and 2017-09-19 surrounded by ''. In beeline, it will result the error msg like Error: Error…
0
votes
1 answer

Querying hive complex data types like struct in Superset's SQL LAB

I have been using superset to query an external table through hive. This table has columns which are mostly of hive complex data types like the struct. How would I write a query in SQL LAB that does something like below? SELECT header.guid FROM…
Joseph
  • 698
  • 5
  • 12
0
votes
1 answer

SuperSet Hive Query issue - expected data in columnar format

I'm having a weird issue w/ running following Hive query via SuperSet (apache incubator): SELECT date, sum(1) visits, sum(price) revenue FROM visits WHERE date BETWEEN '2017-07-21' AND '2017-07-25' AND country = 'US' GROUP BY …
hummingBird
  • 2,495
  • 3
  • 23
  • 43
0
votes
1 answer

Does python could receive hive console output?

Because I using python to control hive data etl flow, and I use pyhive to connect hive and execute HQL. I want get more information from hive, like tez session execution output or hive's java error exception output. How should I hack pyhive package,…
0
votes
2 answers

pyhive, sqlalchemy can not connect to hadoop sandbox

I have installed, pip install thrift pip install PyHive pip install thrift-sasl and since pip install sasl failed I downloaded sasl‑0.2.1‑cp27‑cp27m‑win_amd64.whl file and installed it in my Windows 8.1 PC. Then I wrote this code, from pyhive…
Marlon Abeykoon
  • 11,927
  • 4
  • 54
  • 75
0
votes
1 answer

pyhs2 error connective to kerberos enabled hive

I am trying to connect to hive using python 2(miniconda2 installation ). Below is the code I am trying- connection = hive.connect(host='psvlxihpnn1', port= '10000', authMechanism='KERBEROS', user='***',password='****', configuration={'krb_host':…