Questions tagged [impyla]

Impyla is a Python client for HiveServer2 implementations (e.g., Impala, Hive) for distributed query engines.

Impyla is a Python client for HiveServer2 implementations (e.g., Impala, Hive) for distributed query engines.

Features:

  • HiveServer2 compliant; works with Impala and Hive, including nested data

  • Fully DB API 2.0 (PEP 249)-compliant Python client (similar to sqlite or MySQL clients) supporting Python 2.6+ and Python 3.3+.

  • Works with Kerberos, LDAP, SSL

  • SQLAlchemy connector

  • Converter to pandas DataFrame, allowing easy integration into the Python data stack (including scikit-learn and matplotlib); but see the Ibis project for a richer experience

References:

Related tags:

52 questions
0
votes
1 answer

Set request timeout for Impala using impyla (python driver)

Is it possible to set the request timeout to Impala? Using Python and impyla driver
Oleksandr Yarushevskyi
  • 2,789
  • 2
  • 17
  • 24
0
votes
1 answer

Through Impyla,How to fetch millions of data row by row?

For example,some python package such as vertica_python have the function of iterate(). With this,we can fetch data row by row using for row in cursor.iterated(): Which is very useful to process big query with millions of lines. Is there…
james.peng
  • 373
  • 1
  • 3
  • 13
0
votes
1 answer

Is it possible to use common table expressions with impala using Python?

It appears when using Python's impala.dbapi connect, you can only run one command per execute. I am using Python 2.7. I would like to create two common table expressions then join them, but I am unable to get this to work. If I run the SQL as it…
Brian DS
  • 51
  • 9
0
votes
1 answer

How to escape characters in python impyla query for Cloudera Impala running on Hadoop

I am trying to escape characters in python impyla query for Cloudera Impala on Hadoop but nothing seems to work.. The template syntax doesn't escape (Unusual for a database API..) cursor.execute('SELECT * from table where col1 = %s', tuple(["John's…
user1688726
  • 319
  • 2
  • 4
  • 11
0
votes
1 answer

Connecting to Kerberized hadoop cluster using python module impyla

I am using impyla module to connect to kerberized hadoop cluster. I want to access hiveserver2/hive but I was getting the below error: test_conn.py from impala.dbapi import connect import os connection_string = 'hdp296m1.XXX.XXX.com' conn =…
0
votes
0 answers

Extremely slow writing speed when inserting rows into Hive table using impyla

I'm experiencing extremely slow writing speed when trying to insert rows into a partitioned Hive table using impyla. This is an example of the code I wrote in python: from impala.dbapi import connect targets = ... # targets is a dictionary of…
Gianluca
  • 6,307
  • 19
  • 44
  • 65
-1
votes
0 answers

Error - Access to impala data with impyla

I am trying to connect impala with impyla and the following code. from impala.dbapi import connect # Set the connection parameters host_name = "xxx.com" port = 21050 database = "default" user = "xxx" password = "xxx" connection_url =…
Wakame
  • 413
  • 2
  • 5
  • 15
1 2 3
4