Questions tagged [happybase]

HappyBase is a developer-friendly Python library to interact with Apache HBase. HappyBase is designed for use in standard HBase setups, and offers application developers a Pythonic API to interact with HBase. Below the surface, HappyBase uses the Python Thrift library to connect to HBase using its Thrift gateway, which is included in the standard HBase 0.9x releases.

https://happybase.readthedocs.io/en/latest/

95 questions
1
vote
1 answer

Unable to manipulate HBase using happybase package of python - thriftpy.transport.TTransportException

I try to connect to HBase and manipulate it through the happybase package of python Here is the code i used to connect to HBase: (the 'print' lines help me to locate where the program is stopped) import happybase connection =…
I.Scale
  • 35
  • 5
1
vote
1 answer

Is thrift running on my HBase master? How to connect to it with Happybase?

I am running krejcmat/hadoop-hbase docker in pseudo distributed mode. That is master and slaves are running in separate containers on the same machine. After starting the Hadoop cluster and HBase, I start the thrift server on the master node with:…
sepideh
  • 597
  • 1
  • 5
  • 12
1
vote
1 answer

Writing to Bigtable from Python

I am developing an IoT data pipeline using Python and Bigtable, and writes are desperately slow. I have tried both Python client libraries offered by Google. The native API implements a Row class with a commit method. By iteratively committing rows…
JD Margulici
  • 965
  • 7
  • 8
1
vote
2 answers

Hbase timeout errors keep occuring

I'm accessing Hbase using happybase with python. I've a very simple function: def connect(): connection = happybase.Connection('myhost',myport) table = connection.table('MY-TABLE') try: return str(table.row('my-row')) except…
Greg Peckory
  • 375
  • 1
  • 7
  • 17
1
vote
1 answer

setting different hbase timeouts in happybase

I looked into the code of HappyBase Connection class and also the documentation here: http://happybase.readthedocs.org/en/latest/user.html#establishing-a-connection but I was not able to find any way to set different Hbase timeouts like…
1
vote
1 answer

happybase crash when it's trying to scan a very big Hbase column

my code as following: for key,data in table.scan(columns=["raw:dataInfo"]): count+=1 ... The column raw:dataInfo maybe as big as 50MB, When I ran the above code happybase crashed and threw the following exception: Traceback (most recent call…
Jack
  • 5,540
  • 13
  • 65
  • 113
1
vote
2 answers

How to secure client connections to an HBase Thrift Server?

Anyone who knows the port and host of a HBase Thrift server, and who has access to the network, can access HBase. This is a security risk. How can the client access to the HBase Thrift server be made secure?
Matthew Moisen
  • 16,701
  • 27
  • 128
  • 231
1
vote
2 answers

Is it a correct pattern to build composite primary key using wide columns stores?

HBase and Cassandra are built as wide column stores, using the concepts of both rows and columns. A row is composed of a key similar to the concept of primary key in RDBMS and a value composed of several columns A representation can be the…
amirouche
  • 7,682
  • 6
  • 40
  • 94
1
vote
2 answers

happybase connect to hbase get table information failed

I am new to hbase, want to use happybase followed the tutorial here: https://happybase.readthedocs.org/en/latest/user.html#establishing-a-connection code is as following: connection = happybase.Connection(host='10.0.0.11',…
printemp
  • 869
  • 1
  • 10
  • 33
1
vote
2 answers

HappyBase and Atomic Batch Inserts for HBase

With the HappyBase API for HBase in Python, a batch insert can be performed by the following: import happybase connection = happybase.Connection() table = connection.table('table-name') batch = table.batch() # put several rows to this batch via…
Matthew Moisen
  • 16,701
  • 27
  • 128
  • 231
1
vote
2 answers

Closing connection to hbase database using happybase in python

def hbasePopulate(self,table="abc",MachineIP="xx.xx.xx.xx"): connection=happybase.Connection(MachineIP,autoconnect=True) tablename=Reptype.lower()+'rep' print "Connecting to table " print tablename try: …
1
vote
3 answers

HappyBase - Is there an equivalent of find_one or scan_one?

All the rows in a particular HBase table that I am making a UI for happen to have the same columns and will have so for the foreseeable future. I would like my html data visualizer application to simply query for a single random row to take note of…
Matthew Moisen
  • 16,701
  • 27
  • 128
  • 231
0
votes
0 answers

pyspark dataframe with hbase connector get wrong data

I have a pyspark dataframe with unique uid field, and I write a map partition function to sum the occurrence of users. the map partition code is: def get_mark_num(p): hbase_util = HappyBaseUtil("localhost", 9090) for r in p: …
littlely
  • 1,368
  • 3
  • 18
  • 36
0
votes
1 answer

How to display timestamp while scanning hbase?

for key, data in table.scan(): print('Found: {}, {}'.format(key, data)) I have an HBase table that I need to scan and print the timestamp. I have written the above code but it only produces output like... Found: b'row1',…
ffl
  • 91
  • 1
  • 4
0
votes
1 answer

HBase, HappyBase on Google Colab - Read works, Insert fails

I have installed Hadoop, Hbase inside Google Colab and trying to create Tables, and then read and insert records there. HBase shell commands are working and using it I have create a small table called 'cars2' and can read the data there. !echo…
Calcutta
  • 1,021
  • 3
  • 16
  • 36