Questions tagged [happybase]

HappyBase is a developer-friendly Python library to interact with Apache HBase. HappyBase is designed for use in standard HBase setups, and offers application developers a Pythonic API to interact with HBase. Below the surface, HappyBase uses the Python Thrift library to connect to HBase using its Thrift gateway, which is included in the standard HBase 0.9x releases.

https://happybase.readthedocs.io/en/latest/

95 questions
0
votes
1 answer

KeyError using happybase because column families are returned as byte strings

I'm trying to use happybase to access my Hbase table. I'm using the python3 branch from the happybase github repo. Here's my test code: import happybase from settings import HBASE_THRIFT_SERVER connection =…
mankand007
  • 952
  • 2
  • 10
  • 22
0
votes
1 answer

HappyBase installation on multinode cluster

I have a multinode cluster with cloudera hbase. It has: 1 master, 5 thrift servers, 5 region servers, 1 gateway On what of these servers I have to install happybase? I've read the happybase doc, but didn't find this information. I need to read and…
VeLKerr
  • 2,995
  • 3
  • 24
  • 47
0
votes
1 answer

How to insert a list of JSONs into HBase using python

I have a list of JSON's which need to be read and inserted as JSON objects into HBase. Each JSON (could be a nested JSON) needs to be read from the list and inserted along with a new row key put (key, ) Format: [ { "x":"x-val", …
voidone
  • 333
  • 1
  • 3
  • 12
0
votes
1 answer

Why did Happybase's connection to Hbase Thrift just stop working?

I had code working and reading from Hbase yesterday, streaming large quantities of data. I have not touched the code and yet, somehow, when I came back to it today it doesn't want to print out the data. The only thing I did on the server was set up…
Ravaal
  • 3,233
  • 6
  • 39
  • 66
0
votes
2 answers

How do I get around the happybase "TApplicationException: Internal error processing mutateRows" error?

I'm using happybase to connect to my Hbase database. I made a sample table called 'irisSample'. Here's the part of the code that I'm having trouble with- import happybase from happybase import * import json connection =…
Ravaal
  • 3,233
  • 6
  • 39
  • 66
0
votes
2 answers

Writing contents to json file which already has headers

So I am experimenting with happybase and I want to write the contents of a scan sequence to a json document with a skeleton I've already put in. This is the skeleton for the intended output file: [{ "Key": "", "Values": "" }] and from the…
moodygeek
  • 127
  • 13
0
votes
1 answer

'No module named happybase' when running from PIG

I have a Python UDF which is connecting to HBase using Happybase. If I run the code from Python 2.7 it works perfectly. However when I call the Python UDF from Pig 0.15.0 I am getting the following error: ERROR org.apache.pig.tools.grunt.Grunt -…
arie
  • 782
  • 3
  • 13
  • 31
0
votes
1 answer

how can i run RandomRowFilter in happybase

I want to sample rowkey in hbase by happybase(because of memory limit) So I search and implemet import happybase """~ """" table = connection.table('drivers') a=list(table.scan(filter="RandomRowFilter (chance=0.1f)" )) or…
0
votes
0 answers

import happybase works in pyspark shell , but not in spark-submit

I have happybase installed on all nodes of the spark cluster (YARN cluster manager). import happybase works from the pyspark shell, and also works when I launch the app using bin/pyspark. But when I try to launch the app using spark-submit , it…
Clyde D'Cruz
  • 1,915
  • 1
  • 14
  • 36
0
votes
1 answer

difference between happybase table.scan() and hbase thrift scannerGetList()

I have two version of python script that scans the table in hbase by 1000 rows in while loop. 1st one using happybase as in https://happybase.readthedocs.org/en/latest/user.html#retrieving-rows while variable: for key, data in…
Lost test
  • 9
  • 1
  • 5
0
votes
1 answer

Output separated HBase columns using happybase

I have such HBase-table: total date1:tCount1 date2:tCount2 ... url1 date1:clickCount1 date2:clickCount2 ... url2 date1:clickCount1 date2:clickCount2 ... ... url1, url2, ... are row keys. The table has only one column family. I have a date range…
VeLKerr
  • 2,995
  • 3
  • 24
  • 47
0
votes
2 answers

Get only the first 10 columns of a row using happybase

Is it possible to get only a limited number of columns for a column family from a row? Lets say I just want to fetch the first 10 values for ['cf1': 'col1'] for a particular row.
0
votes
1 answer

Thrift error while generating python client file

I'm new to Hbase and I would like to comunicate with it throught a python API which works with Thrift. I've followed this tutorial in order to install it properly on my machine, everything seemed to worked fine then I generated a .thrift file with…
Oscar
  • 231
  • 4
  • 17
0
votes
1 answer

Filtering integers with HBase + Python

I am trying to filter rows from a HBase table (I am using HappyBase), concretely I am trying to get rows whose 'id' is less than 1000: for key, data in graph_table.scan(filter="SingleColumnValueFilter('cf', 'id', <, 'binary:1000')"): print key,…
0
votes
1 answer

Create Hbase on Amazon EC2 and use it from python

I want to create a database Hbase on Amazon EC2 and write some test data using python Happybase. How to do it? Please tell me links where I can read about it. Thanks