Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real time read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed,versioned, column-oriented database modeled after Google's Bigtable and is written in Java. Bigtable: A Distributed Storage System for Structured by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Distributed File System(HDFS). HBase includes: It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.

  • Convenient base classes for backing Hadoop MapReduce jobs with HBase tables including cascading, hive and pig source and sink modules
  • Query predicate push down via server side scan and get filters
  • Optimizations for real time queries
  • A Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
  • Extensible jruby-based (JIRB) shell
  • Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
6961 questions
15
votes
2 answers

Using HBase to store time series data

We are trying to use HBase to store time-series data. The model we have currently stores the time-series as versions within a cell. This implies that the cell could end up storing millions of versions, and the queries on this time-series would…
gurrie
  • 225
  • 1
  • 3
  • 10
15
votes
4 answers

HBase cassandra couchdb mongodb..any fundamental difference?

I just wanted to know if there is a fundamental difference between hbase, cassandra, couchdb and monogodb ? In other words, are they all competing in the exact same market and trying to solve the exact same problems. Or they fit best in different…
Federico
  • 5,438
  • 5
  • 39
  • 47
15
votes
4 answers

creating partition in external table in hive

I have successfully created and added Dynamic partitions in an Internal table in hive. i.e. by using following steps: 1-created a source table 2-loaded data from local into source table 3- created another table with partitions - partition_table 4-…
Anoop Mamgain
  • 187
  • 2
  • 3
  • 13
15
votes
1 answer

how to get the row key from hbase scan result

when scan the hbase table row by row, how can i get the row key? here is my code: for (Result rr : scanner) { System.out.println(rr); } is there any method like getKey() that i can use? thanks.
user468587
  • 4,799
  • 24
  • 67
  • 124
15
votes
3 answers

hbase connection refused

I am new to HBase and Hadoop. I have completely setup HBase and started perfectly. Now when I tried to connect from p1 to HBase (where HBase setup is on p2) using a Java client, then it is throwing a strange exception. 12/04/17 14:36:37 INFO…
khan
  • 2,664
  • 8
  • 38
  • 64
14
votes
1 answer

Hbase put shell command

How can I use the put command in Hbase shell to insert more than one value. I have a column family having five qualifiers. How can I insert values to all those 5 qualifiers using one single command using Put
Patruni Srikanth
  • 741
  • 1
  • 7
  • 14
14
votes
5 answers

Hbase client can't connect to remote Hbase server

i have written a following hbase client class for remote server: System.out.println("Hbase Demo Application "); // CONFIGURATION // ENSURE RUNNING try { HBaseConfiguration config = new…
Ali Raza
  • 1,215
  • 3
  • 15
  • 26
14
votes
3 answers

Installing Hbase / Hadoop on EC2 cluster

I know that I can spin off a EC2 cluster with Hadoop installed (unless I am wrong about that). How about Hbase? Can I have the Hadoop and Hbase premade, ready to go? Or do I need to get my hands dirty. If it is not an option, what is the best…
delmet
  • 1,013
  • 2
  • 9
  • 23
14
votes
3 answers

How to export data to text file in Apache phoenix?

I'm quite new to HBase and Phoenix. But is there a way I can dump/export data to a text file? It would be highly appreciable if I can specify the field terminator, such as ',', '|>' etc. Thanks.
dehiker
  • 454
  • 1
  • 8
  • 21
14
votes
2 answers

Using Phoenix with Cloudera Hbase (installed from repo)

I can get Phoenix working on a standalone Apache Hbase (note, all this is for Hbase 1.0.0 on RHEL6.5) For the Cloudera flavour of Hbase however I never get it working without it throwing Exceptions. (even tried RHEL7 minimal as en OS) The same thing…
Havnar
  • 2,558
  • 7
  • 33
  • 62
14
votes
2 answers

HBase scans are slow

Problem I am trying to build a secondary index with Phoenix. Index creation takes several hours. It seems to be due to slow HBase scans, as I noticed the following performance : I might need 2 hours to scan the table, whereas other developers…
Martin Pernollet
  • 2,285
  • 1
  • 28
  • 39
14
votes
1 answer

Does Spark use data locality?

I'm trying to understand Apache Spark's internals. I wonder if Spark uses some mechanisms to ensure data locality when reading from InputFormat or writing to an OutputFormat (or other formats natively supported by Spark and not derived from…
Nicola Ferraro
  • 4,051
  • 5
  • 28
  • 60
14
votes
4 answers

Is there a good library for accessing HBase from Python?

I spent some time looking around, and all I could find is Jython. It's an option, but is there something that could be used in a more pythonesque (simpler) way?
Wojtek
  • 143
  • 1
  • 1
  • 5
14
votes
8 answers

get "ERROR: Can't get master address from ZooKeeper; znode data == null" when using Hbase shell

I installed Hadoop2.2.0 and Hbase0.98.0 and here is what I do : $ ./bin/start-hbase.sh $ ./bin/hbase shell 2.0.0-p353 :001 > list then I got this: ERROR: Can't get master address from ZooKeeper; znode data == null Why am I getting this error ?…
Rickie Lau
  • 291
  • 2
  • 8
  • 15
14
votes
1 answer

How do I get a Row with HBASE Shell where the rowkey is in Hexadecimal?

If I have a rowkey that is in Hexadecimal, like x00\x01 , how do I get that in the HBASE shell? hbase(main):004:0> scan 'tsdb-tree' ROW COLUMN+CELL \x00\x01 column=t:tree, timestamp=1379421652764, value={"name":"...
Kyle Brandt
  • 26,938
  • 37
  • 124
  • 165