Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real time read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed,versioned, column-oriented database modeled after Google's Bigtable and is written in Java. Bigtable: A Distributed Storage System for Structured by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Distributed File System(HDFS). HBase includes: It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.

  • Convenient base classes for backing Hadoop MapReduce jobs with HBase tables including cascading, hive and pig source and sink modules
  • Query predicate push down via server side scan and get filters
  • Optimizations for real time queries
  • A Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
  • Extensible jruby-based (JIRB) shell
  • Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
6961 questions
14
votes
3 answers

Specify multiple filters in hbase

Is there a way to specify multiple filters during a scan? For example - Specify both a ColumnFamilyFilter and RowFilter? Filter rowFilter = new RowFilter(CompareFilter.CompareOp.EQUAL, new RegexStringComparator( …
priya
  • 191
  • 1
  • 1
  • 9
14
votes
2 answers

Hbase Schema Nested Entity

Does anyone have an example on how to create an Hbase table with a nested entity? Example UserName (string) SSN (string) + Books (collection) The books collection would look like this for example Books isbn title etc... I cannot find a single…
nitefrog
  • 1,760
  • 6
  • 31
  • 59
14
votes
4 answers

can not access HBase status UI on http://localhost:60010

I use ubuntu 12.04, after download and un-tar hbase-0.94.1.jar. I started hbash in standalone mode by the following command user@ubuntu:~/temp/hbase-0.94.1$ bin/start-hbase.sh starting master, logging to…
user1591487
  • 425
  • 1
  • 5
  • 16
13
votes
2 answers

GUI Tool for HBase Management

Are there any good GUI tools available for HBase? Like MySQL has workbench and Yog.
Kris
  • 8,680
  • 4
  • 39
  • 67
13
votes
0 answers

Scalable distributed file system for blobs like images and other documents

Cassandra & HBase both do not efficiently support storage of blobs like images. Storing directly on HDFS stresses the Namenode. Facebook's Haystack is not open source. So is Lustre a good choice for distributed blob storage? What are other…
Pinnacle
  • 149
  • 1
  • 8
13
votes
7 answers

HBase Error - assignment of -ROOT- failure

I've just installed hadoop and hbase from cloudera (3) but when I try to go to http://localhost:60010 it just sits there continually loading. I can get to the regionserver fine - http://localhost:60030... Looking at the master hbase server logs I…
NightWolf
  • 7,694
  • 9
  • 74
  • 121
13
votes
3 answers

How can I pre-split a table in HBase

I am storing data in HBase having 5 region servers. I am using md5 hash of url as my row keys. Currently all the data is getting stored in one region server only. So I want to pre-split the regions so that data will go uniformly across all region…
Harsh Sharma
  • 10,942
  • 2
  • 18
  • 29
13
votes
5 answers

What is "Hadoop" - the definition of Hadoop?

It is kind of obvious and we will all agree that we can call HDFS + YARN + MapReduce as Hadoop. But what happens with different other combinations and other products in the Hadoop ecosystem? Is, for example, HDFS + YARN + Spark still Hadoop? Is…
neuromouse
  • 921
  • 1
  • 12
  • 32
13
votes
1 answer

java.sql.SQLException: org.apache.thrift.transport.TTransportException in hive?

I'm trying to create table in hive with java. I found java.sql.SQLException: org.apache.thrift.transport.TTransportException while executing my code. Here is my code public void createTable(String tableName) { try{ Statement…
user1099871
12
votes
1 answer

Import data from HDFS to HBase (cdh3u2)

I have Installed hadoop and hbase cdh3u2. In hadoop i have a file at the path /home/file.txt. it has the data like one,1 two,2 three,3 I want to import this file into hbase. in that, the first field should parsed as String, and 2nd field parsed as…
Nageswaran
  • 7,481
  • 14
  • 55
  • 74
12
votes
2 answers

Hadoop Hbase: Spreading column families across tables or not

The Hbase documentation makes it clear that you should group similar columns into column families, because the physical storage is done by column family. But what does it mean to put two column families into the same table, as opposed to having…
Thilo
  • 257,207
  • 101
  • 511
  • 656
12
votes
2 answers

Which HBase connector for Spark 2.0 should I use?

Our stack is composed of Google Data Proc (Spark 2.0) and Google BigTable (HBase 1.2.0) and I am looking for a connector working with these versions. The Spark 2.0 and the new DataSet API support is not clear to me for the connectors I have…
12
votes
2 answers

HBase Kerberos connection renewal strategy

Recently I enabled kerberos in my cluster, everything works great until my kerberos login expires, at say, 12 hours. At that point any connections I have created, any tables created with those connections etc will throw when I use them. This could…
user1310957
12
votes
2 answers

HBase- Store file vs HFile and Compaction

What is the difference between Store file and HFile?? I have basic idea about compaction i.e. store files are merged together to reduce seeks from the disk. Is it correct?? Can someone explain more about Compaction like the exact process and how it…
Srinu Katta
  • 155
  • 1
  • 9
12
votes
2 answers

HBase regionserver is aborted and can never be brought up after that

I used OpenTSDB over HBase (pseudo-distributed Hadoop on virtual box) to send data at very high load (~ 50,000 records / s). The system worked properly for a while but it went down suddenly. I terminated OpenTSDB and HBase. Unfortunately, I could…
duong_dajgja
  • 4,196
  • 1
  • 38
  • 65