Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real time read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed,versioned, column-oriented database modeled after Google's Bigtable and is written in Java. Bigtable: A Distributed Storage System for Structured by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Distributed File System(HDFS). HBase includes: It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.

Convenient base classes for backing Hadoop MapReduce jobs with HBase tables including cascading, hive and pig source and sink modules
Query predicate push down via server side scan and get filters
Optimizations for real time queries
A Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
Extensible jruby-based (JIRB) shell
Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX

6961 questions

votes

0 answers

FATAL org.apache.hadoop.hbase.master.HMaster Unhandled exception. Starting shutdown

I have 4 nodes CentOS hadoop cluster. I installed cloudera manager 5.5.1. I failed to start Hbase Master. FATAL org.apache.hadoop.hbase.master.HMaster Unhandled exception. Starting shutdown . Caused by: …

asked Jan 21 '16 at 09:23

Charly Butarbutar

votes

1 answer

How to get the size of a mapr-db table?

I followed following steps for creating a mapr-db table: $ pwd /mapr/cluster1/user/project $ls bin src $ maprcli volume create -name project-tables-vol -path /user/project/tables \ -quota 100G -topology /data/rack1 $ ls bin src …

hbase mapr

asked Jan 14 '16 at 09:28

Shakti Garg

votes

1 answer

Retrieve the Value of Dynamicaly Generated Columns from Hbase

For example, here is some records in a hbase table: 123, column=cf:dcol#0, value=aaaa 123, column=cf:dcol#1, value=bbbb 123, column=cf:dcol#2, value=cccc 123, column=cf:someOtherCol, value=dddd The column dcol# is create dynamically by…

java database hbase dynamically-generated bigdata

asked Jan 12 '16 at 19:52

user2296188

votes

1 answer

"Insert" only permission in Hbase

Is it possible to give just "Insert only" permission on table/columnfamily in Hbase? I checked Hbase documentation on Apache but did not find anything relevant, current ACL has "Write" permission which will include "insert+update". Can we do just…

security hbase acl

asked Jan 12 '16 at 11:37

Chhaya Vishwakarma

1,407
9
44
72

votes

1 answer

Why does happybase return "TSocket read 0 bytes" when I try running table.scan()?

I'm using happybase to connect to HBase. So far the connection is fine. We have thrift up and running. I'm able to connect to the table (AFAIK), but when I try to scan the table I get TTransportException: TSocket read 0 bytes. Here is the code that…

python hbase thrift happybase

asked Jan 11 '16 at 16:57

Ravaal

3,233
6
39
66

votes

1 answer

Order Partitionning vs Random Partitioning

According to most articles on internet Random Partitioning(RP) is better than Ordered Partitioning(OP) cause of the data distribution. in fact, I think, that cause of data replication even if we are using the OP the data will be well distributed !…

cassandra hbase

asked Jan 10 '16 at 21:04

Mehdi TAZI

votes

0 answers

How can I lay out a rowkey or partition strategy for this scenario to prevent region hotspotting?

I'm evaluating a set of data stores for a project that I am working on and one of the options is to store information in HBase. I'm aware that there are alternatives, but I wanted to give a shot of designing a schema in HBase for this use case. I…

hbase schema region

asked Jan 04 '16 at 23:07

encee

4,544
4
33
35

votes

2 answers

Microsoft PowerBI with Hortonworks Hive/HBase/Spark Integration

I'm thrilled with Microsoft's offering with PowerBI but still not able to find any possible direct way to integrate with my Hortonworks Hadoop cluster. I went through the tutorials and found two things: PowerBI can fetch data from HDInsights Azure…

apache-spark hive hbase hortonworks-data-platform powerbi

asked Jan 04 '16 at 13:20

Ram

votes

2 answers

Why Phoenix always add a extra column (named _0) to hbase when I execute UPSERT command?

When I execute the UPSERT command on apache phoenix, I always see that Phoenix add an extra column (named _0) with an empty value in the hbase, this column(_0) is auto generate by phoenix, but I don't need it, like this: ROW COLUMN+CELL …

hadoop hbase apache-phoenix

asked Dec 29 '15 at 08:41

Guo

1,761
2
22
45

votes

1 answer

How to load the latest 100 rows from Hbase to Spark

I am using Spark to load Hbase data into a JavaPairRDD<>. Right now, I am wondering if I can just load the latest 100 rows into Spark instead of all rows from Hbase. 1) I tried scan.setCaching(100), but it still returned all rows. Is it for the…

java apache-spark hbase

asked Dec 23 '15 at 16:23

Laodao

1,547
3
17
39

votes

1 answer

The node /hbase-unsecure is not in ZooKeeper. It should have been written by the master

I am going to try hbase in local, I did start one zk server,when I try to create a table like this: create 'table', 'col' error: ERROR [main] client.ConnectionManager$HConnectionImplementation: The node /hbase-unsecure is not in ZooKeeper. It…

hadoop hbase apache-zookeeper

asked Dec 21 '15 at 15:14

ZhAIiyI

votes

3 answers

How to create a Phoenix Table / View over an Existing Hbase table

I am new to Phoenix, Hbase. We have a requirement to issue SQL Like queries against Hbase Database. A decision was made to go with Apache Phoenix as the SQL Skin on top of an existing Hbase table which was already getting loaded with the Data that…

sql apache hbase apache-phoenix

asked Dec 18 '15 at 15:32

user3081172

votes

1 answer

Adding RegionServers when salting

I read about salting and how it is used for load balancing in case of sequential keys. Basically, salt should distribute sequential rows to different region servers. I also read this article which explains how to run MR jobs on tables which were…

hadoop hbase

asked Dec 18 '15 at 09:47

Kobe-Wan Kenobi

3,694
2
40
67

votes

2 answers

OpenTSDB Hbase RowKey design

The OpenTSDB row-key is designed as . I could understand that the design of the key leads to avoiding hot spots while writing data. But while reading if I am seeing all the metrics corresponding to a particular host, it leads to reading data from…

hbase opentsdb

asked Dec 15 '15 at 18:41

shubham003

votes

0 answers

ConnectException: connection refused when I use Java Client to connect HBase

My HBase Version is 1.1.2 And hbase java connector is : org.apache.hbase hbase-client 1.1.2 My HBase Cluster has 1 Hmaster …

java hadoop hbase

asked Dec 15 '15 at 09:22

iameven

Prev 1 2 3

…

100 Next