Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real time read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed,versioned, column-oriented database modeled after Google's Bigtable and is written in Java. Bigtable: A Distributed Storage System for Structured by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Distributed File System(HDFS). HBase includes: It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.

  • Convenient base classes for backing Hadoop MapReduce jobs with HBase tables including cascading, hive and pig source and sink modules
  • Query predicate push down via server side scan and get filters
  • Optimizations for real time queries
  • A Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
  • Extensible jruby-based (JIRB) shell
  • Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
6961 questions
2
votes
0 answers

FATAL org.apache.hadoop.hbase.master.HMaster Unhandled exception. Starting shutdown

I have 4 nodes CentOS hadoop cluster. I installed cloudera manager 5.5.1. I failed to start Hbase Master. FATAL org.apache.hadoop.hbase.master.HMaster Unhandled exception. Starting shutdown . Caused by: …
2
votes
1 answer

How to get the size of a mapr-db table?

I followed following steps for creating a mapr-db table: $ pwd /mapr/cluster1/user/project $ls bin src $ maprcli volume create -name project-tables-vol -path /user/project/tables \ -quota 100G -topology /data/rack1 $ ls bin src …
Shakti Garg
  • 263
  • 3
  • 15
2
votes
1 answer

Retrieve the Value of Dynamicaly Generated Columns from Hbase

For example, here is some records in a hbase table: 123, column=cf:dcol#0, value=aaaa 123, column=cf:dcol#1, value=bbbb 123, column=cf:dcol#2, value=cccc 123, column=cf:someOtherCol, value=dddd The column dcol# is create dynamically by…
user2296188
  • 139
  • 8
2
votes
1 answer

"Insert" only permission in Hbase

Is it possible to give just "Insert only" permission on table/columnfamily in Hbase? I checked Hbase documentation on Apache but did not find anything relevant, current ACL has "Write" permission which will include "insert+update". Can we do just…
Chhaya Vishwakarma
  • 1,407
  • 9
  • 44
  • 72
2
votes
1 answer

Why does happybase return "TSocket read 0 bytes" when I try running table.scan()?

I'm using happybase to connect to HBase. So far the connection is fine. We have thrift up and running. I'm able to connect to the table (AFAIK), but when I try to scan the table I get TTransportException: TSocket read 0 bytes. Here is the code that…
Ravaal
  • 3,233
  • 6
  • 39
  • 66
2
votes
1 answer

Order Partitionning vs Random Partitioning

According to most articles on internet Random Partitioning(RP) is better than Ordered Partitioning(OP) cause of the data distribution. in fact, I think, that cause of data replication even if we are using the OP the data will be well distributed !…
Mehdi TAZI
  • 575
  • 2
  • 5
  • 23
2
votes
0 answers

How can I lay out a rowkey or partition strategy for this scenario to prevent region hotspotting?

I'm evaluating a set of data stores for a project that I am working on and one of the options is to store information in HBase. I'm aware that there are alternatives, but I wanted to give a shot of designing a schema in HBase for this use case. I…
encee
  • 4,544
  • 4
  • 33
  • 35
2
votes
2 answers

Microsoft PowerBI with Hortonworks Hive/HBase/Spark Integration

I'm thrilled with Microsoft's offering with PowerBI but still not able to find any possible direct way to integrate with my Hortonworks Hadoop cluster. I went through the tutorials and found two things: PowerBI can fetch data from HDInsights Azure…
Ram
  • 73
  • 1
  • 5
2
votes
2 answers

Why Phoenix always add a extra column (named _0) to hbase when I execute UPSERT command?

When I execute the UPSERT command on apache phoenix, I always see that Phoenix add an extra column (named _0) with an empty value in the hbase, this column(_0) is auto generate by phoenix, but I don't need it, like this: ROW COLUMN+CELL …
Guo
  • 1,761
  • 2
  • 22
  • 45
2
votes
1 answer

How to load the latest 100 rows from Hbase to Spark

I am using Spark to load Hbase data into a JavaPairRDD<>. Right now, I am wondering if I can just load the latest 100 rows into Spark instead of all rows from Hbase. 1) I tried scan.setCaching(100), but it still returned all rows. Is it for the…
Laodao
  • 1,547
  • 3
  • 17
  • 39
2
votes
1 answer

The node /hbase-unsecure is not in ZooKeeper. It should have been written by the master

I am going to try hbase in local, I did start one zk server,when I try to create a table like this: create 'table', 'col' error: ERROR [main] client.ConnectionManager$HConnectionImplementation: The node /hbase-unsecure is not in ZooKeeper. It…
ZhAIiyI
  • 41
  • 2
  • 4
2
votes
3 answers

How to create a Phoenix Table / View over an Existing Hbase table

I am new to Phoenix, Hbase. We have a requirement to issue SQL Like queries against Hbase Database. A decision was made to go with Apache Phoenix as the SQL Skin on top of an existing Hbase table which was already getting loaded with the Data that…
user3081172
  • 21
  • 1
  • 1
  • 3
2
votes
1 answer

Adding RegionServers when salting

I read about salting and how it is used for load balancing in case of sequential keys. Basically, salt should distribute sequential rows to different region servers. I also read this article which explains how to run MR jobs on tables which were…
Kobe-Wan Kenobi
  • 3,694
  • 2
  • 40
  • 67
2
votes
2 answers

OpenTSDB Hbase RowKey design

The OpenTSDB row-key is designed as . I could understand that the design of the key leads to avoiding hot spots while writing data. But while reading if I am seeing all the metrics corresponding to a particular host, it leads to reading data from…
shubham003
  • 703
  • 2
  • 9
  • 20
2
votes
0 answers

ConnectException: connection refused when I use Java Client to connect HBase

My HBase Version is 1.1.2 And hbase java connector is : org.apache.hbase hbase-client 1.1.2 My HBase Cluster has 1 Hmaster …
iameven
  • 338
  • 4
  • 15
1 2 3
99
100