Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real time read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed,versioned, column-oriented database modeled after Google's Bigtable and is written in Java. Bigtable: A Distributed Storage System for Structured by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Distributed File System(HDFS). HBase includes: It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.

  • Convenient base classes for backing Hadoop MapReduce jobs with HBase tables including cascading, hive and pig source and sink modules
  • Query predicate push down via server side scan and get filters
  • Optimizations for real time queries
  • A Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
  • Extensible jruby-based (JIRB) shell
  • Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
6961 questions
2
votes
0 answers

Can I connect NiFi docker container to a HBase container over a docker user defined bridge network?

My objective: Use NiFi running on a HDF docker container to store data into HBase running on an HDP docker container. Progress: I am running two docker containers: NiFi and HBase. I have configured NiFi's PutHBaseJSON processor to write data to…
2
votes
1 answer

Get row on Spark in map Call

Itry to aggregate data from a file in HDFS. I need to add some details from those datas with value on a specific Table in hbase. but I have the exception : org.apache.spark.SparkException: Task not serializable at…
okitas
  • 31
  • 6
2
votes
0 answers

HBase BulkUpload KeyValuePair vs Put

HBase Bulkupload either can accept key-value-pairs, Puts or Deletes. I assume Deletes is for deleting rows on mass. How about Puts? Is puts just for updating a cell value where as key-value-pairs is for inserting an entire row? Just to be clear…
hba
  • 7,406
  • 10
  • 63
  • 105
2
votes
1 answer

Hbase Table.batch takes 300 secs to insert 800,000 entries into table

I am reading a json file of size 30 mb, process to create column family and key values. Then create Put object, insert rowkey and values into it. Create list of such put objects and call Table.batch() and pass this list. I am calling this when my…
2
votes
0 answers

HBase 1.2.4 integration with Spark 2.1.0 and Hadoop 2.7.3 in fully distributed mode in Ubuntu 14.4

I am trying to integrate HBase with Spark. I did two types of integration but getting error. First I copied all the HBase lib jars and paste into to Spark jars folder. Some HBase jars conflicted with Spark jars so I kept the spark jars in that case.…
Mahmud
  • 87
  • 10
2
votes
2 answers

How do I mix MUST_PASS_ALL and MUST_PASS_ONE in HBase filters?

I have an HBase scan with a ColumnPrefixFilter and multiple FuzzyFilters like so: FilterList filterList = new FilterList(); filterList.addFilter(new FuzzyFilter(...)); filterList.addFilter(new FuzzyFilter(...)); filterList.addFilter(new…
Di Zou
  • 4,469
  • 13
  • 59
  • 88
2
votes
1 answer

Is there a clever HBase Schema to Aid with Discovering Missing Value?

Let's assume I have billions of rows in my HBase table. The rows in this table change slowly, meaning there will be new rowkeys and some rowkeys get deleted. I receive lots of events per row. However, there will be very few rows that will not have…
hba
  • 7,406
  • 10
  • 63
  • 105
2
votes
1 answer

HBase-indexer & Solr : NOT found data

I am currently using hbase-indexer to index hbase in solr. When I execute foolowing command to check the indexer, hbase-indexer$ bin/hbase-indexer list-indexers --zookeeper 127.0.0.1:2181 The result is said that: myindexer + Lifecycle state:…
Chris Joo
  • 577
  • 10
  • 24
2
votes
1 answer

Issue on Spark Streaming data put data into HBase

I am a beginner in this field, so I can not get a sense of it... HBase ver: 0.98.24-hadoop2 Spark ver: 2.1.0 The following code try to put receiving data from Spark Streming-Kafka producer into HBase. Kafka input data format is like this :…
Chris Joo
  • 577
  • 10
  • 24
2
votes
1 answer

Hbase CopyTable inside Java

I want to copy one Hbase table to another location with good performance. I would like to reuse the code from CopyTable.java from Hbase-server github page I've been looking the doccumentation from hbase but it didn't help me much…
kulssaka
  • 226
  • 8
  • 27
2
votes
1 answer

How can we list the tables in HBase created/modified after a particular date/time?

I am trying to write a backup script of HBase tables. So I wanted to get all those tables which are created/modified after a particular time. Is there any command for that? Thanks for the suggestions in advance
Midhun Mathew Sunny
  • 1,271
  • 4
  • 17
  • 30
2
votes
1 answer

"The node /hbase is not in ZooKeeper" when running commands in Hbase shell

I'm trying to install HBase v1.1.2 (with Hadoop v2.7.2) in standalone mode, and am following the Apache HBase docs as closely as I can, however the configuration seems a bit tricky. Not sure if it's related to the fact that I installed HBase with…
Richie Thomas
  • 3,073
  • 4
  • 32
  • 55
2
votes
1 answer

Why Too many open files in Hbase

I have configured a 2 node cluster with hadoop and installed hbase. It was working properly and I have run some basic map reduce jobs in hadoop and I was able to create and list some tables in hbase too. However I have few data in hdfs/hbase and…
Sami
  • 490
  • 6
  • 29
2
votes
2 answers

Zookeeper not able to locate master node in hbase pseudo distributed mode

i want to run hbase in pseudo distributed mode, installation is from scratch. step followed start HBase. Start HBase Master Server Start the region server a Start HBase Shell execute list Hbase started normally so the hbase shell. When i…
bittu
  • 88
  • 6
2
votes
0 answers

Hbase mapreduce: Retrieve rows from hbase based on date

My row key format is ID|date. Scenario: Whenever a request comes to add a row in hbase keep on adding it with ID and request date. There may not be rows for an ID everyday. * Ex: row1 - AB20160513 row2 - XY20160514 row3 -…
user3875464
  • 121
  • 9