Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real time read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed,versioned, column-oriented database modeled after Google's Bigtable and is written in Java. Bigtable: A Distributed Storage System for Structured by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Distributed File System(HDFS). HBase includes: It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.

Convenient base classes for backing Hadoop MapReduce jobs with HBase tables including cascading, hive and pig source and sink modules
Query predicate push down via server side scan and get filters
Optimizations for real time queries
A Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
Extensible jruby-based (JIRB) shell
Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX

6961 questions

votes

0 answers

Can I connect NiFi docker container to a HBase container over a docker user defined bridge network?

My objective: Use NiFi running on a HDF docker container to store data into HBase running on an HDP docker container. Progress: I am running two docker containers: NiFi and HBase. I have configured NiFi's PutHBaseJSON processor to write data to…

asked Feb 03 '17 at 21:14

James Medel

votes

1 answer

Get row on Spark in map Call

Itry to aggregate data from a file in HDFS. I need to add some details from those datas with value on a specific Table in hbase. but I have the exception : org.apache.spark.SparkException: Task not serializable at…

java hadoop apache-spark hbase

asked Feb 01 '17 at 17:07

okitas

votes

0 answers

HBase BulkUpload KeyValuePair vs Put

HBase Bulkupload either can accept key-value-pairs, Puts or Deletes. I assume Deletes is for deleting rows on mass. How about Puts? Is puts just for updating a cell value where as key-value-pairs is for inserting an entire row? Just to be clear…

hbase

asked Jan 31 '17 at 00:00

hba

7,406
10
63
105

votes

1 answer

Hbase Table.batch takes 300 secs to insert 800,000 entries into table

I am reading a json file of size 30 mb, process to create column family and key values. Then create Put object, insert rowkey and values into it. Create list of such put objects and call Table.batch() and pass this list. I am calling this when my…

hbase apache-kafka kafka-consumer-api kafka-producer-api hbase-client

asked Jan 30 '17 at 12:30

AnswerSeeker

votes

0 answers

HBase 1.2.4 integration with Spark 2.1.0 and Hadoop 2.7.3 in fully distributed mode in Ubuntu 14.4

I am trying to integrate HBase with Spark. I did two types of integration but getting error. First I copied all the HBase lib jars and paste into to Spark jars folder. Some HBase jars conflicted with Spark jars so I kept the spark jars in that case.…

scala apache-spark hbase hadoop2

asked Jan 30 '17 at 00:22

Mahmud

votes

2 answers

How do I mix MUST_PASS_ALL and MUST_PASS_ONE in HBase filters?

I have an HBase scan with a ColumnPrefixFilter and multiple FuzzyFilters like so: FilterList filterList = new FilterList(); filterList.addFilter(new FuzzyFilter(...)); filterList.addFilter(new FuzzyFilter(...)); filterList.addFilter(new…

java hbase

asked Jan 26 '17 at 15:20

Di Zou

4,469
13
59
88

votes

1 answer

Is there a clever HBase Schema to Aid with Discovering Missing Value?

Let's assume I have billions of rows in my HBase table. The rows in this table change slowly, meaning there will be new rowkeys and some rowkeys get deleted. I receive lots of events per row. However, there will be very few rows that will not have…

mapreduce hbase bigdata

asked Jan 25 '17 at 21:33

hba

7,406
10
63
105

votes

1 answer

HBase-indexer & Solr : NOT found data

I am currently using hbase-indexer to index hbase in solr. When I execute foolowing command to check the indexer, hbase-indexer$ bin/hbase-indexer list-indexers --zookeeper 127.0.0.1:2181 The result is said that: myindexer + Lifecycle state:…

indexing solr hbase

asked Jan 24 '17 at 09:25

Chris Joo

votes

1 answer

Issue on Spark Streaming data put data into HBase

I am a beginner in this field, so I can not get a sense of it... HBase ver: 0.98.24-hadoop2 Spark ver: 2.1.0 The following code try to put receiving data from Spark Streming-Kafka producer into HBase. Kafka input data format is like this :…

java apache-spark hbase spark-streaming

asked Jan 20 '17 at 13:28

Chris Joo

votes

1 answer

Hbase CopyTable inside Java

I want to copy one Hbase table to another location with good performance. I would like to reuse the code from CopyTable.java from Hbase-server github page I've been looking the doccumentation from hbase but it didn't help me much…

java hadoop hbase

asked Jan 20 '17 at 09:31

kulssaka

votes

1 answer

How can we list the tables in HBase created/modified after a particular date/time?

I am trying to write a backup script of HBase tables. So I wanted to get all those tables which are created/modified after a particular time. Is there any command for that? Thanks for the suggestions in advance

hbase hdfs nosql

asked Jan 17 '17 at 22:34

Midhun Mathew Sunny

1,271
4
17
30

votes

1 answer

"The node /hbase is not in ZooKeeper" when running commands in Hbase shell

I'm trying to install HBase v1.1.2 (with Hadoop v2.7.2) in standalone mode, and am following the Apache HBase docs as closely as I can, however the configuration seems a bit tricky. Not sure if it's related to the fact that I installed HBase with…

hadoop hbase apache-zookeeper

asked Jan 03 '17 at 22:35

Richie Thomas

3,073
4
32
55

votes

1 answer

Why Too many open files in Hbase

I have configured a 2 node cluster with hadoop and installed hbase. It was working properly and I have run some basic map reduce jobs in hadoop and I was able to create and list some tables in hbase too. However I have few data in hdfs/hbase and…

hadoop hbase file-descriptor

asked Jan 02 '17 at 07:15

Sami

votes

2 answers

Zookeeper not able to locate master node in hbase pseudo distributed mode

i want to run hbase in pseudo distributed mode, installation is from scratch. step followed start HBase. Start HBase Master Server Start the region server a Start HBase Shell execute list Hbase started normally so the hbase shell. When i…

hbase hdfs distributed-computing

asked Jan 02 '17 at 06:54

bittu

votes

0 answers

Hbase mapreduce: Retrieve rows from hbase based on date

My row key format is ID|date. Scenario: Whenever a request comes to add a row in hbase keep on adding it with ID and request date. There may not be rows for an ID everyday. * Ex: row1 - AB20160513 row2 - XY20160514 row3 -…

hadoop mapreduce hbase

asked Dec 30 '16 at 08:27

user3875464

Prev 1 2 3

…

99 100 Next