Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real-time read/write access to your Big Data. The project's goal is to host very large tables -- billions of rows by millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed, versioned, column-oriented database written in Java and modeled after Google's Bigtable, as described in "Bigtable: A Distributed Storage System for Structured Data" by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of the Hadoop Distributed File System (HDFS). It is developed as part of the Apache Software Foundation's Apache Hadoop project. HBase includes:

  • Convenient base classes for backing Hadoop MapReduce jobs with HBase tables, including Cascading, Hive, and Pig source and sink modules
  • Query predicate push-down via server-side scan and get filters
  • Optimizations for real-time queries
  • A Thrift gateway and a RESTful Web service that supports XML, Protobuf, and binary data encoding options
  • Extensible JRuby-based (JIRB) shell
  • Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia, or via JMX
6961 questions
2
votes
1 answer

Java API Error: org.apahe.hadoop.hbase does not exist

My HBase is version 1.1.3 and I can do CRUD operations using the HBase shell. I am trying to use the Java API to do CRUD operations on HBase. I am using NetBeans and have done Properties > Libraries > Add JAR/Folder, and have added the entire lib folder…
tt2244
  • 71
  • 11
2
votes
1 answer

Retrieving Hbase versioned data

I am trying to retrieve different versions of HBase data. Step 1 - Table abc has 4 columns, all with version 1 and in a single column family. a b c d 1 1 1 1 Step 2 - The values of columns b and c change and we load the updated value of column b and…
dijin
  • 61
  • 6
2
votes
1 answer

Spark Stream Kafka and Hbase Config

I have a few questions on Spark Streaming with Kafka and HBase. Below is my program for Spark Streaming; here I am using ZooKeeper configurations to connect to Kafka and HBase. Do we really need this configuration in the streaming code? Or am I doing…
nilesh1212
  • 1,561
  • 2
  • 26
  • 60
2
votes
0 answers

How to rename a column family in HBase using HBase Shell

In the HBase shell, it's possible to delete a column family using: alter 'tablename', NAME => 'familyname', METHOD => 'delete' It's also possible to create a new column family using: alter 'tablename', NAME => 'newfamilyname' So it's possible to…
Alfredo Gimenez
  • 2,174
  • 1
  • 14
  • 19
2
votes
1 answer

Spark HBase Join Error: object not serializable class: org.apache.hadoop.hbase.client.Result

I have data across two HBase tables and need to get the joined result from them. What is the best way to get the joined result? I tried joining using RDDs, but it gave me an error. I am getting the following error: object not serializable (class:…
Srini
  • 3,334
  • 6
  • 29
  • 64
2
votes
1 answer

How do you configure Apache Nutch 2.3 to honour robots metatag?

I have Nutch 2.3 set up with HBase as the backend, and I run a crawl which includes indexing to Solr and Solr deduplication. I have recently noticed that the Solr index contains unwanted webpages. In order to get Nutch to ignore these webpages I…
saintybalboa
  • 268
  • 1
  • 2
  • 11
2
votes
1 answer

Filters in HBase: Designed to filter data row-wise, or column-wise, or both?

I've been confounded by how filters work in HBase (or, largely equivalently, in HappyBase--which I use to interact with HBase). The source of my confusion is that I can't seem to get a handle on what filters do. Some filters, like…
eriophora
  • 999
  • 1
  • 8
  • 20
2
votes
1 answer

Does Hbase have region replications

I know HBase has replicas of the WAL and HFiles, but all of these are stored in HDFS for persistence. So does HBase also provide region-level replicas? We know a Region contains BlockCache and MemCache; if only one Region instance handles requests, how…
Jack
  • 5,540
  • 13
  • 65
  • 113
2
votes
1 answer

spark python script not writing to hbase

I am trying to run the script from this blog import sys import json from pyspark import SparkContext from pyspark.streaming import StreamingContext def SaveRecord(rdd): host = 'sparkmaster.example.com' table = 'cats' …
user2065276
  • 313
  • 2
  • 16
2
votes
0 answers

Hbase logs error while starting master

I am using HBase version 0.94.8 on an Ubuntu-based Linux system. After following the installation steps, when I start HBASE_PATH$bin/start-hbase.sh the console says starting master, logging to …
psaha4
  • 339
  • 3
  • 17
2
votes
1 answer

Reducing the latency between Spark and HBase nodes

I am experiencing high latency between Spark nodes and HBase nodes. The current resources I have require me to run HBase and Spark on different servers. The HFiles are compressed with the Snappy algorithm, which reduces the data size of each region…
imriqwe
  • 1,455
  • 11
  • 15
2
votes
1 answer

multiple or bulk fetch or get in HBase Java client?

Does the HBase Java client support multi-get (aka bulk fetch), kind of like memcached multi-get? I've scoured the docs and the Internet for at least an hour and couldn't find anything definitive. The HTable API offers multiple Delete and multiple…
schematic
  • 1,964
  • 1
  • 16
  • 20
2
votes
2 answers

"fatal error: TProcessor.h: No such file or directory" when trying to install Rhbase package

Hi everyone, I'm trying to install the Rhbase package. At first I was missing the thrift package, which I solved, but now it shows me another error. I added TProcessor.h into ../lib/cpp/src/thrift/processor/ but it didn't help and it shows me the same error:…
Andrea
  • 21
  • 4
2
votes
1 answer

how to change the name of column family in the hbase table

I have an HBase table with only one column family (cf1) having 100 columns under it. I need to change cf1 to a new column family (CF1). In short, I need to modify the column family name. I have gone through the following link…
sandeep007
  • 348
  • 3
  • 16
2
votes
1 answer

What is the hbase.zookeeper.quorum in hbase-site.xml

I would like to know how I can properly configure hbase.zookeeper.quorum to point to the ZooKeeper instance in cluster mode.
yolanda_dlh
  • 43
  • 1
  • 1
  • 7
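
For the hbase.zookeeper.quorum question above, a minimal hbase-site.xml sketch might look like the following. The hostnames zk1.example.com, zk2.example.com, and zk3.example.com are hypothetical placeholders for the ZooKeeper ensemble members, and 2181 is assumed as the client port (the ZooKeeper default); adjust both to match the actual cluster.

```xml
<!-- hbase-site.xml: sketch for a fully distributed cluster.
     The zk1/zk2/zk3.example.com hostnames are hypothetical. -->
<configuration>
  <property>
    <!-- Run HBase in distributed mode rather than standalone. -->
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
  <property>
    <!-- Comma-separated list of ZooKeeper ensemble hosts, without ports. -->
    <name>hbase.zookeeper.quorum</name>
    <value>zk1.example.com,zk2.example.com,zk3.example.com</value>
  </property>
  <property>
    <!-- Port on which clients connect to ZooKeeper (assumed default). -->
    <name>hbase.zookeeper.property.clientPort</name>
    <value>2181</value>
  </property>
</configuration>
```

The quorum value lists hosts only; the port is set separately through hbase.zookeeper.property.clientPort.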