Questions tagged [hbase]

HBase is the Hadoop database (columnar). Use it when you need random, real time read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

HBase is an open source, non-relational, distributed,versioned, column-oriented database modeled after Google's Bigtable and is written in Java. Bigtable: A Distributed Storage System for Structured by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Distributed File System(HDFS). HBase includes: It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.

  • Convenient base classes for backing Hadoop MapReduce jobs with HBase tables including cascading, hive and pig source and sink modules
  • Query predicate push down via server side scan and get filters
  • Optimizations for real time queries
  • A Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
  • Extensible jruby-based (JIRB) shell
  • Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
6961 questions
39
votes
7 answers

Scan HTable rows for specific column value using HBase shell

I want to scan rows in a HTable from hbase shell where a column family (i.e., Tweet) has a particular value (i.e., user_id). Now I want to find all rows where tweet:user_id has value test1 as this column has value 'test1' column=tweet:user_id,…
Nishu Tayal
  • 20,106
  • 8
  • 49
  • 101
35
votes
4 answers

HBase (Easy): How to Perform Range Prefix Scan in hbase shell

I am designing an app to run on hbase and want to interactively explore the contents of my cluster. I am in the hbase shell and I want to perform a scan of all keys starting with the chars "abc". Such keys might inlcude "abc4", "abc92", "abc20014"…
David Williams
  • 8,388
  • 23
  • 83
  • 171
32
votes
1 answer

What is hive, Is it a database?

I just started exploring Hive. It has all the structures similar to an RDBMS like tables, joins, partitions.. what i understand is Hive still uses HDFS for storage and it is an SQL abstraction of HDFS. From this I am not sure weather Hive itself is…
Brainchild
  • 1,814
  • 5
  • 27
  • 52
31
votes
1 answer

How Do You Rename a Table in HBase?

I'm trying to rename a table in HBase but the help in the shell doesn't have a rename command. move, mv and other common culprits don't appear to be it, either.
WattsInABox
  • 4,548
  • 2
  • 33
  • 43
31
votes
4 answers

Get output from scans in hbase shell

Is there any way I can output the results from a scan in the hbase shell to a file? I'm assuming this is easy but I haven't been able to find anything in the documentation.
Dave Novelli
  • 2,086
  • 4
  • 32
  • 42
29
votes
3 answers

How to specify multiple dependencies using --packages for spark-submit?

I have the following as the command line to start a spark streaming job. spark-submit --class com.biz.test \ --packages \ org.apache.spark:spark-streaming-kafka_2.10:1.3.0 \ …
davidpricedev
  • 2,107
  • 2
  • 20
  • 34
27
votes
5 answers

Books to start learning big data

I would like to start learning about the big data technologies. I want to work in this area in the future. Does anyone know good books to start learning about it? Hadoop, HBase. Beginner - intermediate - advanced - Thanks in advance
Gunter Amorim
  • 77
  • 1
  • 5
  • 14
26
votes
6 answers

how to list all row keys in an hbase table?

Can anybody tell me, how to list all row keys in an hbase table?
hbase_user
  • 529
  • 4
  • 9
  • 16
26
votes
6 answers

How to decide which NoSQL technology to use?

What is the pros and cons of MongoDB (document-based), HBase (column-based) and Neo4j (objects graph)? I'm particularly interested to know some of the typical use cases for each one. What are good examples of problems that graphs can solve…
Chiron
  • 20,081
  • 17
  • 81
  • 133
26
votes
5 answers

Can OLAP be done in BigTable?

In the past I used to build WebAnalytics using OLAP cubes running on MySQL. Now an OLAP cube the way I used it is simply a large table (ok, it was stored a bit smarter than that) where each row is basically a measurement or and aggregated set of…
Niels Basjes
  • 10,424
  • 9
  • 50
  • 66
25
votes
7 answers

How to copy an existing hbase table

I have a Hbase table X and I want to create an exact copy of it and name it Y. Could someone let me know how it is possible? Thanks
user1586205
  • 283
  • 1
  • 3
  • 7
24
votes
2 answers

How to mass delete multiple rows in hbase?

I have the following rows with these keys in hbase table "mytable" user_1 user_2 user_3 ... user_9999999 I want to use the Hbase shell to delete rows from: user_500 to user_900 I know there is no way to delete, but is there a way I could use the…
Rolando
  • 58,640
  • 98
  • 266
  • 407
23
votes
6 answers

What is use of hcatalog in hadoop?

I'm new to Hadoop. I know that the HCatalog is a table and storage management layer for Hadoop. But how exactly it works and how to use it. Please give some simple example.
Vijay_Shinde
  • 1,332
  • 2
  • 17
  • 38
22
votes
5 answers

How to list the regions in an HBase table through the shell?

I would like to get the same information about the regions of a table that appear in the web UI (i.e. region name, region server, start/end key, locality), but through the hbase shell. (The UI is flaky/slow, and furthermore I want to process this…
Kevin Pauli
  • 8,577
  • 15
  • 49
  • 70
22
votes
7 answers

Java ORM for Hbase

Anyone knows a good Java ORM implementation for HBase. This one looks really nice for Ruby http://www.stanford.edu/~sqs/rhino/doc/ But could not find one for Java. Thanks.
user392887
  • 1,399
  • 3
  • 9
  • 5