Apache HBase
Developer(s) | Apache Software Foundation |
---|---|
Stable release | 0.96.0 / 19 October 2013 |
Development status | Active |
Written in | Java |
Operating system | Cross-platform |
Type | Column oriented database |
License | Apache License 2.0 |
Website | hbase.apache.org |
HBase is an open source, non-relational, distributed database modeled after Google's BigTable and is written in Java. It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed Filesystem), providing BigTable-like capabilities for Hadoop. That is, it provides a fault-tolerant way of storing large quantities of sparse data (small amounts of information caught within a large collection of empty or unimportant data, such as finding the 50 largest items in a group of 2 billion records, or finding the non-zero items representing less than 0.1% of a huge collection).
HBase features compression, in-memory operation, and Bloom filters on a per-column basis as outlined in the original BigTable paper.[1] Tables in HBase can serve as the input and output for MapReduce jobs run in Hadoop, and may be accessed through the Java API but also through REST, Avro or Thrift gateway APIs.
HBase is not a direct replacement for a classic SQL database, although recently its performance has improved, and it is now serving several data-driven websites,[2][3] including Facebook's Messaging Platform.[4][5]
In the parlance of Eric Brewer’s CAP theorem, HBase is a CP type system.
History
Apache HBase began as a project by the company Powerset out of a need to process massive amounts of data for the purposes of natural language search. It is now a top-level Apache project and has generated considerable interest.[6]
Facebook elected to implement its new messaging platform using HBase in November 2010.[4]
See also
- NoSQL
- Apache Cassandra
- Hypertable
- Apache Accumulo
- MongoDB
- Project Voldemort
- Riak
- Sqoop
- ElasticSearch
References
Bibliography
- Dimiduk, Nick; Khurana, Amandeep (November 28, 2012). HBase in Action (1st ed.). Manning Publications. p. 350. ISBN 978-1617290527.
- George, Lars (September 20, 2011). HBase: The Definitive Guide (1st ed.). O'Reilly Media. p. 556. ISBN 978-1449396107.
- Jiang, Yifeng (August 16, 2012). HBase Administration Cookbook (1st ed.). Packt Publishing. p. 332. ISBN 978-1849517140.
External links
- Official Apache HBase homepage
- Official Apache Hadoop homepage
- Understanding HBase
- A vendor-independent comparison of NoSQL databases: Cassandra, HBase, MongoDB, Riak (NetworkWorld)
|