From Glitchdata
Jump to navigation Jump to search

Apache HBase is a non-relational (NoSQL) database that runs on top of the Hadoop Distributed File System (HDFS). Its is columnar and provides fault-tolerant storage and quick access to large quantities of sparese data. It also adds transactional capabilities to Hadoop, allowing users to conduct updates, inserts and deletes.

HBase is a non-relational, column-oriented database management system that runs on top of Hadoop Distributed File System (HDFS). It is well suited for sparse data sets, which are common in many big data use cases.

Unlike relational database systems, HBase does not support a structured query language like SQL; in fact, HBase isn’t a relational data store at all. HBase applications are written in Java much like a typical Apache™ MapReduce application. HBase does support writing applications in Apache™ Avro™, REST, and Thrift.

An HBase system comprises a set of tables. Each table contains rows and columns, much like a traditional database. Each table must have an element defined as a Primary Key, and all access attempts to HBase tables must use this Primary Key.

Hbase provides random, real time access to your Big Data. HBase was created for hosting very large tables with billions of rows and millions of columns.