Herein you will find either the definitive documentation on an HBase topic as of its standing when the referenced HBase version shipped, or it will point to the location in Javadoc or JIRA where the pertinent information can be found. The source for this guide can be found in the _src/main/asciidoc directory of the HBase source.
This reference guide is marked up using Ascii Doc from which the finished guide is generated as part of the 'site' build target.
In reality, you need a fully-distributed configuration to fully test HBase and to use it in real-world scenarios.
In a distributed configuration, the cluster contains multiple nodes, each of which runs one or more HBase daemon.
Please read this chapter carefully, especially the Basic Prerequisites to ensure that your HBase testing and deployment goes smoothly, and prevent data loss.
Familiarize yourself with Support and Testing Expectations as well.
so that the Hadoop and HBase daemons can be managed.
Pseudo-distributed mode means that HBase still runs completely on a single host, but each HBase daemon (HMaster, HRegion Server, and Zoo Keeper) runs as a separate process: in standalone mode all daemons ran in one jvm process/instance.
By default, unless you configure the This procedure assumes that you have configured Hadoop and HDFS on your local system and/or a remote system, and that they are running and available. The guide on Setting up a Single Node Cluster in the Hadoop documentation is a good starting point.
It builds upon the previous quickstart, Pseudo-Distributed Local Install, assuming that the system you configured in that procedure is now The next chapter, configuration, gives more information about the different HBase run modes, system requirements for running HBase, and critical configuration areas for setting up a distributed HBase cluster.
This chapter expands upon the Getting Started chapter to further explain configuration of Apache HBase.