HBase High Performance Cookbook
eBook - ePub

  1. 350 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
About this book

Exciting projects that will teach you how complex data can be exploited to gain maximum insights

About This Book

  • Architect a good HBase cluster for a very large distributed system
  • Get to grips with the concepts of performance tuning with HBase
  • A practical guide full of engaging recipes and attractive screenshots to enhance your system's performance

Who This Book Is For

This book is intended for developers and architects who want to know all about HBase at a hands-on level. This book is also for big data enthusiasts and database developers who have worked with other NoSQL databases and now want to explore HBase as another futuristic scalable database solution in the big data space.

What You Will Learn

  • Configure HBase from a high performance perspective
  • Grab data from various RDBMS/flat files into HBase
  • Understand table design and perform CRUD operations
  • Find out how the communication between the client and server happens in HBase
  • Grasp when to use and avoid MapReduce and how to perform various tasks with it
  • Get to know the concepts of scaling with HBase through practical examples
  • Set up HBase in the Cloud for a small scale environment
  • Integrate HBase with other tools including Elasticsearch

In Detail

Apache HBase is a non-relational NoSQL database management system that runs on top of HDFS. It is an open source, distributed, versioned, column-oriented store written in Java to provide random real-time access to big data.

We'll start off by ensuring you have a solid understanding of the basics of HBase, followed by a thorough explanation of architecting an HBase cluster as per our project specifications. Next, we will explore the scalable structure of tables and learn to communicate with the HBase client. After this, we'll show you the intricacies of MapReduce and the art of performance tuning with HBase. Following this, we'll explain the concepts pertaining to scaling with HBase. Finally, you will get an understanding of how to integrate HBase with other tools such as Elasticsearch.

By the end of this book, you will have learned enough to exploit HBase to boost system performance.

Style and approach

This book is intended for software quality assurance/testing professionals, software project managers, or software developers with prior experience in using Selenium and Java to test web-based applications. This book also provides examples for C#, Python, and Ruby users.

HBase High Performance Cookbook by Ruchir Choudhry is available in PDF and/or ePUB format, alongside other popular books in Computer Science & Databases.

Table of Contents

HBase High Performance Cookbook
Credits
About the Author
About the Reviewer
www.PacktPub.com
eBooks, discount offers, and more
Why Subscribe?
Customer Feedback
Preface
What this book covers
What you need for this book
Who this book is for
Sections
Getting ready
How to do it…
How it works…
There's more…
See also
Conventions
Reader feedback
Customer support
Downloading the example code
Errata
Piracy
Questions
1. Configuring HBase
Introduction
Configuring and deploying HBase
Getting ready
How to do it…
How it works…
There's more…
See Also
Using the filesystem
Getting ready
How to do it…
The HBase setup
Starting the cluster
Validating the cluster
How it works…
There is more…
See also
Administering clusters
Getting ready
How to do it…
Log dump
Metrics dump
How it works…
See also
Managing clusters
Getting ready
gmond
gmetad
gweb
How to do it…
Ganglia setup
How it works…
There is more…
See also
2. Loading Data from Various DBs
Introduction
Extracting data from Oracle
Getting ready
How to do it…
How it works…
There's more…
See also…
Loading data using Oracle Big data connector
Getting Ready
How to do it…
How it works…
There's more…
See also…
Bulk utilities
Getting ready...
How to do it…
How it works…
See also…
Using Hive with Apache HBase
Getting ready
How to do it…
How it works…
See also…
Using Sqoop
Getting ready
How to do it…
How it works…
There's more…
Data compression
Parallelism
See also…
3. Working with Large Distributed Systems Part I
Introduction
Scaling elastically or Auto Scaling with built-in fault tolerance
How to do it…
How it works…
There's more…
See also
Auto Scaling HBase using AWS
Getting Ready
How to do it…
There's more…
See also
Works on different VM/physical, cloud hardware
Getting ready
How to do it…
There's more…
See also
4. Working with Large Distributed Systems Part II
Introduction
Seek versus transfer
The log-structured merge-tree
Data Read
Data Delete
Storage
Read path
How to do it…
There's more…
Write Path
How to do it…
How it works…
There's more…
Transactions (ACID) and multiversion concurrency control (MVCC)
Snappy
How to do it…
How it works…
There's more…
LZO compression
How to do it…
How it works...
There's more…
LZ4 compressor
How to do it…
There's more…
Replication
How to do it…
Deploying Master-Master or Cyclic Replication
How it works...
There's more…
Disabling Replication at the Peer Level
5. Working with Scalable Structure of tables
Introduction
HBase data model part 1
How to do it…
How it works…
There's more…
HBase data model part 2
How to do it…
How it works…
There's more…
How HBase truly scales on key and schema design
How to do it…
See also
6. HBase Clients
Introduction
HBase REST and Java Client
How to do it…
How it works…
There's more…
Working with Apache Thrift
How to do it…
How it works…
There's more…
Working with Apache Avro
How to do it…
How it works…
There's more…
Working with Protocol buffer
How to do it…
There's More…
Working with Pig and using Shell
How to do it…
How it works…
There's more…
7. Large-Scale MapReduce
Introduction
Getting Ready…
How to do it…
How it works…
There's more…
When not to use MapReduce
See also…
8. HBase Performance Tuning
Introduction
Working with infrastructure/operating systems
Getting ready…
How to do it…
Working with Java virtual machines
Getting ready…
How to do it…
See also
Changing the configuration of components
Getting ready…
How to do it…
See also
Working with HDFS
How to do it…
See also…
9. Performing Advanced Tasks on HBase
Machine learning using HBase
Getting ready
How to do it…
RDBMS
A plain Java program (static)
There's more…
Real-time data analysis using HBase and Mahout
How to do it…
How it works...
There's More…
Full text indexing using HBase
Getting ready
How to do it…
How it works…
There's more…
10. Optimizing HBase for Cloud
Introduction
Configuring HBase for the Cloud
How to do it…
How it works…
Connecting to an HBase cluster using the command line
How to do it…
How it works…
Backing up and restoring HBase
How to do it…
How it works…
Terminating an HBase cluster
How to do it…
Accessing HBase data with Hive
How to do it…
Viewing the HBase user interface
How to do it…
Monitoring HBase with CloudWatch
Monitoring HBase with Ganglia
How it works…
There is more…
11. Case Study
Introduction
Configuring Lily Platform
How to do it…
There's more…
Integrating Elasticsearch with HBase
Configuring
How to do it…
There's more…
Index

HBase High Performance Cookbook

Copyright © 2017 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trad...
