Aster Data Database Administration

Tom Coffing, Todd Wilson

  1. 759 pages
  2. English

About This Book

The Aster Data Database Administration book is the Swiss Army Knife for the DBA. It covers almost every subject on Aster Data. The DBA will be able to use the Aster Data AMC to perform heavy Aster Data DBA functions, perform backups and recoveries, create and design tables for maximum performance, set up workload management, keep advanced SQL and analytic examples at their fingertips, and understand the complicated MapReduce capabilities of Aster Data. This book is nearly 800 pages of beautiful colored examples that no Aster Data DBA should be without.

Information

Year
2014
ISBN
9781940540221

Chapter 1 – The Aster Data Architecture

“Design is not just what it looks like and feels like. Design is how it works.”
- Steve Jobs

What is Parallel Processing?

“After enlightenment, the laundry”
- Zen Proverb
“After parallel processing the laundry, enlightenment!”
- Aster Zen Proverb
Two guys were having fun on a Saturday night when one said, “I’ve got to go and do my laundry.” The other said, “What?!” The man explained that if he went to the laundromat the next morning, he would be lucky to get one machine and would be there all day. But if he went on Saturday night, he could get all the machines and do all his washing and drying in two hours. Now that’s parallel processing mixed in with a little dry humor!

Aster Data is a Parallel Processing System

The queen takes the request from the user and builds the plan for the vworkers. The vworkers retrieve their portion of the data and pass the results to the queen. The queen delivers the answer set to the user.
Each vworker holds a portion of every table and is responsible for reading and writing its assigned data to and from its disk. Queries are submitted to the queen, which plans, optimizes, and manages execution by sending the necessary subqueries to each vworker. Each vworker performs its subquery or subqueries independently of the others, following only the queen’s plan. The results from each vworker are returned to the queen, where they are combined and delivered back to the user.
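The flow described above (the queen sends a subquery to every vworker, each vworker works only on its own rows, and the queen combines the partial results) can be sketched in a few lines of Python. This is only an illustration of the pattern, not Aster's actual implementation: the names `VWORKER_SLICES`, `run_subquery`, and `queen_execute` are hypothetical, the data is made up, and threads stand in for separate vworker processes.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical data: each inner list is the slice of one table held by one vworker.
VWORKER_SLICES = [
    [120, 75, 310],
    [42, 560],
    [99, 101, 200, 15],
]

def run_subquery(rows):
    """Stand-in for a vworker: compute a partial SUM and COUNT over local rows only."""
    return sum(rows), len(rows)

def queen_execute(slices):
    """Stand-in for the queen: send the same subquery to every slice, then combine."""
    with ThreadPoolExecutor(max_workers=len(slices)) as pool:
        partials = list(pool.map(run_subquery, slices))
    total = sum(partial[0] for partial in partials)
    count = sum(partial[1] for partial in partials)
    return total, count

if __name__ == "__main__":
    total, count = queen_execute(VWORKER_SLICES)
    print("sum:", total, "rows:", count)
```

No vworker sees another vworker's rows in this sketch; only the small partial results travel back to the coordinator, which is what lets the work proceed in parallel.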

Each vworker holds a Portion of Every Table

Every vworker has the exact same tables, but each vworker holds different rows of those tables.
When a table is created on Aster, each vworker receives that table. When data is loaded, the rows are hashed by a distribution key, so each vworker holds a certain portion of the rows. If the queen orders a full table scan of a particular table, then all vworkers simultaneously read their portion of the data. This is the concept of parallel processing.

The Rows of a Table are Spread Across All vworkers

A Distribution Key will be hashed to distribute the rows among the vworkers. Each vworker will hold a portion of the rows. This is the concept behind parallel processing.
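As a rough illustration of hashing a distribution key, the sketch below assigns rows to vworkers by hashing one column and taking the result modulo the number of vworkers. The hash function Aster actually uses is not stated here, so `zlib.crc32` is an assumed stand-in, and `vworker_for`, `distribute`, and the sample `customer_id` rows are hypothetical.

```python
import zlib
from collections import defaultdict

def vworker_for(key, num_vworkers):
    """Map a distribution-key value to a vworker index using a deterministic hash.

    crc32 is an assumption for illustration; it is not Aster's real hash function.
    """
    return zlib.crc32(str(key).encode("utf-8")) % num_vworkers

def distribute(rows, key_column, num_vworkers):
    """Group rows by the vworker that would own them."""
    placement = defaultdict(list)
    for row in rows:
        placement[vworker_for(row[key_column], num_vworkers)].append(row)
    return placement

if __name__ == "__main__":
    # Hypothetical sample rows keyed by customer_id.
    rows = [{"customer_id": i, "amount": i * 10} for i in range(1, 13)]
    for vworker, owned in sorted(distribute(rows, "customer_id", 4).items()):
        print(vworker, [r["customer_id"] for r in owned])
```

Because the hash is deterministic, rows with the same key value always land on the same vworker, and every row belongs to exactly one vworker.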

The Aster Data Architecture

Aster can scale to thousands of nodes. Each node is a standard, inexpensive commodity x86 server with locally attached disk storage, networked to the other nodes over commodity Gigabit Ethernet (GigE). The Queen Node is the brains behind the operation. The Worker Nodes hold the data and do the processing based on the Queen's plan. The Loader Nodes load data into Aster and export data from it. The Backup Nodes provide large disks used to back up and restore data on the Aster system.

The Queen Node

• The queen node comes up with a plan for the vworkers to retrieve their portion of the data.
• The queen is the coordinator of how the data is distributed across vworkers.
• The queen delivers the answer set to the end user.
• The queen node is the software coordinator and keeper of the data dictionary and other system tables.
• You can maintain an inactive queen as a backup.
• The queen provides the cluster logic that glues all nodes of the system together. The queen is responsible for all cluster, transaction, and storage management aspects of the system.

The Worker Node

• Worker nodes are where the data is stored and analyzed.
• A worker node is composed of multiple vworkers.
• Each vworker has its own storage, and the vworkers work in parallel to process data simultaneously.
• The queen communicates with vworkers via standard SQL, and the vworkers on various worker nodes communicate with each other.

The Loader Node

• The Aster Database Loader utility and loader nodes form the massively parallel backbone of the Aster Database for performing data loads and exports.
• Loader nodes are designed to be CPU-heavy nodes that have no major disk capacity.
• These independent nodes also help isolate loads and exports from query processing.
• The Aster Database Loader utility communicates with loader nodes and acts as a landing zone for bulk data during both loads and exports.
• Because of this brilliant design, Aster can process queries while loading fresh data, and it can process queries and loads while exporting data off of Aster.
The loader nodes handle all aspects of load and export, including transforming and reformatting data. When loading, the loader nodes also generate the hash keys and perform compression. The loader nodes handle the processor-intensive functions that the host normally performs in most systems, so imports and exports can run while queries and backups are running at the same time.
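To make the loader's role more concrete, here is a minimal sketch of the three steps named above: reformatting raw input, generating a hash for each row's distribution key, and compressing the result before it is handed on. It assumes CSV input, `zlib.crc32` as the hash, and `zlib` compression purely for illustration; the real formats and algorithms are not given here, and `prepare_batches` is a hypothetical name.

```python
import csv
import io
import zlib

def prepare_batches(csv_text, key_column, num_vworkers):
    """Parse raw CSV, route each row by its hashed key, and compress one batch per target.

    Returns a dict mapping a vworker index to a compressed CSV batch (bytes).
    The hash and compression choices are assumptions, not Aster's actual ones.
    """
    batches = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        target = zlib.crc32(row[key_column].encode("utf-8")) % num_vworkers
        batches.setdefault(target, []).append(row)

    compressed = {}
    for target, rows in batches.items():
        buf = io.StringIO()
        writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
        compressed[target] = zlib.compress(buf.getvalue().encode("utf-8"))
    return compressed

if __name__ == "__main__":
    raw = "customer_id,amount\n1,10\n2,20\n3,30\n4,40\n"
    for target, blob in sorted(prepare_batches(raw, "customer_id", 2).items()):
        print(target, len(blob), "compressed bytes")
```

Doing the hashing and compression on a separate, CPU-heavy machine means the nodes that store data receive batches that are already routed and compact, which is the isolation between loading and query processing that the text describes.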

The Backup Node

• Aster can back up your entire Aster system or just individual tables on backup nodes.
• The backup nodes are not an Aster Database. Instead, they are a set of disk-heavy Aster Database Backup Nodes designed for backup purposes only.
• The Backup Nodes can be used to restore data to the Aster Dat...
