eBook - ePub
Aster Data SQL and MapReduce
Tom Coffing, John Nolan
This is a test
Partager le livre
- 437 pages
- English
- ePUB (adapté aux mobiles)
- Disponible sur iOS et Android
eBook - ePub
Aster Data SQL and MapReduce
Tom Coffing, John Nolan
DĂ©tails du livre
Aperçu du livre
Table des matiĂšres
Citations
Ă propos de ce livre
The Aster Data SQL and MapReduce book shows you the fundamental architecture that will allow you to fully understand how Aster Data works. You will be able to create tables, perform partitioning, follow the best Aster modeling strategies, and have a great reference for Aster Data SQL. You will also elevate your knowledge immensely from the brilliant MapReduce portion of the books. You will have MapReduce examples, explanations, and workshops that are designed to make you a MapReduce wizard.
Foire aux questions
Comment puis-je résilier mon abonnement ?
Il vous suffit de vous rendre dans la section compte dans paramĂštres et de cliquer sur « RĂ©silier lâabonnement ». Câest aussi simple que cela ! Une fois que vous aurez rĂ©siliĂ© votre abonnement, il restera actif pour le reste de la pĂ©riode pour laquelle vous avez payĂ©. DĂ©couvrez-en plus ici.
Puis-je / comment puis-je télécharger des livres ?
Pour le moment, tous nos livres en format ePub adaptĂ©s aux mobiles peuvent ĂȘtre tĂ©lĂ©chargĂ©s via lâapplication. La plupart de nos PDF sont Ă©galement disponibles en tĂ©lĂ©chargement et les autres seront tĂ©lĂ©chargeables trĂšs prochainement. DĂ©couvrez-en plus ici.
Quelle est la différence entre les formules tarifaires ?
Les deux abonnements vous donnent un accĂšs complet Ă la bibliothĂšque et Ă toutes les fonctionnalitĂ©s de Perlego. Les seules diffĂ©rences sont les tarifs ainsi que la pĂ©riode dâabonnement : avec lâabonnement annuel, vous Ă©conomiserez environ 30 % par rapport Ă 12 mois dâabonnement mensuel.
Quâest-ce que Perlego ?
Nous sommes un service dâabonnement Ă des ouvrages universitaires en ligne, oĂč vous pouvez accĂ©der Ă toute une bibliothĂšque pour un prix infĂ©rieur Ă celui dâun seul livre par mois. Avec plus dâun million de livres sur plus de 1 000 sujets, nous avons ce quâil vous faut ! DĂ©couvrez-en plus ici.
Prenez-vous en charge la synthÚse vocale ?
Recherchez le symbole Ăcouter sur votre prochain livre pour voir si vous pouvez lâĂ©couter. Lâoutil Ăcouter lit le texte Ă haute voix pour vous, en surlignant le passage qui est en cours de lecture. Vous pouvez le mettre sur pause, lâaccĂ©lĂ©rer ou le ralentir. DĂ©couvrez-en plus ici.
Est-ce que Aster Data SQL and MapReduce est un PDF/ePUB en ligne ?
Oui, vous pouvez accĂ©der Ă Aster Data SQL and MapReduce par Tom Coffing, John Nolan en format PDF et/ou ePUB ainsi quâĂ dâautres livres populaires dans Informatik et Data-Warehousing. Nous disposons de plus dâun million dâouvrages Ă dĂ©couvrir dans notre catalogue.
Informations
Sujet
InformatikSous-sujet
Data-WarehousingChapter 1 â The Aster Data Architecture
âDesign is not just what it looks like and feels like. Design is how it works.â
- Steve Jobs
What is Parallel Processing?
âAfter enlightenment, the laundryâ
- Zen Proverb
âAfter parallel processing the laundry, enlightenment!â
-Aster Zen Proverb
Two guys were having fun on a Saturday night when one said, âIâve got to go and do my laundry.â The other said, âWhat?!â The man explained that if he went to the laundry mat the next morning, he would be lucky to get one machine and be there all day. But, if he went on Saturday night, he could get all the machines. Then, he could do all his wash and dry in two hours. Now thatâs parallel processing mixed in with a little dry humor!
Aster Data is a Parallel Processing System
Each vworker holds a portion of every table and is responsible for reading and writing the data that it is assigned to and from its disk. Queries are submitted to the queen who plans, optimizes, and manages the execution of the query by sending the necessary subqueries to each vworker. Each vworker performs its subquery or subqueries independent of the others, completely following only the queenâs plan. The final results of queries performed on each vworker is returned to the queen where they can be combined and delivered back to the user.
Each vworker holds a Portion of Every Table
When a table is created on Aster, each vworker receives that table. When data is loaded, the rows are hashed by a distribution key so each vworker holds a certain portion of the rows. If the queen orders a full table scan of a particular table, then all vworkers simultaneously read their portion of the data. This is the concept of parallel processing.
The Rows of a Table are Spread Across All vworkers
A Distribution Key will be hashed to distribute the rows among the vworkers. Each vworker will hold a portion of the rows. This is the concept behind parallel processing.
Aster Tables are defined as Fact or Dimension when Created
An Aster Table will be either a Fact or Dimension Table. Fact tables are usually large, and dimension tables are relatively smaller. Fact tables will generally be distributed by hash on a distribution key which is a key column in the table. Dimension tables are usually distributed by replicating the table across all vworkers.
Fact Table
A Distribution Key will be hashed to distribute the rows among the vworkers.
A More Detailed Look at the Fact Table Distribution
A Distribution Key will be hashed to distribute the rows among the vworkers. The entire row will be held by the vworker, but the row finds its vworker based on hash.
Dimension Table are Replicated
Dimension tables are relatively smaller than the large fact table they join to. Dimension tables are usually, but not always, distributed by replicating the table across all vworkers. That means that each vworker has the exact same copy of the entire table.
A Dimension Table is often Replicated across vworkers
Fact and Dimension tables are created in this manner for join purposes. Dimension tables are smaller so they are replicated, but Fact tables are distributed by a hash key.