Real-time Analytics with Storm and Cassandra
eBook - ePub

Shilpi Saxena

  1. 220 pages
  2. English
  3. ePUB (mobile-friendly)
  4. Available on iOS and Android

About This Book

  • Create your own data processing topology and implement it in various real-time scenarios using Storm and Cassandra
  • Build highly available and linearly scalable applications using Storm and Cassandra that will process voluminous data at lightning speed
  • A pragmatic and example-oriented guide to implement various applications built with Storm and Cassandra

Who This Book Is For

If you want to efficiently use Storm and Cassandra together and excel at developing production-grade, distributed real-time applications, then this book is for you. No prior knowledge of using Storm and Cassandra together is necessary. However, a background in Java is expected.

Information

Year
2015
ISBN
9781784390006

Table of Contents

Real-time Analytics with Storm and Cassandra
Credits
About the Author
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why subscribe?
Free access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Errata
Piracy
Questions
1. Let's Understand Storm
Distributed computing problems
Real-time business solution for credit or debit card fraud detection
Aircraft Communications Addressing and Reporting system
Healthcare
Other applications
Solutions for complex distributed use cases
The Hadoop solution
A custom solution
Licensed proprietary solutions
Other real-time processing tools
A high-level view of various components of Storm
Delving into the internals of Storm
Quiz time
Summary
2. Getting Started with Your First Topology
Prerequisites for setting up Storm
Components of a Storm topology
Spouts
Bolts
Streams
Tuples – the data model in Storm
Executing a sample Storm topology – local mode
WordCount topology from the Storm-starter project
Executing the topology in the distributed mode
Set up Zookeeper (V 3.3.5) for Storm
Setting up Storm in the distributed mode
Launching Storm daemons
Executing the topology from Command Prompt
Tweaking the WordCount topology to customize it
Quiz time
Summary
3. Understanding Storm Internals by Examples
Customizing Storm spouts
Creating FileSpout
Tweaking WordCount topology to use FileSpout
The SocketSpout class
Anchoring and acking
The unreliable topology
Stream groupings
Local or shuffle grouping
Fields grouping
All grouping
Global grouping
Custom grouping
Direct grouping
Quiz time
Summary
4. Storm in a Clustered Mode
The Storm cluster setup
Zookeeper configurations
Cleaning up Zookeeper
Storm configurations
Storm logging configurations
The Storm UI
Section 1
Section 2
Section 3
Section 4
The visualization section
Storm monitoring tools
Quiz time
Summary
5. Storm High Availability and Failover
An overview of RabbitMQ
Installing the RabbitMQ cluster
Prerequisites for the setup of RabbitMQ
Setting up a RabbitMQ server
Testing the RabbitMQ server
Creating a RabbitMQ cluster
Enabling the RabbitMQ UI
Creating mirror queues for high availability
Integrating Storm with RabbitMQ
Creating a RabbitMQ feeder component
Wiring the topology for the AMQP spout
Building high availability of components
High availability of the Storm cluster
Guaranteed processing of the Storm cluster
The Storm isolation scheduler
Quiz time
Summary
6. Adding NoSQL Persistence to Storm
The advantages of Cassandra
Columnar database fundamentals
Types of column families
Types of columns
Setting up the Cassandra cluster
Installing Cassandra
Multiple data centers
Prerequisites for setting up multiple data centers
Installing Cassandra data centers
Introduction to CQLSH
Introduction to CLI
Using different client APIs to access Cassandra
Storm topology wired to the Cassandra store
The best practices for Storm/Cassandra applications
Quiz time
Summary
7. Cassandra Partitioning, High Availability, and Consistency
Consistent hashing
One or more node goes down
One or more node comes back up
Replication in Cassandra and strategies
Cassandra consistency
Write consistency
Read consistency
Consistency maintenance features
Quiz time
Summary
8. Cassandra Management and Maintenance
Cassandra – gossip protocol
Bootstrapping
Failure scenario handling – detection and recovery
Cassandra cluster scaling – adding a new node
Cassandra cluster – replacing a dead node
The replication factor
The nodetool commands
Cassandra fault tolerance
Cassandra monitoring systems
JMX monitoring
Datastax OpsCenter
Quiz time
Summary
9. Storm Management and Maintenance
Scaling the Storm cluster – adding new supervisor nodes
Scaling the Storm cluster and rebalancing the topology
Rebalancing using the GUI
Rebalancing using the CLI
Setting up workers and parallelism to enhance processing
Scenario 1
Scenario 2
Scenario 3
Storm troubleshooting
The Storm UI
Storm logs
Quiz time
Summary
10. Advance Concepts in Storm
Building a Trident topology
Understanding the Trident API
Local partition manipulation operation
Functions
Filters
partitionAggregate
Sum aggregate
CombinerAggregator
ReducerAggregator
Aggregator
Operations related to stream repartitioning
Data aggregations over the streams
Grouping over a field in a stream
Merge and join
Examples and illustrations
Quiz time
Summary
11. Distributed Cache and CEP with Storm
The need for distributed caching in Storm
Introduction to memcached
Setting up memcache
Building a topology with a cache
Introduction to the complex event processing engine
Esper
Getting started with Esper
Integrating Esper with Storm
Quiz time
Summary
A. Quiz Answers
Chapter 1
Chapter 2
Chapter 3
Chapter 4
Chapter 5
Chapter 6
Chapter 7
Chapter 8
Chapter 9
Chapter 10
Chapter 11
Index

Real-time Analytics with Storm and Cassandra

Copyright © 2015 Packt Publishing
All rights reserved. No part of this book ma...