Learn Big Data from the ground up with this complete and up-to-date resource from leaders in the fieldĀ
Big Data: Concepts, Technology, and ArchitectureĀ delivers a comprehensive treatment of Big Data tools, terminology, and technology perfectly suited to a wide range of business professionals, academic researchers, and students. Beginning with a fulsome overview of what we mean when weĀ say,Ā "Big Data," the book moves on to discuss every stage of the lifecycle of Big Data.Ā
You'll learn about the creation of structured, unstructured, and semi-structured data, data storage solutions, traditional database solutions like SQL, data processing, data analytics, machine learning, and data mining. You'll also discover how specific technologies like Apache Hadoop, SQOOP, and Flume work.Ā
Big DataĀ also covers the central topic of big data visualization with Tableau, and you'll learn how to create scatter plots, histograms, bar, line, and pie charts with that software.Ā
Accessibly organized,Ā Big DataĀ includes illuminating case studies throughout the material, showing you how the included concepts have been applied in real-world settings. Some of those concepts include:Ā
- The common challenges facing big data technology and technologists, like data heterogeneity and incompleteness, data volume and velocity, storage limitations, and privacy concernsĀ
- Relational and non-relational databases, like RDBMS, NoSQL, and NewSQL databasesĀ
- Virtualizing Big Data through encapsulation, partitioning, and isolating, as well as big data server virtualizationĀ
- Apache software, including Hadoop, Cassandra, Avro, Pig, Mahout, Oozie, and HiveĀ
- The Big Data analytics lifecycle, including business case evaluation, data preparation, extraction, transformation, analysis, and visualizationĀ
Perfect for data scientists, data engineers, and database managers,Ā Big DataĀ also belongs on the bookshelves of business intelligence analysts who are required to make decisions based on large volumes of information. Executives and managers who lead teams responsible for keeping or understanding large datasets will also benefit from this book.Ā
Ā
