
Apache Spark 2.x Machine Learning Cookbook
- 666 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Apache Spark 2.x Machine Learning Cookbook
About this book
Simplify machine learning model implementations with SparkAbout This Book⢠Solve the day-to-day problems of data science with Spark⢠This unique cookbook consists of exciting and intuitive numerical recipes⢠Optimize your work by acquiring, cleaning, analyzing, predicting, and visualizing your dataWho This Book Is ForThis book is for Scala developers with a fairly good exposure to and understanding of machine learning techniques, but lack practical implementations with Spark. A solid knowledge of machine learning algorithms is assumed, as well as hands-on experience of implementing ML algorithms with Scala. However, you do not need to be acquainted with the Spark ML libraries and ecosystem.What You Will Learn⢠Get to know how Scala and Spark go hand-in-hand for developers when developing ML systems with Spark⢠Build a recommendation engine that scales with Spark⢠Find out how to build unsupervised clustering systems to classify data in Spark⢠Build machine learning systems with the Decision Tree and Ensemble models in Spark⢠Deal with the curse of high-dimensionality in big data using Spark⢠Implement Text analytics for Search Engines in Spark⢠Streaming Machine Learning System implementation using SparkIn DetailMachine learning aims to extract knowledge from data, relying on fundamental concepts in computer science, statistics, probability, and optimization. Learning about algorithms enables a wide range of applications, from everyday tasks such as product recommendations and spam filtering to cutting edge applications such as self-driving cars and personalized medicine. You will gain hands-on experience of applying these principles using Apache Spark, a resilient cluster computing system well suited for large-scale machine learning tasks.This book begins with a quick overview of setting up the necessary IDEs to facilitate the execution of code examples that will be covered in various chapters. It also highlights some key issues developers face while working with machine learning algorithms on the Spark platform. We progress by uncovering the various Spark APIs and the implementation of ML algorithms with developing classification systems, recommendation engines, text analytics, clustering, and learning systems. Toward the final chapters, we'll focus on building high-end applications and explain various unsupervised methodologies and challenges to tackle when implementing with big data ML systems.Style and approachThis book is packed with intuitive recipes supported with line-by-line explanations to help you understand how to optimize your work flow and resolve problems when working with complex data modeling tasks and predictive algorithms. This is a valuable resource for data scientists and those working on large scale data projects.
Tools to learn more effectively

Saving Books

Keyword Search

Annotating Text

Listen to it instead
Information
Practical Machine Learning with Regression and Classification in Spark 2.0 - Part I
- Fitting a linear regression line to data the old-fashioned way
- Generalized linear regression in Spark 2.0
- Linear regression API with Lasso and L-BFGS in Spark 2.0
- Linear regression API with Lasso and auto optimization selection in Spark 2.0
- Linear regression API with ridge regression and auto optimization selection in Spark 2.0
- Isotonic regression in Apache Spark 2.0
- Multilayer perceptron classifier in Apache Spark 2.0
- One versus Rest classifier (One-vs-All) in Apache Spark 2.0
- Survival regression - parametric AFT model in Apache Spark 2.0
Introduction

Fitting a linear regression line to data the old fashioned way
How to do it...
- Start a new project in IntelliJ or in an IDE of ...
Table of contents
- Title Page
- Copyright
- Credits
- About the Authors
- About the Reviewer
- www.PacktPub.com
- Customer Feedback
- Preface
- Practical Machine Learning with Spark Using Scala
- Just Enough Linear Algebra for Machine Learning with Spark
- Spark's Three Data Musketeers for Machine Learning - Perfect Together
- Common Recipes for Implementing a Robust Machine Learning System
- Practical Machine Learning with Regression and Classification in Spark 2.0 - Part I
- Practical Machine Learning with Regression and Classification in Spark 2.0 - Part II
- Recommendation Engine that Scales with Spark
- Unsupervised Clustering with Apache Spark 2.0
- Optimization - Going Down the Hill with Gradient Descent
- Building Machine Learning Systems with Decision Tree and Ensemble Models
- Curse of High-Dimensionality in Big Data
- Implementing Text Analytics with Spark 2.0 ML Library
- Spark Streaming and Machine Learning Library
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app