
- 420 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Mastering Machine Learning with R - Second Edition
About this book
Master machine learning techniques with R to deliver insights in complex projectsAbout This Book⢠Understand and apply machine learning methods using an extensive set of R packages such as XGBOOST⢠Understand the benefits and potential pitfalls of using machine learning methods such as Multi-Class Classification and Unsupervised Learning⢠Implement advanced concepts in machine learning with this example-rich guideWho This Book Is ForThis book is for data science professionals, data analysts, or anyone with a working knowledge of machine learning, with R who now want to take their skills to the next level and become an expert in the field.What You Will Learn⢠Gain deep insights into the application of machine learning tools in the industry⢠Manipulate data in R efficiently to prepare it for analysis⢠Master the skill of recognizing techniques for effective visualization of data⢠Understand why and how to create test and training data sets for analysis⢠Master fundamental learning methods such as linear and logistic regression⢠Comprehend advanced learning methods such as support vector machines⢠Learn how to use R in a cloud service such as AmazonIn DetailThis book will teach you advanced techniques in machine learning with the latest code in R 3.3.2. You will delve into statistical learning theory and supervised learning; design efficient algorithms; learn about creating Recommendation Engines; use multi-class classification and deep learning; and more.You will explore, in depth, topics such as data mining, classification, clustering, regression, predictive modeling, anomaly detection, boosted trees with XGBOOST, and more. More than just knowing the outcome, you'll understand how these concepts work and what they do.With a slow learning curve on topics such as neural networks, you will explore deep learning, and more. By the end of this book, you will be able to perform machine learning with R in the cloud using AWS in various scenarios with different datasets.Style and approachThe book delivers practical and real-world solutions to problems and a variety of tasks such as complex recommendation systems. By the end of this book, you will have gained expertise in performing R machine learning and will be able to build complex machine learning projects using R and its packages.
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
Market Basket Analysis, Recommendation Engines, and Sequential Analysis
- Jeff Eisenberg, CEO of BuyerLegends.com
- Warren Buffett
Another technique a business can use is to understand the sequence in which you purchase or use their products and services. This is called sequential analysis. A very common implementation of this methodology is to understand how customers click through various webpages and/or links.
An overview of a market basket analysis
- Itemset: This is a collection of one or more items in the dataset.
- Support: This is the proportion of the transactions in the data that contain an itemset of interest.
- Confidence: This is the conditional probability that if a person purchases or does x, they will purchase or do y; the act of doing x is referred to as the antecedent or Left-Hand Side (LHS), and y is the consequence or Right-Hand Side (RHS).
- Lift: This is the ratio of the support of x occurring together with y divided by the probability that x and y occur if they are independent. It is the confidence divided by the probability of x times the probability of y; for example, say that we have the probability of x and y occurring together as 10 percent and the probability of x is 20 percent and y is 30 percent, then the lift would be 10 percent (20 percent times 30 percent) or 16.67 percent.
- Let k=1 (the number of items)
- Generate itemsets of a length that are equal to or greater than the specified support
- Iterate k + (1...n), pruning those that are infrequent (less than the support)
- Stop the iteration when no new frequent itemsets are identified
Business understanding
Let's say that we are a start-up microbrewery trying to make a headway in this grocery outlet and want to develop an understanding of what potential customers will purchase along with beer. This knowledge may just help us in identifying the right product placement within the store or support a cross-selling campaign.
Data understanding and preparation
Table of contents
- Title Page
- Copyright
- Credits
- About the Author
- About the Reviewers
- Packt Upsell
- Customer Feedback
- Preface
- A Process for Success
- Linear Regression - The Blocking and Tackling of Machine Learning
- Logistic Regression and Discriminant Analysis
- Advanced Feature Selection in Linear Models
- More Classification Techniques - K-Nearest Neighbors and Support Vector Machines
- Classification and Regression Trees
- Neural Networks and Deep Learning
- Cluster Analysis
- Principal Components Analysis
- Market Basket Analysis, Recommendation Engines, and Sequential Analysis
- Creating Ensembles and Multiclass Classification
- Time Series and Causality
- Text Mining
- R on the Cloud
- R Fundamentals
- Sources