R for Data Science Cookbook
eBook - ePub

R for Data Science Cookbook

  1. 452 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

R for Data Science Cookbook

About this book

Over 100 hands-on recipes to effectively solve real-world data problems using the most popular R packages and techniquesAbout This Book• Gain insight into how data scientists collect, process, analyze, and visualize data using some of the most popular R packages• Understand how to apply useful data analysis techniques in R for real-world applications• An easy-to-follow guide to make the life of data scientist easier with the problems faced while performing data analysisWho This Book Is ForThis book is for those who are already familiar with the basic operation of R, but want to learn how to efficiently and effectively analyze real-world data problems using practical R packages.What You Will Learn• Get to know the functional characteristics of R language• Extract, transform, and load data from heterogeneous sources• Understand how easily R can confront probability and statistics problems• Get simple R instructions to quickly organize and manipulate large datasets• Create professional data visualizations and interactive reports• Predict user purchase behavior by adopting a classification approach• Implement data mining techniques to discover items that are frequently purchased together• Group similar text documents by using various clustering methodsIn DetailThis cookbook offers a range of data analysis samples in simple and straightforward R code, providing step-by-step resources and time-saving methods to help you solve data problems efficiently.The first section deals with how to create R functions to avoid the unnecessary duplication of code. You will learn how to prepare, process, and perform sophisticated ETL for heterogeneous data sources with R packages. An example of data manipulation is provided, illustrating how to use the "dplyr" and "data.table" packages to efficiently process larger data structures. We also focus on "ggplot2" and show you how to create advanced figures for data exploration.In addition, you will learn how to build an interactive report using the "ggvis" package. Later chapters offer insight into time series analysis on financial data, while there is detailed information on the hot topic of machine learning, including data classification, regression, clustering, association rule mining, and dimension reduction.By the end of this book, you will understand how to resolve issues and will be able to comfortably offer solutions to problems encountered while performing data analysis.Style and approachThis easy-to-follow guide is full of hands-on examples of data analysis with R. Each topic is fully explained beginning with the core concept, followed by step-by-step practical examples, and concluding with detailed explanations of each concept used.

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn more here.
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access R for Data Science Cookbook by Yu-Wei, Chiu (David Chiu) in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Modelling & Design. We have over one million books available in our catalogue for you to explore.

R for Data Science Cookbook


Table of Contents

R for Data Science Cookbook
Credits
About the Author
About the Reviewer
www.PacktPub.com
eBooks, discount offers, and more
Why subscribe?
Preface
What this book covers
What you need for this book
Who this book is for
Sections
Getting ready
How to do it…
How it works…
There's more…
See also
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
1. Functions in R
Introduction
Creating R functions
Getting ready
How to do it...
How it works...
There's more...
Matching arguments
Getting ready
How to do it...
How it works...
There's more...
Understanding environments
Getting ready
How to do it...
How it works...
There's more...
Working with lexical scoping
Getting ready
How to do it...
How it works...
There's more...
Understanding closure
Getting ready
How to do it...
How it works...
There's more...
Performing lazy evaluation
Getting ready
How to do it...
How it works...
There's more...
Creating infix operators
Getting ready
How to do it...
How it works...
There's more...
Using the replacement function
Getting ready
How to do it...
How it works...
There's more...
Handling errors in a function
Getting ready
How to do it...
How it works...
There's more...
The debugging function
Getting ready
How to do it...
How it works...
There's more...
2. Data Extracting, Transforming, and Loading
Introduction
Downloading open data
Getting ready
How to do it…
How it works…
There's more…
Reading and writing CSV files
Getting ready
How to do it…
How it works…
There's more…
Scanning text files
Getting ready
How to do it…
How it works…
There's more…
Working with Excel files
Getting ready
How to do it…
How it works…
Reading data from databases
Getting ready
How to do it…
How it works…
There's more…
Scraping web data
Getting ready
How to do it…
How it works…
There's more…
Accessing Facebook data
Getting ready
How to do it…
How it works…
There's more…
Working with twitteR
Getting ready
How to do it…
How it works…
There's more…
3. Data Preprocessing and Preparation
Introduction
Renaming the data variable
Getting ready
How to do it…
How it works…
There's more…
Converting data types
Getting ready
How to do it…
How it works…
There's more…
Working with the date format
Getting ready
How to do it…
How it works…
There's more…
Adding new records
Getting ready
How to do it…
How it works…
There's more…
Filtering data
Getting ready
How to do it…
How it works…
There's more…
Dropping data
Getting ready
How to do it…
How it works…
There's more…
Merging data
Getting ready
How to do it…
How it works…
There's more…
Sorting data
Getting ready
How to do it…
How it works…
There's more…
Reshaping data
Getting ready
How to do it…
How it works…
There's more…
Detecting missing data
Getting ready
How to do it…
How it works…
There's more…
Imputing missing data
Getting ready
How to do it…
How it works…
There's more…
4. Data Manipulation
Introduction
Enhancing a data.frame with a data.table
Getting ready
How to do it…
How it works…
There's more…
Managing data with a data.table
Getting ready
How to do it…
How it works…
There's more…
Performing fast aggregation with a data.table
Getting ready
How to do it…
How it works…
There's more…
Merging large datasets with a data.table
Getting ready
How to do it…
How it works…
There's more…
Subsetting and slicing data with dplyr
Getting ready
How to do it…
How it works…
There's more…
Sampling data with dplyr
Getting ready
How to do it…...

Table of contents

  1. R for Data Science Cookbook