
Hands-On Exploratory Data Analysis with R
Become an expert in exploratory data analysis using R packages
- 266 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Hands-On Exploratory Data Analysis with R
Become an expert in exploratory data analysis using R packages
About this book
Learn exploratory data analysis concepts using powerful R packages to enhance your R data analysis skills
Key Features
- Speed up your data analysis projects using powerful R packages and techniques
- Create multiple hands-on data analysis projects using real-world data
- Discover and practice graphical exploratory analysis techniques across domains
Book Description
Hands-On Exploratory Data Analysis with R will help you build not just a foundation but also expertise in the elementary ways to analyze data. You will learn how to understand your data and summarize its main characteristics. You'll also uncover the structure of your data, and you'll learn graphical and numerical techniques using the R language.
This book covers the entire exploratory data analysis (EDA) process—data collection, generating statistics, distribution, and invalidating the hypothesis. As you progress through the book, you will learn how to set up a data analysis environment with tools such as ggplot2, knitr, and R Markdown, using tools such as DOE Scatter Plot and SML2010 for multifactor, optimization, and regression data problems.
By the end of this book, you will be able to successfully carry out a preliminary investigation on any dataset, identify hidden insights, and present your results in a business context.
What you will learn
- Learn powerful R techniques to speed up your data analysis projects
- Import, clean, and explore data using powerful R packages
- Practice graphical exploratory analysis techniques
- Create informative data analysis reports using ggplot2
- Identify and clean missing and erroneous data
- Explore data analysis techniques to analyze multi-factor datasets
Who this book is for
Hands-On Exploratory Data Analysis with R is for data enthusiasts who want to build a strong foundation for data analysis. If you are a data analyst, data engineer, software engineer, or product manager, this book will sharpen your skills in the complete workflow of exploratory data analysis.
Trusted by 375,005 students
Access to over 1 million titles for a fair monthly price.
Study more efficiently using our study tools.
Information
Section 1: Setting Up Data Analysis Environment
- Chapter 1, Setting Up Our Data Analysis Environment
- Chapter 2, Importing Diverse Datasets
- Chapter 3, Examining, Cleaning, and Filtering
- Chapter 4, Graphically Visualize Data with ggplot2
- Chapter 5, Creating Aesthetically Pleasing Reports with Knitr and R Markdown
Setting Up Our Data Analysis Environment
- The benefits of EDA across vertical markets
- The most popular R packages for EDA
- Installing the required R packages and tools
Technical requirements
- You need to have the R language installed. Download the R installer from here: https://cran.r-‐project.org/.
- We recommend using RStudio. If you don't already have it installed, you can get it from the following link: https://www.rstudio.com/products/rstudio/download.
- Check that R and RStudio are working.
- Install the R packages required for the workshop.

- You will also need to have prior knowledge of the R programming language. Packt has a wide range of books and video titles that are available for this purpose.
- The code for this chapter is available at the following link:
https://github.com/PacktPublishing/Hands-On-Exploratory-Data-Analysis-with-R.
The benefits of EDA across vertical markets

- Pre Modeling Stage: This stage involves the manipulation of the data frame based on Data Visualization, Data Transformation, Missing Value Imputations, Outlier Detection, Feature Selection, and Dimension Reduction.
- Modeling Stage: This stage is considered as an intermediate stage that involves Continuous Regression, Ordinal Regression, Classification, Clustering, and Time Series with Survival.
- Post Modeling Stage: This stage is considered as a final stage where only output interpretation is considered on high priority. It includes the implementation of various algorithms such as clustering, classification, and regression.
Manipulating data
- readr: readr can be used to read flat, rectangular data into R. It works with both comma-separated and tab-separated values.
- readxl: We can use the readxl package to read data from MS Excel files.
- jsonlite: Web services have increasingly started to provide data in a JSON format. The jsonlite package is a good way to import this kind of data into R.
- httr, rvest: httr, and rvest are very good packages to get data from the web, either from web APIs or by web scraping.
- DBI: DBI is used to read data from relational databases into R.
Examining, cleaning, and filtering data
Table of contents
- Title Page
- Copyright and Credits
- Dedication
- About Packt
- Contributors
- Preface
- Section 1: Setting Up Data Analysis Environment
- Setting Up Our Data Analysis Environment
- Importing Diverse Datasets
- Examining, Cleaning, and Filtering
- Visualizing Data Graphically with ggplot2
- Creating Aesthetically Pleasing Reports with knitr and R Markdown
- Section 2: Univariate, Time Series, and Multivariate Data
- Univariate and Control Datasets
- Time Series Datasets
- Multivariate Datasets
- Section 3: Multifactor, Optimization, and Regression Data Problems
- Multi-Factor Datasets
- Handling Optimization and Regression Data Problems
- Section 4: Conclusions
- Next Steps
- Other Books You May Enjoy
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app