
- 456 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
About this book
Data Science: A First Introduction focuses on using the R programming language in Jupyter notebooks to perform data manipulation and cleaning, create effective visualizations, and extract insights from data using classification, regression, clustering, and inference.
The text emphasizes workflows that are clear, reproducible, and shareable, and includes coverage of the basics of version control. All source code is available online, demonstrating the use of good reproducible project workflows.
Based on educational research and active learning principles, the book uses a modern approach to R and includes accompanying autograded Jupyter worksheets for interactive, self-directed learning. The book will leave readers well-prepared for data science projects.
The book is designed for learners from all disciplines with minimal prior knowledge of mathematics and programming. The authors have honed the material through years of experience teaching thousands of undergraduates in the University of British Columbia's DSCI100: Introduction to Data Science course.
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
Table of contents
- Cover Page
- Half-Title Page
- Series Page
- Title Page
- Copyright Page
- Dedication Page
- Contents
- Foreword
- Preface
- Acknowledgments
- About the authors
- 1 R and the Tidyverse
- 2 Reading in data locally and from the web
- 3 Cleaning and wrangling data
- 4 Effective data visualization
- 5 Classification I: training & predicting
- 6 Classification II: evaluation & tuning
- 7 Regression I: K-nearest neighbors
- 8 Regression II: linear regression
- 9 Clustering
- 10 Statistical inference
- 11 Combining code and text with Jupyter
- 12 Collaboration with version control
- 13 Setting up your computer
- Bibliography
- Index