
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Data Science Using Python and R
About this book
Learn data science by doing data science!
Data Science Using Python and R will get you plugged into the world's two most widespread open-source platforms for data science: Python and R.
Data science is hot. Bloomberg called data scientist "the hottest job in America." Python and R are the top two open-source data science tools in the world. In Data Science Using Python and R, you will learn step-by-step how to produce hands-on solutions to real-world business problems, using state-of-the-art techniques.
Data Science Using Python and R is written for the general reader with no previous analytics or programming experience. An entire chapter is dedicated to learning the basics of Python and R. Then, each chapter presents step-by-step instructions and walkthroughs for solving data science problems using Python and R.
Those with analytics experience will appreciate having a one-stop shop for learning how to do data science using Python and R. Topics covered include data preparation, exploratory data analysis, preparing to model the data, decision trees, model evaluation, misclassification costs, naïve Bayes classification, neural networks, clustering, regression modeling, dimension reduction, and association rules mining.
Further, exciting new topics such as random forests and general linear models are also included. The book emphasizes data-driven error costs to enhance profitability, which avoids the common pitfalls that may cost a company millions of dollars.
Data Science Using Python and R provides exercises at the end of every chapter, totaling over 500 exercises in the book. Readers will therefore have plenty of opportunity to test their newfound data science skills and expertise. In the Hands-on Analysis exercises, readers are challenged to solve interesting business problems using real-world data sets.
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
Table of contents
- COVER
- TABLE OF CONTENTS
- PREFACE
- ABOUT THE AUTHORS
- ACKNOWLEDGMENTS
- Chapter 1: INTRODUCTION TO DATA SCIENCE
- Chapter 2: THE BASICS OF PYTHON AND R
- Chapter 3: DATA PREPARATION
- Chapter 4: EXPLORATORY DATA ANALYSIS
- Chapter 5: PREPARING TO MODEL THE DATA
- Chapter 6: DECISION TREES
- Chapter 7: MODEL EVALUATION
- Chapter 8: NAÏVE BAYES CLASSIFICATION
- Chapter 9: NEURAL NETWORKS
- Chapter 10: CLUSTERING
- Chapter 11: REGRESSION MODELING
- Chapter 12: DIMENSION REDUCTION
- Chapter 13: GENERALIZED LINEAR MODELS
- Chapter 14: ASSOCIATION RULES
- APPENDIX DATA SUMMARIZATION AND VISUALIZATION
- INDEX
- END USER LICENSE AGREEMENT