eBook - ePub

Data Mining and Exploration

Name: Data Mining and Exploration
ISBN: 9781000778076

From Traditional Statistics to Modern Data Science

Chong Ho Alex Yu,

280 pages
English
ePUB (mobile friendly)
Available on iOS & Android

eBook - ePub

Data Mining and Exploration

From Traditional Statistics to Modern Data Science

Chong Ho Alex Yu,

About this book

This book introduces both conceptual and procedural aspects of cutting-edge data science methods, such as dynamic data visualization, artificial neural networks, ensemble methods, and text mining. There are at least two unique elements that can set the book apart from its rivals.

First, most students in social sciences, engineering, and business took at least one class in introductory statistics before learning data science. However, usually these courses do not discuss the similarities and differences between traditional statistics and modern data science; as a result learners are disoriented by this seemingly drastic paradigm shift. In reaction, some traditionalists reject data science altogether while some beginning data analysts employ data mining tools as a "black box", without a comprehensive view of the foundational differences between traditional and modern methods (e.g., dichotomous thinking vs. pattern recognition, confirmation vs. exploration, single method vs. triangulation, single sample vs. cross-validation etc.). This book delineates the transition between classical methods and data science (e.g. from p value to Log Worth, from resampling to ensemble methods, from content analysis to text mining etc.). Second, this book aims to widen the learner's horizon by covering a plethora of software tools. When a technician has a hammer, every problem seems to be a nail. By the same token, many textbooks focus on a single software package only, and consequently the learner tends to fit the problem with the tool, but not the other way around. To rectify the situation, a competent analyst should be equipped with a tool set, rather than a single tool. For example, when the analyst works with crucial data in a highly regulated industry, such as pharmaceutical and banking, commercial software modules (e.g., SAS) are indispensable. For a mid-size and small company, open-source packages such as Python would come in handy. If the research goal is to create an executive summary quickly, the logical choice is rapid model comparison. If the analyst would like to explore the data by asking what-if questions, then dynamic graphing in JMP Pro is a better option. This book uses concrete examples to explain the pros and cons of various software applications.

Trusted by 375,005 students

Access to over 1.5 million titles for a fair monthly price.

Study more efficiently using our study tools.

Publisher

CRC Press

Year

2022

Print ISBN

9780367721466

eBook ISBN

9781000778076

Topic

Technology & Engineering

Subtopic

Data Mining

Index

Technology & Engineering

Cover Page
Title Page
Copyright Page
Dedication
Preface
Contents
1. Re-examination of Traditional Statistics
2. Why Data Science?
3. Cutting Edge Data Analytical Tools
4. Exploratory Data Analysis and Data Visualization
5. Generalized Regression Penalty against Complexity
6. Classification and Model Screening
7. Ensemble Methods The Wisdom of the Crowd
8. Dimension Reduction Breaking the Curse of Dimensionality
9. Clustering Divide and Conquer
10. Neural Networks Machines Mimic Human Intelligence
11. Text Mining Structure the Unstructured
Index

Frequently asked questions

Can I cancel at any time?

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription

Can I download books?

No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline

What is the difference between the pricing plans?

Perlego offers two plans: Essential and Complete

Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.5M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.

Both plans are available with monthly, semester, or annual billing cycles.

How does Perlego work?

We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1.5 million books across 990+ topics, we’ve got you covered! Learn about our mission

Do you support text-to-speech?

Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud

Can I read on my tablet or smartphone?

Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app

Is Data Mining and Exploration an online PDF/ePUB?

Yes, you can access Data Mining and Exploration by Chong Ho Alex Yu in PDF and/or ePUB format, as well as other popular books in Technology & Engineering & Data Mining. We have over 1.5 million books available in our catalogue for you to explore.

Data Mining and Exploration

From Traditional Statistics to Modern Data Science

Data Mining and Exploration

From Traditional Statistics to Modern Data Science

About this book

Trusted by 375,005 students

Information

Table of contents

Frequently asked questions