
Data Mining and Machine Learning Applications
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Data Mining and Machine Learning Applications
About this book
DATA MINING AND MACHINE LEARNING APPLICATIONS
The book elaborates in detail on the current needs of data mining and machine learning and promotes mutual understanding among research in different disciplines, thus facilitating research development and collaboration.
Data, the latest currency of today's world, is the new gold. In this new form of gold, the most beautiful jewels are data analytics and machine learning. Data mining and machine learning are considered interdisciplinary fields. Data mining is a subset of data analytics and machine learning involves the use of algorithms that automatically improve through experience based on data.
Massive datasets can be classified and clustered to obtain accurate results. The most common technologies used include classification and clustering methods. Accuracy and error rates are calculated for regression and classification and clustering to find actual results through algorithms like support vector machines and neural networks with forward and backward propagation. Applications include fraud detection, image processing, medical diagnosis, weather prediction, e-commerce and so forth.
The book features:
- A review of the state-of-the-art in data mining and machine learning,
- A review and description of the learning methods in human-computer interaction,
- Implementation strategies and future research directions used to meet the design and application requirements of several modern and real-time applications for a long time,
- The scope and implementation of a majority of data mining and machine learning strategies.
- A discussion of real-time problems.
Audience
Industry and academic researchers, scientists, and engineers in information technology, data science and machine and deep learning, as well as artificial intelligence more broadly.
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
1
Introduction to Data Mining
AbstractData mining, as its name suggests āminingā, is nothing but extracting the desired, meaningful exact information from the datasets. Its methods and algorithms help researchers and students develop the numerous applications to be used by the end-users. Its presence in the healthcare industry, marketing, scientific applications, etc., enables the end-users to extract the meaningful required information from the collection. In the initial section, we discuss KDDāknowledge discovery in the database with its different phases like data cleaning, data integration, data selection and transformation, representation. In this chapter, we give a brief introduction to data mining. Comparative discussion about classification and clustering helps the end-user to distinguish these techniques. We also discuss its applications, algorithms, etc. An introduction to a basic clustering algorithm, K-means clustering, hierarchical clustering, fuzzy clustering, and density-based clustering, will help the end-user to select a specific algorithm as per the application. In the last section of this chapter, we introduce various data mining tools like Python, Rapid Miner, and KNIME, etc., to the user to extract the required information.Keywords: Data mining, KDD, clustering, classification, Python, KNIME
1.1 Introduction
1.1.1. Data Mining
1.2 Knowledge Discovery in Database (KDD)
- Data cleaning: This step can be defined as removing irrelevant data. Removing irrelevant data is nothing but unwanted data; records can be removed. Data collection may consist of missing values which must be either needs to be removed or should impute the missing information [7].
Figure 1.1 Knowledge discovery in DatabaseāKDD. - Data integration: Data is collected from heterogeneous sources and integrated into a common source like data-warehouse (DW). A very common technique, Extract-Transform-Load (ETL), is beneficial in this regard. Integrating the data from multiple sources requires proper synchronization between the systems [2].
- Data selection & transformation: Once the required data is selected, the next task is data transformation. As its name suggests transformation, it is nothing but transforming it into the desired mining procedure [8, 9].
- Pattern evaluation: Evaluation is based on some measures; once these measures are applied, retrieved results are strictly compared/evaluated based on the stored patterns [9ā11].
- Knowledge representation: It is nothing but representing the processed data into the required formats such as tables and reports. One can say knowledge representation generates the rules, and using the exact visualization is possible [10].
1.2.1 Importance of Data Mining
- ⦠Useful in predictive analysis.
- ⦠They are storing and managing data in multidimensional systems.
- ⦠They are identifying the hidden patterns.
- ⦠Knowledge representation in desired formats, etc. [11].
1.2.2 Applications of Data Mining
- Fraud Detection
- ⦠Data mining identifies patterns, i.e...
Table of contents
- Cover
- Table of Contents
- Title Page
- Copyright
- Preface
- 1 Introduction to Data Mining
- 2 Classification and Mining Behavior of Data
- 3 A Comparative Overview of Hybrid Recommender Systems: Review, Challenges, and Prospects
- 4 Stream Mining: Introduction, Tools & Techniques and Applications
- 5 Data Mining Tools and Techniques: Clustering Analysis
- 6 Data Mining Implementation Process
- 7 Predictive Analytics in IT Service Management (ITSM)
- 8 Modified Cross-Sell Model for Telecom Service Providers Using Data Mining Techniques
- 9 Inductive Learning Including Decision Tree and Rule Induction Learning
- 10 Data Mining for Cyber-Physical Systems
- 11 Developing Decision Making and Risk Mitigation: Using CRISP-Data Mining
- 12 HumanāMachine Interaction and Visual Data Mining
- 13 MSDTrA: A Boosting Based-Transfer Learning Approach for Class Imbalanced Skin Lesion Dataset for Melanoma Detection
- 14 New Algorithms and Technologies for Data Mining
- 15 Classification of EEG Signals for Detection of Epileptic Seizure Using Restricted Boltzmann Machine Classifier
- 16 An Enhanced Security of Women and Children Using Machine Learning and Data Mining Techniques
- 17 Conclusion and Future Direction in Data Mining and Machine Learning
- Index
- End User License Agreement