Data Mining and Machine Learning Applications
eBook - ePub

Data Mining and Machine Learning Applications

  1. English
  2. ePUB (mobile friendly)
  3. Available on iOS & Android
eBook - ePub

Data Mining and Machine Learning Applications

About this book

DATA MINING AND MACHINE LEARNING APPLICATIONS

The book elaborates in detail on the current needs of data mining and machine learning and promotes mutual understanding among research in different disciplines, thus facilitating research development and collaboration.

Data, the latest currency of today's world, is the new gold. In this new form of gold, the most beautiful jewels are data analytics and machine learning. Data mining and machine learning are considered interdisciplinary fields. Data mining is a subset of data analytics and machine learning involves the use of algorithms that automatically improve through experience based on data.

Massive datasets can be classified and clustered to obtain accurate results. The most common technologies used include classification and clustering methods. Accuracy and error rates are calculated for regression and classification and clustering to find actual results through algorithms like support vector machines and neural networks with forward and backward propagation. Applications include fraud detection, image processing, medical diagnosis, weather prediction, e-commerce and so forth.

The book features:

  • A review of the state-of-the-art in data mining and machine learning,
  • A review and description of the learning methods in human-computer interaction,
  • Implementation strategies and future research directions used to meet the design and application requirements of several modern and real-time applications for a long time,
  • The scope and implementation of a majority of data mining and machine learning strategies.
  • A discussion of real-time problems.

Audience

Industry and academic researchers, scientists, and engineers in information technology, data science and machine and deep learning, as well as artificial intelligence more broadly.

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn more here.
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access Data Mining and Machine Learning Applications by Rohit Raja,Kapil Kumar Nagwanshi,Sandeep Kumar,K. Ramya Laxmi in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Warehousing. We have over one million books available in our catalogue for you to explore.

Information

1
Introduction to Data Mining

Santosh R. Durugkar1, Rohit Raja2, Kapil Kumar Nagwanshi3* and Sandeep Kumar4
1Amity University Rajasthan, Jaipur, India
2IT Department, GGV Bilaspur Central University, Bilaspur, India
3ASET, Amity University Rajasthan, Jaipur, India
4Computer Science and Engineering Department, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Andra Pradesh, India
Abstract
Data mining, as its name suggests ā€œminingā€, is nothing but extracting the desired, meaningful exact information from the datasets. Its methods and algorithms help researchers and students develop the numerous applications to be used by the end-users. Its presence in the healthcare industry, marketing, scientific applications, etc., enables the end-users to extract the meaningful required information from the collection. In the initial section, we discuss KDD—knowledge discovery in the database with its different phases like data cleaning, data integration, data selection and transformation, representation. In this chapter, we give a brief introduction to data mining. Comparative discussion about classification and clustering helps the end-user to distinguish these techniques. We also discuss its applications, algorithms, etc. An introduction to a basic clustering algorithm, K-means clustering, hierarchical clustering, fuzzy clustering, and density-based clustering, will help the end-user to select a specific algorithm as per the application. In the last section of this chapter, we introduce various data mining tools like Python, Rapid Miner, and KNIME, etc., to the user to extract the required information.
Keywords: Data mining, KDD, clustering, classification, Python, KNIME

1.1 Introduction

1.1.1. Data Mining

ā€˜Mining’—extracts the meaningful information from the databases. This method helps the researchers, students, and other IT professionals remove the exact significant details and develop the desired applications [1, 2]. It is also known as Knowledge Discovery from databases—KDD. The applications of KDD may include medical/hospitals, Marketing, Educational systems, Scientific applications, E-commerce, Retail industries, Biological analysis, Counterterrorism, use in data-warehouse, in the energy sector for decision making, Spatial data mining, and Logistics [4–6].

1.2 Knowledge Discovery in Database (KDD)

It helps detect the new patterns of previously unknown data, i.e., extracting the hidden patterns, data from the massive volume of datasets [3, 6]. Figure 1.1 gives an idea about Knowledge discovery in Database—KDD, which consists of the following phases:
  • Data cleaning: This step can be defined as removing irrelevant data. Removing irrelevant data is nothing but unwanted data; records can be removed. Data collection may consist of missing values which must be either needs to be removed or should impute the missing information [7].
    Schematic illustration of the knowledge discovery in DatabaseĀ—KDD.
    Figure 1.1 Knowledge discovery in Database—KDD.
  • Data integration: Data is collected from heterogeneous sources and integrated into a common source like data-warehouse (DW). A very common technique, Extract-Transform-Load (ETL), is beneficial in this regard. Integrating the data from multiple sources requires proper synchronization between the systems [2].
  • Data selection & transformation: Once the required data is selected, the next task is data transformation. As its name suggests transformation, it is nothing but transforming it into the desired mining procedure [8, 9].
  • Pattern evaluation: Evaluation is based on some measures; once these measures are applied, retrieved results are strictly compared/evaluated based on the stored patterns [9–11].
  • Knowledge representation: It is nothing but representing the processed data into the required formats such as tables and reports. One can say knowledge representation generates the rules, and using the exact visualization is possible [10].

1.2.1 Importance of Data Mining

  • ā—¦ Useful in predictive analysis.
  • ā—¦ They are storing and managing data in multidimensional systems.
  • ā—¦ They are identifying the hidden patterns.
  • ā—¦ Knowledge representation in desired formats, etc. [11].

1.2.2 Applications of Data Mining

  • Fraud Detection
    • ā—¦ Data mining identifies patterns, i.e...

Table of contents

  1. Cover
  2. Table of Contents
  3. Title Page
  4. Copyright
  5. Preface
  6. 1 Introduction to Data Mining
  7. 2 Classification and Mining Behavior of Data
  8. 3 A Comparative Overview of Hybrid Recommender Systems: Review, Challenges, and Prospects
  9. 4 Stream Mining: Introduction, Tools & Techniques and Applications
  10. 5 Data Mining Tools and Techniques: Clustering Analysis
  11. 6 Data Mining Implementation Process
  12. 7 Predictive Analytics in IT Service Management (ITSM)
  13. 8 Modified Cross-Sell Model for Telecom Service Providers Using Data Mining Techniques
  14. 9 Inductive Learning Including Decision Tree and Rule Induction Learning
  15. 10 Data Mining for Cyber-Physical Systems
  16. 11 Developing Decision Making and Risk Mitigation: Using CRISP-Data Mining
  17. 12 Human–Machine Interaction and Visual Data Mining
  18. 13 MSDTrA: A Boosting Based-Transfer Learning Approach for Class Imbalanced Skin Lesion Dataset for Melanoma Detection
  19. 14 New Algorithms and Technologies for Data Mining
  20. 15 Classification of EEG Signals for Detection of Epileptic Seizure Using Restricted Boltzmann Machine Classifier
  21. 16 An Enhanced Security of Women and Children Using Machine Learning and Data Mining Techniques
  22. 17 Conclusion and Future Direction in Data Mining and Machine Learning
  23. Index
  24. End User License Agreement