
- 218 pages
- English
- PDF
- Available on iOS & Android
About this book
Nowadays, due to the advancement and significantly rapid growth in the technology, the collection of high-dimensional data is no longer a tedious task. Regardless of considerable advances in technology over the last few decades, the analysis of high-dimensional data faces new challenges concerning interpretation and integration. One of the major problems in high-dimensional data is the occurrence of missing values. The problem is in particular hard to handle when the distributional forms of the variables are different or the variables are measured on different measurement scales (e.g. binary, multi-categorical, continuous, etc.). Whatever the reason, missing data may occur in all areas of applied research.The inadequate handling of missing values may lead to biased results and incorrect inference. The standard statistical techniques for analyzing the data require complete cases without any missing observations. The deletion of the cases with missing information to obtain complete data will not only cause the loss of important information but can also affect inferences. In this dissertation, different imputation techniques using nearest neighbors are developed to address the missing data issues in high-dimensional as well as low dimensional data structures.
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
Table of contents
- Introduction
- Methodological Concepts for Missing Data
- Improved Methods for the Imputation of Missing Data by Nearest Neighbor Methods
- Missing Value Imputation for Gene Expression Data by Tailored Nearest Neighbors
- Nearest Neighbor Imputation for Categorical Data by Weighting of Attributes
- Imputation Methods for High-Dimensional Mixed-Type Datasets by Nearest Neighbors
- Bootstrap Inference for Weighted Nearest Neighbors Imputation
- Missing Values in Classification: Improved Imputation Methods for High-Dimensional Settings
- Multiple Imputation Using Nearest Neighbor Methods
- Conclusion and Outlook
- Appendices
- Additional Results for Chapter 5
- Appendix for Chapter 7
- Appendix for Chapter 8
- Appendix for Chapter 9
- References