
- English
- PDF
- Available on iOS & Android
Foundations of Data Science
About this book
This book provides an introduction to the mathematical and algorithmic foundations of data science, including machine learning, high-dimensional geometry, and analysis of large networks. Topics include the counterintuitive nature of data in high dimensions, important linear algebraic techniques such as singular value decomposition, the theory of random walks and Markov chains, the fundamentals of and important algorithms for machine learning, algorithms and analysis for clustering, probabilistic models for large networks, representation learning including topic modelling and non-negative matrix factorization, wavelets and compressed sensing. Important probabilistic techniques are developed including the law of large numbers, tail inequalities, analysis of random projections, generalization guarantees in machine learning, and moment methods for analysis of phase transitions in large random graphs. Additionally, important structural and complexity measures are discussed such as matrix norms and VC-dimension. This book is suitable for both undergraduate and graduate courses in the design and analysis of algorithms for data.
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
Table of contents
- Cover
- Half-title
- Title page
- Copyright information
- Contents
- 1 Introduction
- 2 High-Dimensional Space
- 3 Best-Fit Subspaces and Singular Value Decomposition (SVD)
- 4 Random Walks and Markov Chains
- 5 Machine Learning
- 6 Algorithms for Massive Data Problems: Streaming, Sketching, and Sampling
- 7 Clustering
- 8 Random Graphs
- 9 Topic Models, Nonnegative Matrix Factorization, Hidden Markov Models, and Graphical Models
- 10 Other Topics
- 11 Wavelets
- 12 Background Material
- References
- Index