
- 224 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Hands on Data Science for Biologists Using Python
About this book
Hands-on Data Science for Biologists using Python has been conceptualized to address the massive data handling needs of modern-day biologists. With the advent of high throughput technologies and consequent availability of omics data, biological science has become a data-intensive field. This hands-on textbook has been written with the inception of easing data analysis by providing an interactive, problem-based instructional approach in Python programming language.
The book starts with an introduction to Python and steadily delves into scrupulous techniques of data handling, preprocessing, and visualization. The book concludes with machine learning algorithms and their applications in biological data science. Each topic has an intuitive explanation of concepts and is accompanied with biological examples.
Features of this book:
-
- The book contains standard templates for data analysis using Python, suitable for beginners as well as advanced learners.
-
- This book shows working implementations of data handling and machine learning algorithms using real-life biological datasets and problems, such as gene expression analysis; disease prediction; image recognition; SNP association with phenotypes and diseases.
-
- Considering the importance of visualization for data interpretation, especially in biological systems, there is a dedicated chapter for the ease of data visualization and plotting.
-
- Every chapter is designed to be interactive and is accompanied with Jupyter notebook to prompt readers to practice in their local systems.
Other avant-garde component of the book is the inclusion of a machine learning project, wherein various machine learning algorithms are applied for the identification of genes associated with age-related disorders. A systematic understanding of data analysis steps has always been an important element for biological research. This book is a readily accessible resource that can be used as a handbook for data analysis, as well as a platter of standard code templates for building models.
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
1
Why Learn Python
Protein = âMKLFWLLFTIGFCWAQYSSNTQQGRTSIVHLFEWRWVDIALECERYâ
Leu_contain = Protein.count(âLâ)/len(Protein)
print(Leu_contain)
Installing Python
Python 3.7.3 (v3.7.3:ef4ec6ed12, Mar 25 2019, 22:22:05) [MSC v.1916 64 bit
(AMD64)] on win32
Type âhelpâ, âcopyrightâ, âcreditsâ or âlicense()â for more information.
Âť>Instructions are typed after âÂť>â. Let us start typing our first instruction
and press enter.
Âť> print(âWelcome to Pythonâ)
Welcome to Python
Installing Anaconda Distribution
Running the Jupyter Notebook
Table of contents
- Cover
- Half Title
- Title Page
- Copyright Page
- Contents
- Preface
- Author Bio
- 1. Python: Introduction and Environment Setup
- 2. Basic Python Programming
- 3. Biopython
- 4. Python for Data Analysis
- 5. Python for Data Visualization
- 6. Principal Component Analysis
- 7. Hands-On Projects
- 8. Machine Learning and Linear Regression
- 9. Logistic Regression
- 10. K-Nearest Neighbors (K-NN)
- 11. Decision Trees and Random Forests
- 12. Support Vector Machines
- 13. Neural Nets and Deep Learning
- 14. The Machine Learning Project
- 15. Natural Language Processing
- 16. K-Means Clustering
- Index