Expect the Unexpected
eBook - ePub

Expect the Unexpected

A First Course in Biostatistics

Raluca Balan, Gilles Lamothe;;;

Partager le livre
  1. 316 pages
  2. English
  3. ePUB (adapté aux mobiles)
  4. Disponible sur iOS et Android
eBook - ePub

Expect the Unexpected

A First Course in Biostatistics

Raluca Balan, Gilles Lamothe;;;

DĂ©tails du livre
Aperçu du livre
Table des matiĂšres
Citations

À propos de ce livre

-->

This textbook introduces the basic concepts from probability theory and statistics which are needed for statistical analysis of data encountered in the biological and health sciences. No previous study is required. Advanced mathematical tools, such as integration and differentiation, are kept to a minimum. The emphasis is put on the examples. Probabilistic methods are discussed at length, but the focus of this edition is on statistics.

The examples are kept simple, so that the reader can learn quickly and see the usefulness of various statistical and probabilistic methods. Some of the examples used in this book draw attention to various problems related to environmental issues, climate change, loss of bio-diversity, and their impact on wildlife and humans.

In comparison with the first edition of the book, this second edition contains additional topics such as power, sample size computation and non-parametric methods, and includes a large collection of new problems, as well as the answers to odd-numbered problems. Several sections of this edition are accompanied by instructions using the programming language R for statistical computing and graphics.

-->

Request Inspection Copy


--> Contents:

  • Probability:
    • Introduction to Probability
    • Axioms of Probability
    • Conditional Probability
    • Discrete Random Variables
    • Continuous Random Variables
    • Supplementary Problems (Probability)
  • Statistics:
    • Introduction to Statistics
    • Confidence Intervals
    • Hypothesis Testing
    • Comparison of Two Independent Samples
    • Paired Samples
    • Categorical Data
    • Regression and Correlation
    • Supplementary Problems (Statistics)
  • Additional Topics:
    • Sample Size and Power
    • Non-Parametric Methods
    • Answers to Odd-Numbered Problems
    • Tables

-->
--> Readership: Undergraduates studying probability and statistics, especially in the life and natural sciences. -->
Probability Theory;Statistical Inference;Biostatistics;Statistics;Calculus-Based Statistics;Genetics;Biology;Ecology;Health;Life Sciences;Natural Sciences Key Features:

  • This edition contains a large collection of new problems and includes the answers to odd-numbered problems
  • Several sections of the book are accompanied by a technology component containing instructions using the programming language R for statistical computing and graphics
  • Although probabilistic methods are discussed at length in the book, the focus of this edition is on statistics, with a new chapter dedicated to non-parametric methods in statistics

Foire aux questions

Comment puis-je résilier mon abonnement ?
Il vous suffit de vous rendre dans la section compte dans paramĂštres et de cliquer sur « RĂ©silier l’abonnement ». C’est aussi simple que cela ! Une fois que vous aurez rĂ©siliĂ© votre abonnement, il restera actif pour le reste de la pĂ©riode pour laquelle vous avez payĂ©. DĂ©couvrez-en plus ici.
Puis-je / comment puis-je télécharger des livres ?
Pour le moment, tous nos livres en format ePub adaptĂ©s aux mobiles peuvent ĂȘtre tĂ©lĂ©chargĂ©s via l’application. La plupart de nos PDF sont Ă©galement disponibles en tĂ©lĂ©chargement et les autres seront tĂ©lĂ©chargeables trĂšs prochainement. DĂ©couvrez-en plus ici.
Quelle est la différence entre les formules tarifaires ?
Les deux abonnements vous donnent un accĂšs complet Ă  la bibliothĂšque et Ă  toutes les fonctionnalitĂ©s de Perlego. Les seules diffĂ©rences sont les tarifs ainsi que la pĂ©riode d’abonnement : avec l’abonnement annuel, vous Ă©conomiserez environ 30 % par rapport Ă  12 mois d’abonnement mensuel.
Qu’est-ce que Perlego ?
Nous sommes un service d’abonnement Ă  des ouvrages universitaires en ligne, oĂč vous pouvez accĂ©der Ă  toute une bibliothĂšque pour un prix infĂ©rieur Ă  celui d’un seul livre par mois. Avec plus d’un million de livres sur plus de 1 000 sujets, nous avons ce qu’il vous faut ! DĂ©couvrez-en plus ici.
Prenez-vous en charge la synthÚse vocale ?
Recherchez le symbole Écouter sur votre prochain livre pour voir si vous pouvez l’écouter. L’outil Écouter lit le texte Ă  haute voix pour vous, en surlignant le passage qui est en cours de lecture. Vous pouvez le mettre sur pause, l’accĂ©lĂ©rer ou le ralentir. DĂ©couvrez-en plus ici.
Est-ce que Expect the Unexpected est un PDF/ePUB en ligne ?
Oui, vous pouvez accĂ©der Ă  Expect the Unexpected par Raluca Balan, Gilles Lamothe;;; en format PDF et/ou ePUB ainsi qu’à d’autres livres populaires dans MatemĂĄticas et Probabilidad y estadĂ­stica. Nous disposons de plus d’un million d’ouvrages Ă  dĂ©couvrir dans notre catalogue.

Informations

Éditeur
WSPC
Année
2017
ISBN
9789813209084

PART 2

Statistics

Chapter 7

Introduction to Statistics

Statistics is one of the oldest disciplines in science, whose origins can be traced back to the 17th century when the British administration needed a tool for analyzing various demographic and economical data. The scope of the discipline became larger in the 19th century to include the analysis of data in general. Today, statistics is employed by people working in diverse fields, like economics, engineering, social sciences, and natural sciences.
In this chapter, we discuss several methods for analyzing data, using numerical summaries and graphical tools. We emphasize the distinction between a population and a random sample from a population. We explain how a random sample can be used to estimate population parameters, and discuss ways to measure the estimation error. Finally, we end this chapter with a discussion on the sampling distribution of estimators. We also give the Central Limit Theorem which states that the distribution of a sample mean can be approximated by a normal distribution.

7.1 Random Sampling and Data Description

In this section, we learn to describe data using numerical summaries (called descriptive statistics) and graphical representations. We consider the data as observations from a random variable. The set of these observations is called a random sample. The techniques that we use to describe the sample depend on the variable type.
If the values of the variable represent categories, then we say that the variable is categorical. The table below contains examples of categorical variables.
Variable Categories
color of pea pod yellow, green
type of fish Northern pike, Rainbow trout, Catfish
height small, medium, large
A variable is called quantitative (or numerical) if it represents a numerical quantity. Temperature (in Kelvin), surface area in (cm2), volume (in m3), height (in cm), and number of diseased individuals, are examples of quantitative variables.
For categorical variables an easy and effective way to describe the data is to display a frequency distribution or a relative frequency distribution. When defining the categories one has to be careful in defining mutually exclusive classes, otherwise the relative frequencies do not add up to 1. The (relative) frequency distribution can be displayed as a table, or graphically, as a bar chart.
Example 7.1. A fish tumor survey was conducted in a particular river system. Of particular interest were liver tumors and tumors in the mouth. A random sample of n = 123 fish were captured, classified and released. The frequency distribution is displayed below in tabular form and as a bar chart in Figure 7.1.
Tumor Classification
Frequency
Relative Frequency
only liver
35
28.5%
only mouth
10
8.1%
both
3
2.4%
no tumors
75
61.0%
Total
123
100%
Many biological studies are comparative in nature. These studies usually involve two or more variables. In the case of two categorical variables, we can start by cross-classifying the observations according to the joint categories of the two variables. The resulting table is called a contingency table and it displays the joint (relative) frequency distribution of the two variables.
To describe the association between the two variables, we can compute conditional relative frequency distributions for one of the variables conditioned on the categories of the other variables. The conditional relative frequency distribution can be displayed as a side-by-side bar chart.
images
Fig. 7.1 Distribution of fish tumors
Example 7.2. Consider a fish tumor survey similar to Example 7.1. We would like to compare the fish tumor distributions in two river systems. A summary of the data is found in the following contingency table, which is a cross-classification of the fish according to the tumor category and the river systems. Each cell represents a joint frequency. In the parenthesis, we computed the conditional relative frequency for the tumour variable conditioned on the river system.
images
In Figure 7.2, we find a side-by-side bar chart of the conditional distributions for tumor. The distribution of tumors do appear to be heterogeneous. In fact, it appears that fish from the second river system are more likely to have no tumors.
The frequency distribution is an important tool to describe the random sample from a quantitative variable. The frequency distribut...

Table des matiĂšres