
- 272 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Regression Models for Categorical and Count Data
About this book
This text provides practical guidance on conducting regression analysis on categorical and count data. Step by step and supported by lots of helpful graphs, it covers both the theoretical underpinnings of these methods as well as their application, giving you the skills needed to apply them to your own research. It offers guidance on:
¡      Using logistic regression models for binary, ordinal, and multinomial outcomes
¡      Applying count regression, including Poisson, negative binomial, and zero-inflated models
¡      Choosing the most appropriate model to use for your research
¡      The general principles of good statistical modelling in practice
Part of The SAGE Quantitative Research Kit, this book will give you the know-how and confidence needed to succeed on your quantitative research journey
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
1 Introduction
Chapter Overview
- Why study regression models for categorical and count data? 2
- A few words on terminology 3
- Why do we need to look beyond linear regression? 4
- Regression beyond the linear model: an illustrated introduction 4
- Linear regression: a reminder, with some mathematical notation 8
- Generalised linear models 10
- Whatâs the same and whatâs different 11
- How you might use this book 13
- Further Reading 13
- Dichotomous variables have exactly two categories. Examples are presence of an illness (the patient is either âillâ or ânot illâ) or retirement status (âretiredâ or ânot retiredâ).
- Ordinal variables have three or more categories that can be placed in a natural order. Examples are highest qualification (âno qualificationâ, âcompleted primary schoolâ, âcompleted secondary schoolâ, âuniversity degreeâ, etc.) or subjective health status based on an ordered response scale (âpoor healthâ, âfairâ, âgoodâ, âvery good healthâ).
- Nominal variables have three or more categories that cannot be placed in a meaningful order. Examples are choice of study subject (âscienceâ, âhumanitiesâ, âartsâ, etc.) or type of accommodation (ârentedâ, âowned with mortgageâ, âowned outrightâ, ânursing home or other institutionâ, âhomelessâ).
- Logistic regression for dichotomous (binary) outcomes
- The general ordered logit model for ordinal outcomes (also known as ordinal logistic regression)
- Multinomial logistic regression for nominal outcomes
- Several models for count outcomes, including Poisson and negative binomial regression, as well as zero-truncated, zero-inflated and hurdle models
Why study regression models for categorical and count data?
- Health inequalities: In England, people living in poorer areas are less likely to make use of free eye tests than people living in richer areas. Why?
- Mental health: Can we identify childhood experiences and characteristics that are associated with the risk of developing an eating disorder as an adult?
- Sociology of culture: Do people choose their cultural activities to display their social status?
- Political science: Under what circumstances are local politicians prepared to tolerate illegal street vendors in their cities?
- Sociology of religion: Is it true that people with a strong religious identity are more likely to be happy with their lives? And if so, why might that be?
A few words on terminology
Outcome and predictor
- The outcome variable is also known as the dependent variable, or the response. It is usually denoted by the letter Y.
- Predictor variables are also known as independent variables, explanatory variables, or exposures. The conventional symbol for a predictor variable is X. When there are multiple predictor variables, they are identified by numeric subscripts: X1, X2, X3, and so forth.
Types of variables
- Continuous variables: A continuous variable is a numeric variable that can take any value within its possible range. For example, age is a continuous variable: a person can be 28 years old, 28.4 years old, or even 28.397853 years old. Age changes every day, every minute, every second, so our measurement of age is limited only by how precise we can or wish to be.
- Discrete variables: A discrete variable is a numeric variable that can only take particular, âdiscreteâ values. Count variables are discrete. They can take the values 0, 1, 2, 3 and so on. Consider the count variable ânumber of childrenâ: you can have zero children, one child or seven children, but not 1.5 children.
Why do we need to look beyond linear regression?
Table of contents
- Cover
- Half Title
- Acknowledgements
- Title Page
- Copyright Page
- Contents
- Illustration List
- About the Author
- Acknowledgements
- Preface
- 1 Introduction
- 2 Logistic Regression
- 3 Ordinal Logistic Regression: The Generalised Ordered Logit Model
- 4 Multinomial Logistic Regression
- 5 Regression Models for Count Data
- 6 The Practice of Modelling
- Glossary
- References
- Index