Think about how often weâre exposed to data of some sort. Reports of studies in newspapers, magazines, and online provide data about people, animals, or even abstract entities such as cities, counties, or countries. Life expectancies, crime rates, pollution levels, the prevalence of diseases, unemployment rates, election results, and numerous other phenomena are presented with overwhelming frequency and in painful detail. Understanding statisticsâor at least being able to talk intelligently about percentages, means, and margins of errorâhas become nearly compulsory for the well-informed person. Yet, few people understand enough about statistics to fully grasp not only the strengths but also the weaknesses of the way data are collected and analyzed. What does it mean to say that the life expectancy in the U.S. is 78.7 years? Should we trust exit polls that claim that Wexton will win the election over Comstock by 5% (with a âmargin of errorâ of Âą 2%)? When someone claims that âtaking calcium supplements is not associated with a significantly lower risk of bone fractures in elderly women,â what are they actually saying? These questions, as well as many others, are common in todayâs world of statistical analysis and numeracy.
For the budding social or behavioral scientist, whether sociologist, psychologist, geographer, political scientist, or economist, avoiding quantitative analyses that move beyond simple statistics such as percentages, means, standard deviations, and t-tests is almost impossible. A large proportion of studies found in professional journals employ statistical models that are designed to predict or explain the occurrence of one variable with information about other variables. The most common type of prediction tool is a regression model. Many books and articles describe, for example, how to conduct a linear regression analysis (LRA) or estimate an LRM,1 which, as noted in the Preface, is designed to account for or predict the values of a single outcome variable with information from one or more explanatory variables. Students are usually introduced to this model in a second course on applied statistics, and it is the main focus of this book. Before beginning a detailed description of LRMs, though, letâs address some general issues that all researchers and consumers of statistics should bear in mind.
1The word linear is defined as âcapable of being represented by a straight line on a graphâ (Oxford English Dictionary, definition 3, https://www.oed.com). Why the definition specifies a straight line will become clear in Chapter 3.
Our Doubts are Traitors and Make Us Lose the Good We Oft Might Win2
2William Shakespeare, Measure for Measure, Act I, Scene IV.
A critical issue I hope readers will ponder as they study the material in the following chapters involves perceptions of quantitative research. Statistics has, for better or worse, been maligned by a variety of observers in recent years. For one thing, the so-called âreplication crisisâ has brought to light the problem that the results of many studies in the social and behavioral sciences cannot be confirmed by subsequent studies.3 Books with titles such as How to Lie with Statistics are also popular4 and can lend an air of disbelief to many studies that use statistical models. Researchers and statistics educators are often to blame for this disbelief. We frequently fail to impart some important caveats to students and consumers, including:
3See, for example, Ed Yong (2018), âPsychologyâs Replication Crisis Is Running Out of Excuses,â The Atlantic, November 19 (retrieved from https://www.theatlantic.com/science/archive/2018/11/psychologys-replication-crisis-real/576223).
4The bookâs cover notes that it has sold âover half a million copiesâ (see Darrell Huff (1993), How to Lie with Statistics, New York: W.W. Norton). I suspect this makes it one of the best (if not the best) selling statistics books of all time.
- A single study is never the end of the story; multiple studies are needed before we can (or should) reach defensible conclusions about social and behavioral phenomena.
- Consumers and researchers need to embrace a healthy dose of skepticism when considering the results of research studies.5 They should ask questions about how data were collected, how variables were measured, and whether the appropriate statistical methods were used. We should also realize that random or sampling âerrorâ (see Chapter 2) affects the results of even the best designed studies.
- People should be encouraged to use their common sense and reasoning skills when assessing data and the results of analyses. Although itâs important to minimize confirmation bias and similar cognitive tendencies that (mis)shape how we process and interpret information, we should still consider whether research findings are based on sound premises and follow a logical pattern given what we already know about a phenomenon.
5Healthy is the operative word here. Unfortunately, I fear that many people have become overly skeptical about the results of scientific studies, even from those that are rigorously designed and executed. In the U.S. there have been increasing skepticism, mistrust of people and institutions, and political polarization that affect worldviews and beliefs. This can motivate some to dismiss important research findings that might otherwise be beneficial (see Esteban Ortiz-Ospina and Max Roser (2019), âTrust,â at https://ourworldindata.org/trust; Gleb Tsipursky (2018), â(Dis)trust in Science,â Scientific American, July 5; and Jan Mewes et al. (2021), âExperiences Matter: A Longitudinal Study of Individual-Level Sources of Declining Social Trust in the United States,â Social Science Research, retrieved from https://doi.org/10.1016/j.ssresearch.2021.102537).
Best Statistical Practices6
6Adapted from E. Ashley Steel, Martin Liermann, and Peter Guttorp (2019), âBeyond Calculations: A Course in Statistical Thinking,â American Statistician 73(S1): 392â401.
In the spirit of these three admonitions, it is wise to heed the following advice regarding data analysis in general and regression analysis in particular.
- Plot your dataâearly and often.
- Understand that your dataset is only one of many possible sets of data that could have been observed.
- Understand the context of your datasetâwhat is the background science and how were measurements taken (for example, survey questions or direct measures)? What are the limitations of the measurement tools used to collect the data? Are some data missing? Why?
- Be thoughtful in choosing summary statistics.
- Decide early which parts of your analysis are exploratory and which parts are confirmatory, and preregister7 your hypotheses, if not formally then at least in your own mind.
- If you use p-values,8 which can provide some evidence regarding statistical results, follow these principles:
- Report effect sizes and confidence intervals (CIs);
- Consider providing graphical evidence of predicted values or effect sizes to display for your audience the magnitude of differences furnished by the analysis;
- Report the number of tests you conduct (formal and informal);
- Interpret the p-value in light of your sample size (and power);
- Donât use p-values to claim that the null hypothesis of no difference is true; and
- Consider the p-value as, at best, only one source of evidence regarding your conclusion rather than the conclusion itself.
- Consider creating customized, simulation-based statistical tests for answering your specific question with your particular dataset.
- Use simulations to understand the performance of your statistical plan on datasets like yours and to test various assumptions.
- Read results with skepticism, remembering that patterns can easily occur by chance (especially with small samples), and that unexpected results based on small sample sizes are often wrong.
- Interpret statistical results or patterns in data as being consistent or inconsistent with a conceptual model or hypothesis instead of claiming that they reveal or prove some phenomenon or relationship (see Chapter 2 for an elaboration of this recommendation).
7Preregistration is a growing trend wherein researchers publicly identify the hypotheses that guide their work early in the process. They then restrict the analysis to testing those hypotheses and not others. A common view is that researchers must distinguish the hypothesis generating process from the hypothesis testing process. Preregistration is designed to guard against âfishing expeditionsâ: the tendency to estimate several statistical models and then choose one to report that seems the most interesting or innovative. The point in practice 5 is that we should always preregister hypotheses and the conceptual models or theories that guide them, even if informally, and avoid the temptation to keep estimating statistical models until we âconfirmâ some attractive, yet post hoc, hypothesis. For additional information, see Brian A. Nosek et al. (2018), âThe Preregistration Revolution,â PNAS 115(11): 2600â2606.
8These, as well as effect sizes, CIs, and hypothesis tests, are described in detail in Chapter 2.
The material presented in the following chapters is not completely faithful to these practices. For example, we donât cover how variables are measured, hypothesis generation, or simulations (but see Appendix B), and we are at times too willing to trust p-values (see Chapter 2). These practices should, nonetheless, be at the forefront of all researchersâ minds as they consider how to plan, execute, and report their own research.
I hope readers of subsequent chapters will be comfortable thinking about the results of quantitative studies as they consider this material and as they embark on their own studies. In fact, I never wish to underemphasize the importance of careful reasoning among those assessing and using statistical techniques. Nor should we suspend our common sense and knowledge of the research literature simply because a set of numbers supports some unusual conclusion. This is not to say that statistical analysis is not valuable or that the results are generally misleading. Numerous findings from research studies that did not comport with accepted knowledge have been shown valid in subsequent studies. Statistical analyses have also led to many noteworthy discoveries in social, behavioral, and health sciences, as well as informed policy in a productive way. The point I wish to impart is that we need a combination of toolsâincluding statistical methods, a clear comprehension of previous research, and our own ideas and reasoning abilitiesâto help us understand social and behavioral issues.
Statistical Software
I have taught courses on regression models for many years. When I first started out, most social and behavioral scientists used SPSS or SAS to estimate statistical models. I had used both but became a diehard Stata user. So, after a few years teaching students to use SPSS for statistical modeling I switched to Stata. But the tide has turned and the statistical software R (www.r-project.org)âa descendant of SPlusâis on the rise in my field. I therefore opted to prepare this book using R for the analytic examples. Since this is not a book on statistical software, however, I strongly urge readers to take the necessary time to learn to use R, which, among its many capabilities, performs all of the analyses presented herein. It has a rather steep learning curve, but once youâve mastered the basics of R, estimating univariate, bivariate, and multivariable statistics, including LRMs, is a straightforward task. Learning to use R is easier if you have experience with another statistical software package such as SAS, SPSS, or Stata; but even a diligent novice can l...