eBook - ePub

Age, Period and Cohort Effects

Name: Age, Period and Cohort Effects
ISBN: 9780429615061

Statistical Analysis and the Identification Problem

Andrew Bell,

238 pages
English
ePUB (mobile friendly)
Available on iOS & Android

eBook - ePub

Age, Period and Cohort Effects

Statistical Analysis and the Identification Problem

Andrew Bell,

About this book

Age, Period and Cohort Effects: Statistical Analysis and the Identification Problem gives a number of perspectives from top methodologists and applied researchers on the best ways to attempt to answer Age–Period–Cohort related questions about society.

Age–Period–Cohort (APC) analysis is a fundamental topic for any quantitative social scientist studying individuals over time. At the same time, it is also one of the most misunderstood and underestimated topics in quantitative methods. As such, this book is key reference material for researchers wanting to know how to deal with APC issues appropriately in their statistical modelling. It deals with the identification problem caused by the co-linearity of the three variables, considers why some currently used methods are problematic and suggests ideas for what applied researchers interested in APC analysis should do.

Whilst the perspectives are varied, the book provides a unified view of the subject in a reader-friendly way that will be accessible to social scientists with a moderate level of quantitative understanding, across the social and health sciences.

Trusted by 375,005 students

Access to over 1.5 million titles for a fair monthly price.

Study more efficiently using our study tools.

Publisher

Routledge

Year

2020

Print ISBN

9780367174422

eBook ISBN

9780429615061

Topic

Psychology

Subtopic

Research & Methodology in Psychology

Index

Psychology

1 Introducing age, period and cohort effects

Andrew Bell

Age, period and cohort (APC) effects are three ways in which societies can change over time, and as such they are of great interest to social scientists across a range of disciplines. However, despite these concepts being fundamental to much social science research, they are poorly understood – in terms of how they can be uncovered, what they really mean and even fundamentally what they are.

This book brings together a collection of perspectives on how applied social scientists should approach age, period and cohort effects. In some cases, this involves complex statistical models; in others, carefully thought through but simple models; in others, data visualization. Why the need for such a plethora of approaches for the apparently simple question of how things change over time? As we will see, the answer is that understanding age, period and cohort effects is not as simple as it may seem at first glance and, as such, attempting to empirically uncover those effects requires making decisions relating to what specifically we want to find out, what assumptions we are able to make and the nature of the data available to us.

In this chapter, I aim to introduce APC effects, both in terms of how they should be understood and the difficulties that modelling them pose. I will do so in relatively simple terms (also see Bell, 2020; Fosse & Winship, 2019 for other accessible introductions to/reviews of the subject). Hopefully, by the end of this introduction, the methodological issues that the subsequent chapters are grappling with will become clear.

What are age, period and cohort effects¹

A:I can’t seem to shake off this tired feeling. Guess I’m just getting old. [Age effect]

B:Do you think it’s stress? Business is down this year, and you’ve let your fatigue build up. [Period effect]

A:Maybe. What about you?

B:Actually, I’m exhausted too! My body feels really heavy.

A:You’re kidding. You’re still young. I could work all day long when I was your age.

B:Oh, really?

A:Yeah, young people these days are quick to whine. We were not like that. [Cohort effect]

(Suzuki, 2012, 452)

Age effects are perhaps the easiest of the APC trio to understand – as we get older, we become, say, more conservative, or more likely to die, or more religious. There might also be effects that are specific to a particular age – perhaps we become more likely to drink to excess on our 18th/21st birthday, or more likely to buy a sports car around the age of 45.

Period effects are the effect of a particular year – that is the effect of existing in a particular historical moment. The mortality rate of young men is much greater, for instance, during times of war or disease epidemics; mortality rates might also be higher during a recession, as might the likelihood of an individual holding a particular political viewpoint. These are generally associated with discrete events, although we could also imagine long-run, continuous-period effects: for instance, improvements in healthcare or in air quality over time might result in gradual reductions in mortality for all people.

Finally, cohort effects are the effect of being in a particular birth cohort, or generation. Often this is conceived of as the effect of our ‘formative years’ – that is, much of what we think, how healthy we are, and who we are, is defined by the first few years of our lives, and the effect of these early years stays with us throughout the rest of our lives. Again, these could be continuous trends, whereby successive birth cohorts experience better healthcare in their early years, which sets them up to be healthier throughout the rest of their lives. But it could also be a result of discrete events – for instance, wars, pandemics or recessions could, if lived through in our formative years, affect individuals for the rest of their lives. There is strong evidence of such effects on mortality for people born during or just before the Siege of Leningrad or the Spanish Flu pandemic. Those people had higher mortality rates many years after those events took place, because they occurred in their formative years.

In each of these cases, we have seen that APC effects can have both discrete and continuous components; indeed we may have both discrete and continuous effects of one or all of APC. The continuous components explain how things change gradually with one of APC. The discrete components express the effect of being at a particular value of one of APC (on top of any gradual change). This distinction is particularly important throughout this book.

Some readers might already be thinking about some of the conceptual difficulties with understanding and distinguishing between these three. First, all three of APC operate through other variables – that is, it isn’t a particular year that has an effect, but the war, or recession, or healthcare policies that are occurring at that time. As such, understanding APC is often only the first step in understanding what is happening. Related to this, many of those other variables could operate through more than one of APC – for instance, a war could have both a period and a cohort effect, as could changes in healthcare policies. It is also possible to imagine interaction effects between each of APC – for instance, a war might have a period effect for only people of a certain age (and gender). Given this, we can see that a simple question (“how do things change over time?”) is often not simple at all.

Different types of data and identifying APC

There has been a vast increase in the amount of longitudinal data available to researchers, which has made the prospect of empirically uncovering APC effects all the more credible. However, even with cross-sectional data (that is measured at the same time and not longitudinally), it is possible to think through some questions regarding APC. With such data, there is no variation in period, and age and cohort are exactly collinear, so that we cannot know if any differences are the result of cohort differences (when people were born), or age differences (how old people are), although we will often have a good idea based on theory or intuition. Similarly, single cohort studies, that follow a single birth cohort through their lives, have no variation in cohort, and period and age are exactly collinear (although again, we might be more likely to interpret any patterns in a particular way).

However, when analysing APC, we might group multiple cross-sectional studies, or multiple cohorts, together. Alternatively, we might have panel data, which follow the same individuals through time, but measure individuals of all ages on all occasions. In these instances, we have variation in all of APC – however, as we will see in the next section, there remains a difficulty in identifying these effects.

In all these cases, we can see why one of APC might be forgotten about. With cross-sectional data, we might forget about period and cohort, and just consider age. With panel data, we might see a square age-by-year table and think we only need to think about age and year, even though cohort varies in the data as well. Such errors can be problematic, however, and produce a less nuanced, misleading and often incorrect impression of the effects that APC have. However, attempting to consider all three of APC is also problematic, as we will see now.

The identification problem

Age, period and cohort are intrinsically linked, such that the age of any individual is equal to the year of measurement (period), minus their birth year (cohort).

Age = Period – Cohort (1.1)

This is a problem if we want to find the continuous effect of all three of these because, just like two of APC with a single cross section or cohort study, the three variables are exactly collinear. That is not to say that all three couldn’t have an effect – indeed in the previous sections we have seen plausible examples of all three of these variables. But it does mean that working out which linear effects are producing the data is often impossible from the data alone.

For instance, consider the following example of a data-generating process that might exist, to explain the changes in people’s political opinions:

R i g h t w i n g n e s s = β_{0} + 1 * A g e + 1 * P e r i o d + 1 * C o h o r t + r e s i d u a l

(1.2)

Here we have a situation where, on average, an individual becomes more right wing as they age; as time goes on (period), people generally become more right wing; and each successive generation (cohort) is more right wing than the last.

Now imagine a different data-generating process

R i g h t w i n g n e s s = β_{0} + 2 * P e r i o d + r e s i d u a l

(1.3)

Here, there is no effect of age or cohort, but a stronger effect of period. However, because Age = Period – Cohort, these two data-generating processes would produce exactly the same outcome variable – exactly the same levels of rightwingness. This is a problem if, as a researcher, we are presented with this data, since we cannot know which is true. If we fit a standard regression model, such as

R i g h t w i n g n e s s = β_{0} + β_{A} A g e + β_{P} P e r i o d + β_{C} C o h o r t + r e s i d u a l

(1.4)

the model would not be able to run, due to the exact collinearity between the three variables in the model.

Instead, we would need to make some kind of assumption, which would push our model to find one equation or the other. The problem is that the difference between these two equations is not a question of a small amount of bias. The difference in how we would interpret these two equations is huge.

Note that, alternatively, we might want to fit the model using dummy variable coding, with a variable for each of the values of age, period or cohort (less a reference category for each). Whilst it is only linear effects that are affected by the identification problem described above, and such an approach would allow non-linear, discrete effects to be modelled, using dummy variables does not solve the problem. Regardless of how we model our data, any linear components of APC effects in the data-generating process will remain in the data. The choice of model would not change the fact that the linear component of those effects would be unable to be told apart, and the model would experience the same problems of exact collinearity between the dummy variables. This is the case even if the data-generating process isn’t exactly linear. Different chapters in this book refer to models that use both linear and dummy effects of APC, but in each case, the underlying effects in the data will often be a mixture of linear, continuous effects and non-linear effects. Whilst the latter can be identified, the former cannot, unless we are willing to make some quite strong assumptions about APC.

That is the key point: we cannot identify linear trends in APC without making some quite strong assumptions about at least one of APC, and whilst we can identify non-linear patterns around those trends, they may be difficult to understand without the linear trends around which they vary. The assumptions and approaches that we choose to help us to understand these patterns will have a big effect on the results that we find. The next section outlines some of those approaches, including those demonstrated in the rest of this book.

What we should and shouldn’t do: the chapt...

Cover
Half Title
Title Page
Copyright Page
Dedication
Table of Contents
List of Contributors
1 Introducing age, period and cohort effects
2 The pros and cons of constraining variables
3 Multilevel models for age–period–cohort analysis
4 The Lexis surface: A tool and workflow for better reasoning about population data
5 Detecting the ‘black hole’ of age-period excess mortality in 25 countries1: Age–period–cohort residual analysis
6 Learning from age–period–cohort data: Bounds, mechanisms, and 2D-APC graphs
7 Modeling factors affecting age, period and cohort trends: The effect of cigarette smoking on lung cancer trends
8 Bayesian age–period–cohort models
9 Age–period–cohort analysis: What is it good for?
10 The line of solutions and understanding age–period–cohort models
Index

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription

No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline

Perlego offers two plans: Essential and Complete

Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.5M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.

Both plans are available with monthly, semester, or annual billing cycles.

We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1.5 million books across 990+ topics, we’ve got you covered! Learn about our mission

Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud

Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app

Yes, you can access Age, Period and Cohort Effects by Andrew Bell in PDF and/or ePUB format, as well as other popular books in Psychology & Research & Methodology in Psychology. We have over 1.5 million books available in our catalogue for you to explore.

Age, Period and Cohort Effects

Statistical Analysis and the Identification Problem

Age, Period and Cohort Effects

Statistical Analysis and the Identification Problem

About this book

Trusted by 375,005 students

Information

1

Introducing age, period and cohort effects

What are age, period and cohort effects¹

Different types of data and identifying APC

The identification problem

What we should and shouldn’t do: the chapt...

Table of contents

Frequently asked questions

About this book

Trusted by 375,005 students

Information

What are age, period and cohort effects1

Different types of data and identifying APC

The identification problem

What we should and shouldn’t do: the chapt...

Table of contents

Frequently asked questions

What are age, period and cohort effects¹