1
Fatness of Tail
1.1. Fat tail heuristics
Suppose the tallest person you have ever seen was 2 m (6 ft 8 in). Someday you may meet a taller person; how tall do you think that person will be, 2.1 m (7 ft)? What is the probability that the first person you meet taller than 2 m will be more than twice as tall, 13 ft 4 in? Surely, that probability is infinitesimal. The tallest person in the world, Bao Xishun of Inner Mongolia, China, is 2.36 m (or 7 ft 9 in). Before 2005, the most costly Hurricane in the US was Hurricane Andrew (1992) at 41.5 billion USD (2011). Hurricane Katrina was the next record hurricane, weighing in at 91 billion USD (2011)1. Peopleâs height is a âthin-tailedâ distribution, whereas hurricane damage is âfat-tailedâ or âheavy-tailedâ. The ways in which we reason based on historical data and the ways we think about the future are, or should be, very different depending on whether we are dealing with thin- or fat-tailed phenomena. This book provides an intuitive introduction to fat-tailed phenomena, followed by a rigorous mathematical overview of many of these intuitive features. A major goal is to provide a definition of obesity that applies equally to finite data sets and to parametric distribution functions.
Fat tails have entered popular discourse largely due to Nassim Talebâs book The Black Swan: The Impact of the Highly Improbable ([TAL 07]). The black swan is the paradigm shattering, game-changing incursion from âExtremistanâ, which confounds the unsuspecting public, the experts and especially the professional statisticians, all of whom inhabit âMediocristanâ.
Mathematicians have used at least three central definitions for tail obesity. Older texts sometime speak of âleptokurtic distributionsâ: distributions whose extreme values are âmore probable than normalâ. These are distributions with kurtosis greater than zero2, and whose tails go to zero slower than the normal distribution.
Another definition is based on the theory of
regularly varying functions and it characterizes the rate at which the probability of values greater than x go to zero as
x â â. For a large class of distributions, this rate is polynomial. Unless indicated otherwise, we will always consider non-negative random variables. Letting F denote the distribution function of random variable
X, such that
, we write
to mean
is called the
survivor function of
X. A survivor function with polynomial decay rate â
α, or, as we will say,
tail index α, has infinite kth moments for all
k >
α. The
Pareto distribution is a special case of a regularly varying distribution where
. In many cases, like the Pareto distribution, the
kth moments are infinite for all
k â„
α. Chapter 4 unravels these issues, and shows distributions for which
all moments are infinite. If we are âsufficiently closeâ to infinity to estimate the tail indices of two distributions, then we can meaningfully compare their tail heaviness by comparing their tail indices, such that many intuitive features of fat-tailed phenomena fall neatly into place.
A third definition is based on the idea that the sum of independent copies X1 + X2 + ⊠+ Xn behaves like the maximum of X1, X2,⊠Xn. Distributions satisfying
are called subexponential. Like regular variation, subexponentiality is a phenomenon that is defined in terms of limiting behavior as the underlying variable goes to infinity. Unlike regular variation, there is no such thing as an âindex of subexponentialityâ that would tell us whether one distribution is âmore subexponentialâ than another. The set of regularly varying distributions is a strict subclass of the set of subexponential distributions. Other more novel definitions are given in Chapter 4.
There is a swarm of intuitive notions regarding heavytailed phenomena that are captured to varying degrees in the different formal definitions. The main intuitions are as follows:
â the historical averages are unreliable for prediction;
â differences between successively larger observations increases;
â the ratio of successive record values does not decrease;
â the expected excess above a threshold, given that the threshold is exceeded, increases as the threshold increases;
â the uncertainty in the average of n independent variables does not converge to a normal with vanishing spread as n â â; rather, the average is similar to the original variables;
â regression coefficients which putatively explain heavy-tailed variables in terms of covariates may behave erratically.
1.2. History and data
A detailed history of fat-tailed distributions is found in [MAN 08]. Mandelbrot himself introduced fat tails into finance by showing that the change in cotton prices was heavy-tailed [MAN 63]. Since then many other examples of heavy-tailed distributions have been found, among these we find data file traffic on the Internet [CRO 97], financial market returns [RAC 03, EMB 97] and magnitudes of earthquakes and floods [LAT 08, MAL 06], to name a few.3
Data for this book were developed in the NSF project 0960865, and are available at http://www.rff.org/Events/Pages/Introduction-Climate-Change-Extreme-Events.aspx, or at public sites indicated below.
1.2.1. US flood insurance claims
US flood insurance claims data from the National Flood Insurance Program (NFIP) are compiled by state and year for the years 1980â2008; the data are in US dollars. Over this time period, there has been substantial growth in exposure to flood risk, particularly in coastal states. To remove the effect of growing exposure, the claims are divided by personal income estimates per state per year from the Bureau of Economic Accounts (BEA). Thus, we study flood claims per dollar income by state and year4.
1.2.2. US crop loss
US crop insurance indemnities paid from the US Department of Agricultureâs Risk Management Agency are compiled by state and year for the years 1980â2008; the data are in US dollars. The crop loss claims are not exposure adjusted, since a proxy for exposure is not easy to establish, and exposure growth is less of a concern5.
1.2.3. US damages and fatalities from natural disasters
The SHELDUS database, maintained by the Hazards and Vulnerability Research Group at the University of South Carolina, registers states-level damages and fatalities from weather events6. The basal estimates in SHELDUS are indications as the approach to compiling the data always employs the most conservative estimates. Moreover, when a disaster affects many states, the total damages and fatalities are apportioned equally over the affected states regardless of population or infrastructure. These data should therefore be seen as indicative rather than precise.
1.2.4. US hospital discharge bills
Billing data for hospital discharges in a northeast US states were collected over the years 2000â2008; the data are in US dollars.
1.2.5. G-Econ data
This uses the G-Econ database [NOR 06] showing the dependence of gross cell product (GCP) on geographic variables measured on a spatial scale of 1°. At 45° latitude, a 1° by 1° grid cell is [45 mi2 or [68 km]2. The size varies substantially from equator to pole. The population per grid cell varies from 0.31411 to 26,443,000. The GCP is for 1990, non-mineral, 1995 USD, converted at market exchange rates. It varies from 0.000103 to 1,155,800 USD (1995), the units are $106. The GCP per person varies from 0.00000354 to 0.905, which scales from $3.54 to $905,000. There are 27,445 grid cells. Thro...