Analytics Stories
eBook - ePub

Analytics Stories

Using Data to Make Good Things Happen

Wayne L. Winston

Buch teilen
  1. English
  2. ePUB (handyfreundlich)
  3. Über iOS und Android verfügbar
eBook - ePub

Analytics Stories

Using Data to Make Good Things Happen

Wayne L. Winston

Angaben zum Buch
Buchvorschau
Inhaltsverzeichnis
Quellenangaben

Über dieses Buch

Inform your own analyses by seeinghow one of the best data analysts in the world approaches analytics problems

Analytics Stories: How to Make Good Things Happen is a thoughtful, incisive, and entertaining exploration of the application of analytics to real-world problems and situations. Covering fields as diverse as sports, finance, politics, healthcare, and business, Analytics Stories bridges the gap between theoftinscrutable world of data analytics and the concrete problems it solves.

Distinguished professor and author Wayne L. Winston answers questions like:

  • Was Liverpool over Barcelona the greatest upset in sports history?
  • Was Derek Jeter a great infielder
  • What's wrong with the NFL QB rating?
  • How did Madoff keep his fund going?
  • Does a mutual fund's past performance predict future performance?
  • What caused the Crash of 2008?
  • Can we predict where crimes are likely to occur?
  • Is the lot of the American worker improving?
  • How can analytics save the US Republic?
  • The birth of evidence-based medicine: How did James Lind know citrus fruits cured scurvy?
  • How can I objectively compare hospitals?
  • How can we predict heart attacks in real time?
  • How does a retail store know if you're pregnant?
  • How can I use A/B testing to improve sales from my website?
  • How can analytics help me write a hit song?

Perfect for anyone with the word "analyst" in their job title, Analytics Stories illuminates the process of applying analytic principles to practical problems and highlights the potential pitfalls that await careless analysts.

Häufig gestellte Fragen

Wie kann ich mein Abo kündigen?
Gehe einfach zum Kontobereich in den Einstellungen und klicke auf „Abo kündigen“ – ganz einfach. Nachdem du gekündigt hast, bleibt deine Mitgliedschaft für den verbleibenden Abozeitraum, den du bereits bezahlt hast, aktiv. Mehr Informationen hier.
(Wie) Kann ich Bücher herunterladen?
Derzeit stehen all unsere auf Mobilgeräte reagierenden ePub-Bücher zum Download über die App zur Verfügung. Die meisten unserer PDFs stehen ebenfalls zum Download bereit; wir arbeiten daran, auch die übrigen PDFs zum Download anzubieten, bei denen dies aktuell noch nicht möglich ist. Weitere Informationen hier.
Welcher Unterschied besteht bei den Preisen zwischen den Aboplänen?
Mit beiden Aboplänen erhältst du vollen Zugang zur Bibliothek und allen Funktionen von Perlego. Die einzigen Unterschiede bestehen im Preis und dem Abozeitraum: Mit dem Jahresabo sparst du auf 12 Monate gerechnet im Vergleich zum Monatsabo rund 30 %.
Was ist Perlego?
Wir sind ein Online-Abodienst für Lehrbücher, bei dem du für weniger als den Preis eines einzelnen Buches pro Monat Zugang zu einer ganzen Online-Bibliothek erhältst. Mit über 1 Million Büchern zu über 1.000 verschiedenen Themen haben wir bestimmt alles, was du brauchst! Weitere Informationen hier.
Unterstützt Perlego Text-zu-Sprache?
Achte auf das Symbol zum Vorlesen in deinem nächsten Buch, um zu sehen, ob du es dir auch anhören kannst. Bei diesem Tool wird dir Text laut vorgelesen, wobei der Text beim Vorlesen auch grafisch hervorgehoben wird. Du kannst das Vorlesen jederzeit anhalten, beschleunigen und verlangsamen. Weitere Informationen hier.
Ist Analytics Stories als Online-PDF/ePub verfügbar?
Ja, du hast Zugang zu Analytics Stories von Wayne L. Winston im PDF- und/oder ePub-Format sowie zu anderen beliebten Büchern aus Business & Decision Making. Aus unserem Katalog stehen dir über 1 Million Bücher zur Verfügung.

Information

Verlag
Wiley
Jahr
2020
ISBN
9781119646044
Auflage
1

Part I
What Happened?

In This Part
  • Chapter 1: Preliminaries
  • Chapter 2: Was the 1969 Draft Lottery Fair?
  • Chapter 3: Who Won the 2000 Election: Bush or Gore?
  • Chapter 4: Was Liverpool Over Barcelona the Greatest Upset in Sports History?
  • Chapter 5: How Did Bernie Madoff Keep His Fund Going?
  • Chapter 6: Is the Lot of the American Worker Improving?
  • Chapter 7: Measuring Income Inequality with the Gini, Palm, and Atkinson Indices
  • Chapter 8: Modeling Relationships Between Two Variables
  • Chapter 9: Intergenerational Mobility
  • Chapter 10: Is Anderson Elementary School a Bad School?
  • Chapter 11: Value-Added Assessments of Teacher Effectiveness
  • Chapter 12: Berkeley, Buses, Cars, and Planes
  • Chapter 13: Is Carmelo Anthony a Hall of Famer?
  • Chapter 14: Was Derek Jeter a Great Fielder?
  • Chapter 15: “Drive for Show and Putt for Dough?”
  • Chapter 16: What's Wrong with the NFL QB Rating?
  • Chapter 17: Some Sports Have All the Luck
  • Chapter 18: Gerrymandering
  • Chapter 19: Evidence-Based Medicine
  • Chapter 20: How Do We Compare Hospitals?
  • Chapter 21: What is the Worst Health Care Problem in My Country?

CHAPTER 1
Preliminaries

Most applications of analytics involve looking at data relevant to the problem at hand and analyzing uncertainty inherent in the given situation. Although we are not emphasizing advanced analytics in this book, you will need an elementary grounding in probability and statistics. This chapter introduces basic ideas in statistics and probability.

Basic Concepts in Data Analysis

If you want to understand how analytics is relevant to a particular situation, you absolutely need to understand what data is needed to solve the problem at hand. Here are some examples of data that will be discussed in this book:
  • If you want to understand why Bernie Madoff should have been spotted as a fraud long before he was exposed, you need to understand the “reported” monthly returns on Madoff's investments.
  • If you want to understand how good an NBA player is, you can't just look at box score statistics; you need to understand how his team's margin moves when he is in and out of the game.
  • If you want to understand gerrymandering, you need to look at the number of Republican and Democratic votes in each of a state's congressional districts.
  • If you want to understand how income inequality varies between countries, you need to understand the distribution of income in countries. For example, what fraction of income is earned by the top 1%? What fraction is earned by the bottom 20%?
In this chapter we will focus on four questions you should ask about any data set:
  • What is a typical value for the data?
  • How spread out is the data?
  • If we plot the data in a column graph (called a histogram by analytics professionals), can we easily describe the nature of the histogram?
  • How do we identify unusual data points?
To address these issues, we will look at the two data sets listed in the file StatesAndHeights.xlsx. As shown in Figure 1.1, the Populations worksheet contains a subset of the 2018 populations of U.S. states (and the District of Columbia).
Snapshot of the U.S. state populations.
Figure 1.1: U.S. state populations
The Heights worksheet (see Figure 1.2) gives the heights of 200 adult U.S. females.
Snapshot of the heights of 200 adult U.S. women.
Figure 1.2: Heights of 200 adult U.S. women

Looking at Histograms and Describing the Shape of the Data

A histogram is a column graph in which the height of each column tells us how many data points lie in each range, or bin. Usually, we create 5–15 bins of equal length, with the bin boundaries being round numbers. Figure 1.3 shows a histogram of state populations, and Figure 1.4 shows a histogram of women's heights (in inches). Figure 1.3 makes it clear that most states have populations between 1 million and 9 million, with four states having much larger populations in excess of 19 million. When a histogram shows bars that extend much further to the right of the largest bar, we say the histogram or data set is positively skewed or skewed right.
Figure 1.4 shows that the histogram of adult women heights is symmetric, because the bars to the left of the highest bar look roughly the same as the bars to the right of the highest bar. Other shapes for histograms occur, but in most of our stories, a histogram of the relevant data would be either positively skewed or symmetric.
There is also a mathematical formula to summarize the skewness of a data set. This formula yields a skewness of 2.7 for state populations and 0.4 for women's heights. A skewness measure greater than +1 corresponds to positive skewness, a skewness between –1 and +1 corresponds to a symmetric data set, and a skewness less than –1 (a rarity) corresponds to negative skewness (meaning bars extend further to the left of the highest bar than to the right of the highest bar).
Histogram depicts the state populations.
Figure 1.3: Histogram of state populations
Histogram depicts the women's heights.
Fi...

Inhaltsverzeichnis