Analytics Stories
eBook - ePub

Analytics Stories

Using Data to Make Good Things Happen

Wayne L. Winston

Compartir libro
  1. English
  2. ePUB (apto para móviles)
  3. Disponible en iOS y Android
eBook - ePub

Analytics Stories

Using Data to Make Good Things Happen

Wayne L. Winston

Detalles del libro
Vista previa del libro
Índice
Citas

Información del libro

Inform your own analyses by seeinghow one of the best data analysts in the world approaches analytics problems

Analytics Stories: How to Make Good Things Happen is a thoughtful, incisive, and entertaining exploration of the application of analytics to real-world problems and situations. Covering fields as diverse as sports, finance, politics, healthcare, and business, Analytics Stories bridges the gap between theoftinscrutable world of data analytics and the concrete problems it solves.

Distinguished professor and author Wayne L. Winston answers questions like:

  • Was Liverpool over Barcelona the greatest upset in sports history?
  • Was Derek Jeter a great infielder
  • What's wrong with the NFL QB rating?
  • How did Madoff keep his fund going?
  • Does a mutual fund's past performance predict future performance?
  • What caused the Crash of 2008?
  • Can we predict where crimes are likely to occur?
  • Is the lot of the American worker improving?
  • How can analytics save the US Republic?
  • The birth of evidence-based medicine: How did James Lind know citrus fruits cured scurvy?
  • How can I objectively compare hospitals?
  • How can we predict heart attacks in real time?
  • How does a retail store know if you're pregnant?
  • How can I use A/B testing to improve sales from my website?
  • How can analytics help me write a hit song?

Perfect for anyone with the word "analyst" in their job title, Analytics Stories illuminates the process of applying analytic principles to practical problems and highlights the potential pitfalls that await careless analysts.

Preguntas frecuentes

¿Cómo cancelo mi suscripción?
Simplemente, dirígete a la sección ajustes de la cuenta y haz clic en «Cancelar suscripción». Así de sencillo. Después de cancelar tu suscripción, esta permanecerá activa el tiempo restante que hayas pagado. Obtén más información aquí.
¿Cómo descargo los libros?
Por el momento, todos nuestros libros ePub adaptables a dispositivos móviles se pueden descargar a través de la aplicación. La mayor parte de nuestros PDF también se puede descargar y ya estamos trabajando para que el resto también sea descargable. Obtén más información aquí.
¿En qué se diferencian los planes de precios?
Ambos planes te permiten acceder por completo a la biblioteca y a todas las funciones de Perlego. Las únicas diferencias son el precio y el período de suscripción: con el plan anual ahorrarás en torno a un 30 % en comparación con 12 meses de un plan mensual.
¿Qué es Perlego?
Somos un servicio de suscripción de libros de texto en línea que te permite acceder a toda una biblioteca en línea por menos de lo que cuesta un libro al mes. Con más de un millón de libros sobre más de 1000 categorías, ¡tenemos todo lo que necesitas! Obtén más información aquí.
¿Perlego ofrece la función de texto a voz?
Busca el símbolo de lectura en voz alta en tu próximo libro para ver si puedes escucharlo. La herramienta de lectura en voz alta lee el texto en voz alta por ti, resaltando el texto a medida que se lee. Puedes pausarla, acelerarla y ralentizarla. Obtén más información aquí.
¿Es Analytics Stories un PDF/ePUB en línea?
Sí, puedes acceder a Analytics Stories de Wayne L. Winston en formato PDF o ePUB, así como a otros libros populares de Business y Decision Making. Tenemos más de un millón de libros disponibles en nuestro catálogo para que explores.

Información

Editorial
Wiley
Año
2020
ISBN
9781119646044
Edición
1
Categoría
Business
Categoría
Decision Making

Part I
What Happened?

In This Part
  • Chapter 1: Preliminaries
  • Chapter 2: Was the 1969 Draft Lottery Fair?
  • Chapter 3: Who Won the 2000 Election: Bush or Gore?
  • Chapter 4: Was Liverpool Over Barcelona the Greatest Upset in Sports History?
  • Chapter 5: How Did Bernie Madoff Keep His Fund Going?
  • Chapter 6: Is the Lot of the American Worker Improving?
  • Chapter 7: Measuring Income Inequality with the Gini, Palm, and Atkinson Indices
  • Chapter 8: Modeling Relationships Between Two Variables
  • Chapter 9: Intergenerational Mobility
  • Chapter 10: Is Anderson Elementary School a Bad School?
  • Chapter 11: Value-Added Assessments of Teacher Effectiveness
  • Chapter 12: Berkeley, Buses, Cars, and Planes
  • Chapter 13: Is Carmelo Anthony a Hall of Famer?
  • Chapter 14: Was Derek Jeter a Great Fielder?
  • Chapter 15: “Drive for Show and Putt for Dough?”
  • Chapter 16: What's Wrong with the NFL QB Rating?
  • Chapter 17: Some Sports Have All the Luck
  • Chapter 18: Gerrymandering
  • Chapter 19: Evidence-Based Medicine
  • Chapter 20: How Do We Compare Hospitals?
  • Chapter 21: What is the Worst Health Care Problem in My Country?

CHAPTER 1
Preliminaries

Most applications of analytics involve looking at data relevant to the problem at hand and analyzing uncertainty inherent in the given situation. Although we are not emphasizing advanced analytics in this book, you will need an elementary grounding in probability and statistics. This chapter introduces basic ideas in statistics and probability.

Basic Concepts in Data Analysis

If you want to understand how analytics is relevant to a particular situation, you absolutely need to understand what data is needed to solve the problem at hand. Here are some examples of data that will be discussed in this book:
  • If you want to understand why Bernie Madoff should have been spotted as a fraud long before he was exposed, you need to understand the “reported” monthly returns on Madoff's investments.
  • If you want to understand how good an NBA player is, you can't just look at box score statistics; you need to understand how his team's margin moves when he is in and out of the game.
  • If you want to understand gerrymandering, you need to look at the number of Republican and Democratic votes in each of a state's congressional districts.
  • If you want to understand how income inequality varies between countries, you need to understand the distribution of income in countries. For example, what fraction of income is earned by the top 1%? What fraction is earned by the bottom 20%?
In this chapter we will focus on four questions you should ask about any data set:
  • What is a typical value for the data?
  • How spread out is the data?
  • If we plot the data in a column graph (called a histogram by analytics professionals), can we easily describe the nature of the histogram?
  • How do we identify unusual data points?
To address these issues, we will look at the two data sets listed in the file StatesAndHeights.xlsx. As shown in Figure 1.1, the Populations worksheet contains a subset of the 2018 populations of U.S. states (and the District of Columbia).
Snapshot of the U.S. state populations.
Figure 1.1: U.S. state populations
The Heights worksheet (see Figure 1.2) gives the heights of 200 adult U.S. females.
Snapshot of the heights of 200 adult U.S. women.
Figure 1.2: Heights of 200 adult U.S. women

Looking at Histograms and Describing the Shape of the Data

A histogram is a column graph in which the height of each column tells us how many data points lie in each range, or bin. Usually, we create 5–15 bins of equal length, with the bin boundaries being round numbers. Figure 1.3 shows a histogram of state populations, and Figure 1.4 shows a histogram of women's heights (in inches). Figure 1.3 makes it clear that most states have populations between 1 million and 9 million, with four states having much larger populations in excess of 19 million. When a histogram shows bars that extend much further to the right of the largest bar, we say the histogram or data set is positively skewed or skewed right.
Figure 1.4 shows that the histogram of adult women heights is symmetric, because the bars to the left of the highest bar look roughly the same as the bars to the right of the highest bar. Other shapes for histograms occur, but in most of our stories, a histogram of the relevant data would be either positively skewed or symmetric.
There is also a mathematical formula to summarize the skewness of a data set. This formula yields a skewness of 2.7 for state populations and 0.4 for women's heights. A skewness measure greater than +1 corresponds to positive skewness, a skewness between –1 and +1 corresponds to a symmetric data set, and a skewness less than –1 (a rarity) corresponds to negative skewness (meaning bars extend further to the left of the highest bar than to the right of the highest bar).
Histogram depicts the state populations.
Figure 1.3: Histogram of state populations
Histogram depicts the women's heights.
Fi...

Índice