FUNDAMENTAL CONCEPTS
Peter Drucker first spoke of the āknowledge economyā in his book The Age of Discontinuity (Drucker, 1969). The knowledge economy refers to the use of knowledge āto generate tangible and intangible value.ā Nearly 50 years later, organizations have virtually transformed themselves to meet this challenge, and data and analytics have become central to that transformation.
In this chapter, we highlight the āfundamentalsā of analytics by hopefully creating a level playing field for those interested in the moving from the concept of analytics to the practice of analytics. The fundamentals include defining both data and analytics using terms that I hope resonate. In addition, I think it is important to consider analytics in the wider context of how it is used and the value derived from these efforts. Finally, in this chapter, I relate analytics to other widely used terms as a way to find both common ground and differentiation with often-confused terminologies.
Data
Data permeates just about every part of our lives, from the digital footprint we leave with our cell phones, to health records, purchase history, and utilization of resources such as energy. While not impossible, it would require dedication and uncanny persistence to live āoff-the-gridā in this digital world. Beyond the pure generation of data, we are also voracious consumers of data, reviewing our online spending habits, monitoring our fitness regimes, and reviewing those frequent flyer points for that Caribbean vacation.
But what is data? At its most general form, data is simply information that has been stored for later use. Earliest forms of recording information might have been notches on bones (Sack, 2012). Fast forward to the 1950s, and people recorded digital information on Mylar strips (magnetic tape), then punch cards, and later disks. Modern data processing is relatively young but has set the foundation for how we think about the collection, storage, management, and use of information.
Until recently, we cataloged information that wasn't necessarily computable (e.g., videos, images); but through massive technological change, the class of āunstorableā data is quickly vanishing. Stored information, or data, is simply a model of the real world encoded in a manner that is usable, or for our purposes ācomputableā (Wolfram, 2010).
The fact that data is a persistent record or āmodelā of something that happened in the real world is an important distinction in analytics. George Box, a statistician considered by many as āone of the greatest statistical minds of the 20th centuryā (Champkin, 2013) was often quoted as saying: āAll models are wrong, but some are useful.ā All too often, we find something in the data that doesn't make sense or is just plain wrong. Remember that data has been translated from the real, physical world into something that represents the real worldāGeorge's āmodel.ā Just as the mechanical speedometer is a standard for measuring speed (and a pretty good proxy for measuring velocity), the model is really measuring tire rotation, not speed. (For those interested in a late-night distraction, I refer you to Woodford's 2016 article āSpeedometersā that explains how speedometers work.) In sum, data is stored information and serves as the foundation for all of analytics. In visual analytics, for example, we make sense out of the data using visualization techniques that enable us to perform analytical reasoning through interactive, visual interfaces.
Analytics
Analytics may be one of the most overused yet least understood terms used in business. For some, it relates to the technologies used to ābeat data into submission,ā or it is simply an extension of business intelligence and data warehousing. And yet for others, it relates to the statistical, mathematical, or quantitative methods used in the development of models.
According to Merriam-Webster (Merriam-Webster, 2017), analytics is āthe method of logical analysis.ā Dictionary.com (dictionary.com, 2017) defines analytics as āthe science of logical analysis.ā Unfortunately, both definitions use the root word of analysis in the definition, which seems a bit like cheating.
The origin of the word
analysis goes all the way back to the 1580s, where the term is rooted in Medieval Latin (anal
ticus) and Greek (anal
tikós), and means to break up or to loosen. Throughout this book, I frame analytics as
a structured approach to data-driven problem solvingāone that helps us break up problems through careful consideration of the facts.
What Is Analytics?
There has been much debate over the definition of analytics (Rose, 2016). While the purpose of this book is not to redefine or challenge anyone's definition, for the current discussion I define analytics as:
a comprehensive, data-driven strategy for problem solving
I intentionally resist using a definition that views analytics as a āprocess,ā a āscience,ā or a ādiscipline.ā Instead, I cast analytics as a comprehensive strategy, and as you will see in Part II of this book, it encompasses best practice areas that contain processes, along with roles and deliverables.
Analytics uses logic, inductive and deductive reasoning, critical thinking, and quantitative methodsāalong with dataāto examine phenomena and determine its essential features. Analytics is rooted in the scientific method (Shuttleworth, 2009), including problem identification and understanding, theory generation, hypothesis testing, and the communication of results.
Inductive Reasoning
Inductive reasoning refers to the idea that accumulated evidence is used to support a conclusion but with some level of uncertainty. ...