Introduction: why research the use of data in schools?
This book begins with two simple questions: how important are data in primary schools and early years settings? And what is the impact of data on these educational sites? The two research studies we discuss here explored assessment policy and practices in early years settings, and one specific policy â Baseline Assessment â which was an assessment for children aged four and five. Through both of these projects, we identified the increasing importance of data in the every-day lives of those working in primary schools and early years settings â a phenomenon we (and others) call âdataficationâ (Roberts-Holmes 2015; Williamson 2016b; Lingard, Martino and Rezai-Rashti 2013). By this we mean a change both in the classroom, where data collection drives pedagogy and dominates workloads, and in school management, where inspection is organised around a schoolâs data. In this latter vein, the basis for Baseline Assessment itself is a reliance on numerical data as a method of judging schools. We also use âdataficationâ to describe the changes in subjectivity wrought by the current obsession with data, particularly the changing role and status of teachers in the data-driven school and the way in which children come to be âmade upâ through data.1 Thus we explore, in all senses, what Ball has called âthe tyranny of numbersâ (2015). Our subtitle âplaying with numbersâ serves as a reminder that we discuss largely play-based contexts where numerical data are collected, and that numbers are something that can be manipulated.
This work is driven by the research data we collected in England, but also reflects a wider concern with the use of data in education from academics and practitioners internationally (Selwyn 2016b; Lawn 2013; Ozga et al. 2011).2 Increasingly, issues relating to data have a huge impact on education practices, yet this remains an under-researched area. Thus, we respond here to Selwynâs call to âmake visible the flow and circulation of data and begin to understand the ways in which data are then integrated back into everyday education practicesâ (2015, 76). This interest in data in education in turn reflects a wider interest in the impact of âbig dataâ in society and culture more broadly (Bowker and Star 1999; Kitchin 2014; Eynon 2013). Much of this work claims that digital technology is central to the functioning and values of society: as Beer comments, âQuestions of data, metrics, analytics and number in the cultural sphere are not marginal, rather they are central to contemporary cultural formationsâ (2013, 11). This change is based on the technological innovation of the advent of computing power; data are now cheaply produced, collected and analysed in contexts where in the past they were expensive, leading to claims of an âindustrial revolution in dataâ (Hellerstein cited in Manovich 2012). This has resulted in overwhelming volumes of digital data: âthe production of data is increasingly becoming a deluge; a wide, deep torrent of timely, varied, resolute and relational data that are relatively low in cost and, outside of business, increasingly open and accessibleâ (Kitchin 2014, xv). This volume of data is largely due to increased computing power, which has meant âwe no longer have to choose between data size and depthâ (Manovich 2012, 3); thus the age of either in-depth studies or mass surveys is over. These developments have resulted in new fields of academic study focused on the impact of data as a productive force:
This wave of interest in the constitutive power of data, software and code is increasing as it becomes clear that data are, in a number of ways, central to the make-up of contemporary social formations of different types. Despite this excellent emergent body of literature, we are still only at the foothills in our critical analysis of the role of data in culture and society.
(Beer 2015, 3)
As in many fields, the educational research world has only recently begun to consider the impact of data, and in many cases this has been limited to consideration of governance, known as âgoverning through dataâ (Ozga et al. 2011). In this text, we aim to redress this imbalance by adding a significant contribution to the limited research focused on school and classroom practices (Finn 2016; Pratt 2016).
The context for this study is the primary education system (age 4â11) and early years settings (age 2â4) in England; education in other areas of the United Kingdom is governed by devolved policies. This is pertinent as England is an extreme example which demonstrates many concerns voiced in the international literature; the education system is described as âthe most âadvancedâ in Europe in terms of data production and useâ (Ozga 2009, 149; see also Silliman 2015). Since this was written, the reliance on data has only increased with policy changes, so that the issue has attracted media attention with headlines such as âTeachers âworn out by demand for dataââ (Metro 2017). An article by a school inspector in the Times Educational Supplement commented that the guidance for Ofsted inspectors on analysing data for primary schools was 50 pages long.
Think about it: how have we reached a situation where the data associated with primary assessment need 50 pages of explanation to professionals involved in inspection? What does this say about our current data-obsessed assessment system? It says that itâs far too complicated; far too dependent on numerical data which have a spurious air of precision, reliability and validity; far too impenetrable; far too far from the judgments that teachers need to make about real childrenâs progress towards greater, genuine understanding. The guidance gives a glimpse into a parallel universe far too removed from classroom reality.
(Richards 2016)
Thus, our work provides a case study â or cautionary tale â of what happens when an education system becomes data-obsessed, and loses sight of the complexities of the identities and learning taking place in an early years or school setting. This data obsession is an essential part of the current functioning of a neoliberal education system driven by values of competition and comparison (Ball 2012a; Ball 2013a), so, for us, studying data is an important part of studying the operation of policy and how these values are both accepted and contested. As Selwyn comments, âthe discourses, practices and objects of digital data offer a direct âway inâ to many of the struggles and conflicts that now characterise contemporary educationâ (2015, 79). These struggles include social justice concerns: who is defined as what by data, and how can datafication work to solidify, reveal or challenge disparities between different groups of children?
In this introductory chapter, we consider the politics of data, how data and datafication are defined, and then attempt to map out the existing research on data and education internationally. We end the chapter with a description of the relevant policy context in England, and an outline of the structure of the book. This sets the scene for our overall argument, which has these main strands:
- datafication is productive â of particular data-driven subjectivities, including new roles and hierarchies, and reproductive of some inequalities;
- datafication is reductive â reducing complexity of learning to single numbers and defining quality through proportions;
- datafication results in increased visibility of performance and has thus become an important part of performativity;
- the attraction and danger of datafication both reside in the permanence of beliefs about accuracy and therefore the usefulness of tracking and prediction.
The politics of data
Data are3 political; they reinforce arguments, âproveâ effectiveness and demonstrate the success or failure of policy. At a local level, data determine the course of childrenâs educational trajectories and teachersâ careers, and, in England, a schoolâs rating by the Office for Standards in Education (Ofsted). Data are fundamental to the relationships between the state and schools (Ozga et al. 2011; Ozga 2016). The use of data must be seen within the political context of an international education system which is driven by neoliberal values of managerialism and accountability (Ball 2015; Lingard, Martino and Rezai-Rashti 2013; Apple 2006). The collection of data facilitates accountability at greater levels of precision and fosters increasingly reliance on numerical comparisons as the basis of assessments of quality; an âaudit societyâ (Power 2013). This makes it very attractive to those who wish to engender market-driven values of competition into the education system: data are the âideal means of bringing market values and free market mechanisms into otherwise closed public education settingsâ (Selwyn 2016a, 92). Nowhere is the competition dimension of data in education more prominent than the PISA tests (Programme for International Student Assessment), the Organisation for Economic Cooperation and Developmentâs international comparison of attainment at secondary school. The PISA results rank countries by attainment in the international tests, driving national policies, encouraging âpolicy borrowingâ from successful systems and, in the case of lower-ranking countries, generating âPISA shocksâ (Grek 2009; Sellar and Lingard 2013), which provoke public debate. In 2017 a new international comparative assessment from the OECD began to be piloted for younger children â the International Early Learning Study for four- and five-year-old pupils â which has been described as a âpre-school PISAâ (Moss, Dahlberg, Grieshaber et al. 2016). We return to this recent development in later chapters. On a national level, statutory assessment systems such as the âSatsâ tests in England, NAPLAN4 in Australia, state-wide standardised tests after the No Child Left Behind Act in the United States and the SIMCE5 in Chile provide examples of the use of data to provide parents with information to choose schools, within a marketised system. In these cases, data become powerful indicators of the âqualityâ of schools and teachers, with performance made visible.
Data in education are integral to a particular political understanding of what matters in education, based on neoliberal values of competition. Related to this is the great attraction of the apparent precision of data, a positivist discourse, within the world of education which is seen by many as âmessyâ, ad hoc or unregulated â what Biesta calls the âpseudo-security of numbersâ (2017, 317). This is part of a âbroader deference to statistical reasoning which permeates our understanding of social practicesâ (Hardy and Boyle 2011, 214). As we discuss in later chapters, the allure of reduction can be seen as relating to the uncertainties of what has been described as the âlate neoliberalâ era (McGimpsey 2017).
At an institutional level, data use in schools also has to be seen in the context of discourses of âdigital revolutionâ and the concordant disruption to established educational practice. From the 1990s on, the promise of technology to transform education became an established trope, but âit should be clear to all but the most zealous technophile that the much heralded technological transformation of schools and schooling has yet to take placeâ (Selwyn 2010, 5). This tale of âhigh tech hope and digital disappointmentâ (ibid.) has not prevented new waves of hype about the power of technology to improve education, most recently through online learning, and the continued interest of private companies in the educational market. When considering the use of data in schools, we need to acknowledge the complex interests involved in reproducing this discourse about the power of data and other digital technology to change education; as Selwyn argues, âwe need to recognize the corporate, commercial and economically-driven nature of much of the prevailing talk about disruption and deinstitutionalizationâ (2016a, 21). The scale of money described through this discourse is vast: he quotes a 2013 McKinsey Global Institute report which argued that efficient use of data could add 1 trillion dollars to the value of education each year through improved effectiveness (Selwyn 2016a, 92). Education data are big business; as we discuss in more detail in later chapters,6 the private businesses tasked with providing Baseline Assessment received a share of between ÂŁ3.5 and ÂŁ4.5 million (Heavey 2016).
Defining âdataâ
In a world where discourses of âbig dataâ and the âdata revolutionâ regularly circulate, the term âdataâ has become one which is commonly used without much thought to its exact definition. As Pratt notes, in schools the term âdataâ has been reduced to mean only numbers: âthe very meaning of the term has been commandeered to ensure only that which is enumerated counts; as if data could only be numbers and not more qualitative descriptions of the childrenâs worldâ (2016, 897). We include in our definition non-numerical forms of data (written observations, photographs, colour coding), though the majority of the data discussed do take numerical form. In the study of data beyond education, they are described as: âthe raw material produced by abstracting the world into categories, measures and other representational forms â numbers, characters, symbols, images, sounds, electromagnetic waves, bits â that constitute the building blocks from which information and knowledge are createdâ (Kitchin 2014, 1). Kitchin argues that data are: âepistemological units, made to have a representational form that enables epistemological workâ (2014, 19); the knowledge gained from data is contingent on how and why the data were collected in the first place. The simple scientific framing of data as benign, technical and neutral is mistaken; there are always influences in how data are collected, stored, analysed and used. As Gitelman and Jackson put it, data are always âcookedâ; there is no such thing as âraw dataâ, untouched by human influence (2013, 5 cited in Kitchin 2014, 20; see also Bowker 2005). This is obviously true in education, where the most common form of data collected, assessment data, are inevitably affected by the mode of assessment, as well as many other factors. There is also no neutral method of analysing data: however neutral the processes of code and algorithms may appear, they are always imbued with some underlying values about what matters in education:
The work of policymakers, education leaders and educators, the choices of parents, and the behaviour and progress of learners alike are all being sculpted or governed by technologies that are instructed by the code and algorithms written by technical experts according to particular discourses about what education is or should be.
(Williamson 2016b, 4)
This view of the technology as part of the social context â a âsociotechnical systemâ â is drawn from a range of fields including science and technology studies, software studies, geography, philosophy and sociology (Williamson 2016b). This perspective draws attention to the original context in which the technologies were produced, and then âfold back to re-shape the contexts in which they originatedâ (Williamson 2016b, 6). In this book we conceptualise data in this broader sense as a record of something which also does work itself; data are not a purely technical form of information, but a socially created set of information or knowledge which also have influence on practices and subjectivities.
Defining âdataficationâ
The use of data in education is not a new phenomenon: schools have always kept records of attendance, attainment and other practices such as punishments (Selwyn 2016a; Goldstein and Moss 2014). However, the advent of digital technology and societal expectations around the promise of this new technology to improve education have led to data having an increased prominence in schools and other educational settings â on an âan unprecedented scaleâ (Selwyn 2016a, 81). Ozga et al. describe âthe incessant production of data to monitor performance in educationâ (2011, 1).
We use the term datafication to describe this broad phenomenon of increased prominence, as it offers a shorthand for a complex process. In earlier work we have defined datafication in terms of increased significance, visibility and constant governance through dataveillance, and as being what happens when people or systems are âsubjected to the demands of data productionâ (Roberts-Holmes and Bradbury 2016). Ozga describes the use of data as a policy instrument as growing in âstrength, speed and scopeâ (2009, 150). Others have described the increased use of data in education using the âthree Vsâ (Laney 2001 cited in Selwyn 2016a, 81): volume, describing increased amounts of data produced; variety, in terms of types of data and their sources; and velocity, of both the production and processing of data. This is a useful categorisation which helps us to move forward from descriptions of âmore dataâ. It is helpful to think about how data are produced from multiple sources and have multiple forms in classrooms, how they are transferred from one site to another and with whose permission, who has the power to alter them, and of course who controls how they are processed and delivered back to serve some purpose. For us, changes in volume, variety and velocity are all component parts of datafication. We are interested in the different forms and purposes of data, such as the distinctions between âcompliance dataâ produced to fulfil a commitment (usually policy-based) and âuseful dataâ which aid learning (Selwyn, Henderson and Chao 2015). Thus this work builds on Bradburyâs previous explorations of assessment processes in early years and their complex relationships with concepts of âteacher knowledgeâ (Bradbury 2013c). We also develop our discussions on the changing status of data within schools (Bradbury and Roberts-Holmes 2016a) and the related âartefactsâ of its collection (Souto-Otero and Beneito-Montagut 2016).
However, in this book we also use the term datafication to examine the impact of data, particularly on subjectivities, and so the process described is not simply a change to what is done and how, but also a change to who people are, or who they are expected to be. As Gitelman and Jackson note, data âneed to be understood as framed and framingâ ...