Chapter 1
The Case for Classroom Assessment
Improving the academic achievement of Kâ12 students has been a central concern of educators in the United States since at least the early 1890s, when leaders of industry, politicians, parents, and the society at large realized that an educated populus was the closest thing a country could have to a guarantee of a bright future (Ravitch, 1983). Since that time, a wide array of educational innovations have been tried, all of which were designed to enhance student achievement. Educators have experimented with such things as changing the schedule, decreasing the student-to-teacher ratio, increasing the availability and use of technology, and so on. All of these innovations have merit. However, not even the best has demonstrated the impact on student achievement of the most intuitively important variable in the educational systemâthe classroom teacher.
Virtually every study that has examined the role of the classroom teacher in the process of educating students has come to the same straightforward conclusion: an effective teacher enhances student learning more than any other aspect of schooling that can be controlled. To illustrate, after analyzing test scores of more than 60,000 students across grades 3 through 5, researchers S. Paul Wright, Sandra Horn, and William Sanders (1997) made the following observation:
The results of this study will document that the most important factor affecting student learning is the teacher. In addition, the results show wide variation in effectiveness among teachers. The immediate and clear implication of this finding is that seemingly more can be done to improve education by improving the effectiveness of teachers than by any other single factor. Effective teachers appear to be effective with students of all achievement levels, regardless of the level of heterogeneity in their classrooms. If the teacher is ineffective, students under the teacherâs tutelage will show inadequate progress academically regardless of how similar or different they are regarding their academic achievement. (p. 63)
Other studies have corroborated the conclusions of Wright, Horn, and Sanders (for a review of other studies, see Nye, Konstantopoulos, & Hedges, 2004). Kati Haycock (1998) dramatizes the effect of a classroom teacher by comparing what one can expect from a student spending a year with the âmost effective teacherâ and the âleast effective teacherâ (for a discussion of how âmost effectiveâ and âleast effectiveâ teachers are defined, see Technical Note 1.1). Haycock explains that the most effective teacher produces an achievement gain of 52 percentile points in student achievement, whereas the least effective teacher produces a gain of only 14 percentile pointsâa difference of 38 percentile points. This finding is made even more dramatic when one realizes that it has been estimated that students gain about 6 percentile points in academic achievement simply from growing one year older and gleaning new knowledge and skill from daily life (Cahen & Davis, 1987; Hattie, 1992). The ineffective teacher adds little more than life experience.
Given the potentially strong and positive effect of a classroom teacher, a logical question is, what do highly effective teachers do? Again, many answers have been proposed, most of which focus on lists of instructional and management strategies (see Hattie, 1992; Marzano, Marzano, & Pickering, 2003; Marzano, Pickering, & Pollock, 2001). These lists emphasize the use of strategies such as well-designed practice activities, comparison activities, communicating learning goals, and using pictures, graphs, and pictographs to represent knowledge. Although it is certainly true that âhigh-yieldâ instructional strategies and classroom management strategies are a critical part of effective teaching, this book is about one aspect of teaching that is frequently overlooked in discussions of ways to enhance student achievement: classroom assessment.
To the surprise of some educators, major reviews of the research on the effects of classroom assessment indicate that it might be one of the most powerful weapons in a teacherâs arsenal. To illustrate, as a result of a synthesis of more than 250 studies, Paul Black and Dylan Wiliam (1998) describe the impact of effective classroom assessment in the following way:
The research reported here shows conclusively that formative assessment does improve learning. The gains in achievement appear to be quite considerable, and as noted earlier, amongst the largest ever reported for educational interventions. As an illustration of just how big these gains are, an effect size of 0.7 [see Technical Note 1.2 for a description of an effect size], if it could be achieved on a nationwide scale, would be equivalent to raising the mathematics attainment score of an âaverageâ country like England, New Zealand or the United States into the âtop fiveâ after the Pacific rim countries of Singapore, Korea, Japan and Hong Kong. (p. 61)
It is important to note that Black and Wiliamâs (1998) comments address formative as opposed to summative assessments. This distinction is addressed in the next section. To get a sense of Black and Wiliamâs conclusions, consider Figure 1.1 (see Technical Note 1.3 for a description of how Figure 1.1 was derived). The upper part of Figure 1.1 depicts a teacher who begins at the 50th percentile in terms of her skill at using classroom assessments and a student in her class who begins at the 50th percentile in terms of his achievement. Over time the teacher increases her effectiveness at using classroom assessment to the 84th percentile. Given Black and Wiliamâs findings, one would predict that the studentâs achievement would increase to the 63rd percentile. The lower part of Figure 1.1 represents an even more dramatic scenario. If the teacher increases from the 50th to the 99th percentile in terms of skill at using classroom assessments, one would predict the studentâs achievement to increase to the 78th percentile.
Figure 1.1. Effect of Teacherâs Increased Skill in Classroom Assessment on Student Achievement
At face value, the findings reported in Figure 1.1 are remarkableâclassroom assessment can have a dramatic influence on student achievement. Given these findings, one might be tempted to conclude that assessing students more will automatically increase their learning. Such a conclusion would be wrong. Like most things in education, classroom assessment enhances student achievement under certain conditions only. Fortunately, the research provides some guidance regarding those conditions.
A Brief Review of the Research on Classroom Assessment
Scholars have conducted many reviews of the research on classroom assessment. Some of the more comprehensive reviews are those by Natriello (1987); Fuchs and Fuchs (1986); Crooks (1988); Bangert-Drowns, Kulik, and Kulik (1991); Bangert-Drowns, Kulik, Kulik, and Morgan (1991); Kluger and DeNisi (1996); and Black and Wiliam (1998). The reviews lead to many conclusions that provide insights into effective classroom assessment; however, four generalizations are particularly germane to this book:
- Feedback from classroom assessments should give students a clear picture of their progress on learning goals and how they might improve.
- Feedback on classroom assessments should encourage students to improve.
- Classroom assessment should be formative in nature.
- Formative classroom assessments should be frequent.
Providing a Clear Picture of Progress and How to Improve
At a basic level, classroom assessment is a form of feedback to students regarding their progress, and it stands to reason that feedback will enhance learning. Indeed, as a result of reviewing almost 8,000 studies, researcher John Hattie (1992) made the following comment: âThe most powerful single modification that enhances achievement is feedback. The simplest prescription for improving education must be âdollops of feedbackââ (p. 9).
As compelling as Hattieâs comments are, all forms of feedback are not equally effective. In fact, some forms of feedback might work against learning. To illustrate, consider the research findings depicted in Figure 1.2. The figure presents findings from two major meta-analytic studiesâone by Robert Bangert-Drowns, Chen-Lin Kulik, James Kulik, and Mary Teresa Morgan (1991), which reviewed 40 studies on classroom assessment; and one by Lynn Fuchs and Douglas Fuchs (1986), which reviewed 21 studies of assessment. The findings from these two synthesis studies as depicted in Figure 1.2 help one understand this first principle of effective classroom assessment.
Figure 1.2. Findings on the Effects of Different Types of Feedback
Consider the first five rows of Figure 1.2, from the Bangert-Drowns, Kulik, Kulik, and Morgan synthesis. Row 1 indicates that when students receive feedback on a classroom assessment that simply tells them whether their answers are correct or incorrect, learning is negatively influenced. This finding is illustrated by the loss of 3 percentile points. However, when students are provided with the correct answer, learning is influenced in a positive direction. This practice is associated with a gain of 8.5 percentile points in student achievement, as shown in Row 2.
Row 3 of Figure 1.2 addresses whether students are clear about the criteria used to judge their responses. Clarity regarding scoring criteria is associated with a gain of 16 percentile points in student achievement. Row 4 reports a particularly interesting findingâproviding students with explanations as to why their responses are correct or incorrect is associated with a gain of 20 percentile points in student achievement. Finally, Row 5 indicates that asking students to continue responding to an assessment until they correctly answer the items is associated with a gain of 20 percentile points.
Rows 6 and 7 of Figure 1.2 are from the Fuchs and Fuchs (1986) study. Row 6 shows the effect of graphically displaying student results. As we shall see in subsequent chapters, displaying assessment results graphically can go a long way to helping students take control of their own learning. However, this practice can also help teachers more accurately judge studentsâ levels of understanding and skill, and it is associated with a gain of 26 percentile points in student achievement. Presumably, seeing a graphic representation of studentsâ scores provides teachers with a more precise and specific frame of reference for making decisions about next instructional steps. Finally, Row 7 addresses the manner in which the teacher interprets assessment results. If the interpretation is done by a set of ârules,â student achievement is enhanced by 32 percentile points. In Chapter 3 we will consider this issue in depth. Briefly, though, this finding implies that teachers within a school or a district should have rigorous and uniform ways of interpreting the results of classroom assessments.
Encouraging Students to Improve
One perplexing finding from the research literature is that the manner in which feedback is communicated to students greatly affects whether it has a positive or a negative effect on student achievement. This was one of the major conclusions of a meta-analysis conducted by Avraham Kluger and Angelo DeNisi (1996). After analyzing 607 experimental/control comparisons involving some 23,000 students, the researchers found that in 33 percent of the studies they examined, feedback had a negative impact on achievement. One causal factor they identified for this paradoxical effect is whether feedback encourages or discourages students. To illustrate, Kluger and DeNisi found that when assessment feedback is discouraging to students, it has an effect size of negative .14. This translates into a decrease in student achievement of 5.5 percentile points (see Technical Note 1.2 for a discussion of effect sizes).
Of course, the critical question that emerges from this finding is, what constitutes encouraging versus discouraging feedback? Kluger and DeNisi warn that this question has no simple answers, but the research provides some strong guidance. To understand the dynamics of encouraging versus discouraging feedback, we must consider two foundational aspects of motivation theoryâdrive theory and attribution theory.
Drive theory postulates that much of human motivation can be explained as a function of two competing forces, or drivesâthe striving for success and the fear of failure (Atkinson, 1957, 1964, 1987; Atkinson & Raynor, 1974). Over time, people develop tendencies toward one drive or the otherâto be either success oriented or failure avoidant. When these tendencies become habituated, they translate into strong expectations regarding new tasksâparticularly tasks that are challenging to a student.
Success-oriented students tend to be encouraged by challenges because they anticipate the positive feelings that accompany success. Failure-avoidant students tend to be discouraged by challenges because they anticipate the negative feelings that accompany failure. In fact, failure-avoidant students might use self-handicapping strategies that ensure they fail for reasons other than lack of ability. These strategies include procrastination (Rothblum, Solomon, & Murakami, 1986; Solomon & Rothblum, 1984), setting unattainable goals so that failure is ensured (Snyder, 1984), and admitting to small weaknesses or handicaps to establish an excuse for failingâestablishing an âacademic wooden legâ (Covington, 1992; Covington, Omelich, & Schwarzer, 1986).
Attribution theory provides another perspective on encouraging versus discouraging feedback. It postulates that the manner in which students explain or attribute failure and success encourages or discourages them (Weiner, 1972, 1974; Weiner et al., 1971). In general, individuals attribute their success to four causes: ability, effort, luck, and task difficulty. Of these, the attribution of effort provides the most encouragement. As Martin Covington (1992) explains:
One of the most important features of attribution theory is its focus on the role of effort in achievement. This emphasis is justified for several reasons. For one thing, if students believe their failures occur for a lack of trying, then they are more likely to remain optimistic about succeeding in the future. For another thing, trying hard is known to increase pride in success and to offset feelings of guilt at having failed. And, perhaps most important of all, the emphasis on the role of effort in achievement is justified because it is widely believed that student effort is modifiable through the actions of teachers. (p. 16)
A fairly straightforward relationship exists between attribution theory and drive theory. Specifically, students who tend to be success oriented also tend to believe in the effort attribution. They perceive that working hard will bring them success. Thus they have a way to succeed, even when faced with challenging tasks. One of the more encouraging aspects of attribution theory is that students who do not believe their efforts produce success can learn over time that they do. Martin Seligman (Seligman, 1975; Seligman, Maier, & Greer, 1968) postulates that students can even cultivate an âexplanatory styleâ that is effort oriented, if they have enough direct experience that effort produces success. Seligman refers to this dynamic as âlearned optimism.â
Drive theory and attribution theory provide plausible explanations as to why assessment feedback might be encouraging to some students and discouraging to others. Assume that a student has done poorly on an assessment. If the student is failure avoidant, the negative outcome will strengthen the studentâs belief that he cannot succeed at challenging tasks and the negative feelings associated with such tasks. This combination will most likely discourage the student. However, if the student is success oriented, poor performance on the test will not be as discouraging because the student has a strategy for improvementâto work harder.
In short, drive theory tells us that classroom assessment that is encouraging must have two characteristics. First, teachers must provide students with a way to interpret even low scores in a manner that does not imply failure. If not, failureavoidant students will continually be discouraged when they do not receive high scores. Second, teachers must provide students with evidence that effort on their part results in higher scores.
The Value of Formative Versus Summative Assessments
The terms formative and summative are frequently used in discussions of educational assessments. Actually, the concepts of formative and summative assessment when first developed had little to do with classroom assessment or even with learning.
The distinction between formative and summative assessment was first popularized by Michael Scriven in 1967 as part of an American Educational Research Association monograph series on evaluation. Scrivenâs original point was that a distinction should be made between programs that are being formulated versus programs that have evolved to their final state. Consequently, evaluation takes on different characteristics and is interpreted differently in formative versus summative situations. This distinction was soon applied to the assessment of students. Specifically, formative assessment was defined as occurring while knowledge is being learned. Summative assessment was defined as occurring at the end of a learning episodeâfor example, at the end of a course (see McMillan, 2000). More formally, Peter Airasian (1994) defines formative assessments as those that âare interactive and used primarily to form or alter an ongoing process or activity. In contrast, assessments that come at the end of a process or activity, when it is difficult to alter or rectify what has already occurred, are called summative assessmentsâ (pp. 135â136).
Although the terms formative and summative have both been widely used in literature on classroom assessment, formative assessment has received more attention in the research literature. Specifically, formative classroom assessment has been the focus of almost every major attempt to synthesize the research on classroom assessment. Recall the finding from Black and Wiliamâs (1998) synthesis of more than 250 studies that formative assessments, as opposed to summative ones, produce the more powerful effect on stude...