Cognitive Assessment
eBook - ePub

Cognitive Assessment

An Introduction to the Rule Space Method

  1. 334 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Cognitive Assessment

An Introduction to the Rule Space Method

About this book

This book introduces a new methodology for the analysis of test results. Free from ambiguous interpretations, the results truly demonstrate an individual's progress. The methodology is ideal for highlighting patterns derived from test scores used in evaluating progress. Dr. Tatsuoka introduces readers to the Rule Space Method (RSM), a technique that transforms unobservable knowledge and skill variables into observable and measurable attributes. RSM converts item response patterns into attribute mastery probabilities. RSM is the only up-to-date methodology that can handle large scale assessment for tests such as the SAT and PSAT. PSAT used the results from this methodology to create cognitively diagnostic scoring reports. In this capacity, RSM helps teachers understand what scores mean by helping them ascertain an individual's cognitive strengths and weaknesses. For example, two students may have the exact same score, but for different reasons. One student might excel at processing grammatically complex texts but miss the main idea of the prose, while another excels at understanding the global message. Such knowledge helps teachers customize a student's education to his or her cognitive abilities. RSM is also used for medical diagnoses, genetics research, and to help classify music into various states of emotions for treating mental problems.

The book opens with an overview of cognitive assessment research and nonparametric and parametric person-fit statistics. The Q-matrix theory is then introduced followed by the Rule Space method. Various properties of attribute mastery probabilities are then introduced along with the reliability theory of attributes and its connection to classical and item response theory. The book concludes with a discussion of how the construct validity of a test can be clarified with the Rule Space method.

Intended for researchers and graduate students in quantitative, educational, and cognitive psychology, this book also appeals to those in computer science, neuroscience, medicine, and mathematics. The book is appropriate for advanced courses on cognometrics, latent class structures, and advanced psychometrics as well as statistical pattern recognition and classification courses taught in statistics and/or math departments.

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn more here.
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access Cognitive Assessment by Kikumi K. Tatsuoka in PDF and/or ePUB format, as well as other popular books in Education & Education General. We have over one million books available in our catalogue for you to explore.

Information

Publisher
Routledge
Year
2009
Print ISBN
9780805828283

1

Dimensionality of Test Data and Aberrant Response Patterns

1.1 General Overview of Cognitively Diagnostic Methodologies

The value of a diagnostic profile that enumerates strengths and weaknesses in individual performance has been recognized in education, and competent teachers have been using their diagnostic skills in their classrooms to teach students better. It had been common sense that only humans could do such detective work to determine what was going on inside a human brain; however, the rapid development of computer technologies in the 1970s enabled technology to accomplish what previously had been impossible for humans. As computer technologies developed rapidly, computational powers increased dramatically. Linguists worked on natural language processing, psychologists were interested in modeling human information and retrieval, mathematicians were more interested in automating the theorem-proving process, and statisticians advanced various statistical methodologies and models that were impossible to compute without the help of computers. Computer scientists Brown and Burton (1978) developed a computer program called BUGGY using a new powerful programming language suitable for processing a list of logical statements. BUGGY was able to diagnose various “bugs,” or equally erroneous rules of operations committed by students in whole-number subtraction problems.
The successful diagnostic capability of the computer program BUGGY affected American education to a great extent; consequently, similar computer programs, called “expert systems” (Anderson, 1984), that are capable of diagnosing erroneous rules of operations or capable of teaching simple algebra and geometry were developed in the 1980s and 1990s. FBUG was a similar buggy system that followed the idea of the original BUGGY and was capable of diagnosing fraction, addition, and subtraction problems (Tatsuoka & Baillie, 1982). These computer programs required a prepared list of erroneous rules that were originally discovered by humans. Each erroneous rule was decomposed into a sequence of logical statements and a computer language like LISP, which could develop the diagnostic systems. If a new erroneous rule was discovered, then the program would be modified to include it. The system could not discover either new erroneous rules not listed in the initial list or common erroneous rules using a different strategy or a new method to solve a problem.

Stability and Gradient of Diagnostic Information

Sleeman, Kelly, Martink, Ward, and Moore (1989) developed a buggy system for algebra and discovered that many students changed their erroneous rules of operations so often that the buggy system was practically unable to diagnose such students. VanLehn (1983) developed “repair theory” to explain why bugs are unstable. Shaw (1984, 1986) interviewed 40 to 50 students in fraction, addition, and subtraction problems; Standiford, Tatsuoka, and Klein (1982) also interviewed many students for mixed-number operations; and so did Birenbaum (1981) for signed-number operations. They discovered that 95% of erroneous rules of operations in these domains were extremely unstable, and students kept changing their rules to something else. The students also could not answer interview questions as to why they changed their old rules to new ones. Moreover, many students could not even recall what rules they used, even when they used them only 10 seconds before. Tatsuoka (1984a) concluded it would not be wise to diagnose a micro level of performances like bugs or erroneous rules on a test. Consequently, an important question arose regarding the level of performance that would be stable enough to measure and diagnose.
Total scores of most large-scale assessments have high reliabilities, but the level of information is too coarse, and because there are too many different ways to get 50% of the items correct, total scores are not very useful for cognitive diagnosis. If a math test has 5 geometry items and 5 algebra items, then there are 252 ways to achieve a score of 5. Some students may get only the geometry items correct and all algebra items incorrect, whereas others get the geometry items incorrect and all algebra items correct. Their sources of misconceptions could be very different, and they would then need very different remediation treatments. The item score of a test is still at the macro level, and it is difficult to obtain useful diagnostic information from a single item. The question then becomes which levels of diagnostic information would be most valuable and helpful in promoting learning activities among students and whether a subscore level would be useful. The following problems, labeled Examples 1.1.1 and 1.1.2, were excerpted from the technical report to the National Science Foundation (Tatsuoka, Kelly, C. Tatsuoka, Varadi, & Dean, 2007) and coded by three types of attributes: content-related knowledge and skills (C2–C5 and Exponential and Probabilities), mathematical thinking skills (P1–P10), and special skills unique to item types (S1–S9).
Example 1.1
A water ski tow handle makes an isosceles triangle. If one of the congruent angles is 65 degrees, what is the measure of the angle?
image
Geometric figure is given: S3.
This is a geometry problem: C4.
Have to apply knowledge about the relationships among angles to get the solution: P3.
Because the total sum of angles is 180°, the third angle becomes (180° − 130°) = 50°. Therefore, x can be obtained by subtracting half of this angle, 25°: P5.
That is, 180° − {180° − (65° + 65°)}/2 = 155°: P2.
Example 1.2
An electrician has a plastic pipe, used for underground wiring, which is 15 feet long. He needs plastic pieces that are 6.5 inches long to complete his job. When he cuts the pipe, how many pieces will he be able to use for his job?
Two thirds of the students answered this question correctly. We counted the number of words used in the stem and found 52 words; however, the problem requires translation of a word problem into an arithmetic procedure in order to solve this item: P1.
Because two different units, feet and inch, are used, we have to convert a foot to 12 inches, and then 15 feet must be 180 inches: S1.
The length of a pipe is 6.5 inches, so we need 27 pieces, 180/6.5 = 27 pieces: P2.
Dividing 180 by a decimal number, 6.5, belongs to the content domain of C2: C2.
There are two steps—the first to convert the unit to the common unit, and the second to carry out the computation: P9.
These simple problems suggest that several different attributes listed in Table 1.1 must be applied correctly in order to get the right answer. P2 is involved in both items, but the remaining attributes coded in the problems are not intersected. There are 27 attributes listed in Table 1.1 and only 45 items per test. All attributes are involved independently in different ways for each of 45 items, and none of the items involves an identical set of attributes. The attribute involvement is intertwined and complex. The problem is to determine how one can possibly separate the items into subsets based on attribute involvement, and take their subscores from each subset as the attributes’ performance.
Table 1.1 A Modified List of Knowledge, Skill, and Process Attributes Derived to Explain Performance on Mathematics Items From the TIMSS-R (1999) for Population 2 (Eighth Graders) for Some State Assessment
Content Attributes
C1Basic concepts and operations in whole numbers and integers
C2Basic concepts and operations in fractions and decimals
EXPPowers, roots, and scientific expression of numbers are separated from C2
C3Basic concepts and operations in elementary algebra
C4Basic concepts and operations in two-dimensional geometry
C5Data and basic statistics
PROBBasic concepts, properties, and computational skills
Process Attributes
P1Translate, formulate, and understand (only for seventh graders) equations and expressions to solve a problem
P2Computational applications of knowledge in arithmetic and geometry
P3Judgmental applications of knowledge in arithmetic and geometry
P4Applying rules in algebra and solving equations (plugging in included for seventh graders)
P5Logical reasoning—includes case reasoning, deductive thinking skills, if-then, necessary and sufficient conditions, and generalization skills
P6Problem search; analytic thinking and problem restructuring; and inductive thinking
P7Generating, visualizing, and reading figures and graphs
P9Management of data and procedures, complex, and can set multigoals
P10Quantitative and logical reading (less than, must, need to be, at least, best, etc.)
Skill (Item Type) Attributes
S1Unit conversion
S2Apply number properties and relationships; number sense and number line
S3Using figures, tables, charts, and graphs
S3gUsing geometric figures
S4Approximation and estimation
S5Evaluate, verify, and check options
S6Patterns and relationships (inductive thinking skills)
S7Using proportional reasoning
S8Solving novel or unfamiliar problems
S9Comparison of two or more entities
The search for the acceptable levels for helpful and reliable diagnostic information was continued in the 1980s and early 1990s. Tatsuoka (1984a) investigated by grouping erroneous rules in fraction problems into their sources of errors, and examined their stability across two parallel tests. She determined, for example, that 16 erroneous rules of operations originated from the action of making two equivalent fractions. The sources of errors or the sources of erroneous rules of operations were acceptably stable. She further investigated the changes of error types over four parallel tests of signed-number computations (Tatsuoka, 1983a; Tatsuoka, Birenbaum, & Arnold, 1990; Tatsuoka, Birenbaum, Lewis, & Sheehan, 1993), and Birenbaum and her associates (Birenbaum, Kelly, & Tatsuoka, 1993; Birenbaum & Tatsuoka, 1980) examined the stability of computational skills in algebra and exponential items by examining the agreement of a diagnostic classification from parallel subtests (Birenbaum, Tatsuoka, & Nasser, 1997). Tatsuoka and Tatsuoka (2005) tested the stability of classification results by the rule space method and found the test–retest correlations of attribute level are higher than those of item level. This series of studies confirmed that the erroneous rules are extremely un...

Table of contents

  1. Cover
  2. Halftitle
  3. Title
  4. Copyright
  5. Dedication
  6. Contents
  7. Preface
  8. 1. Dimensionality of Test Data and Aberrant Response Patterns
  9. 2. Parametric Person–Fit Statistics, Zeta (ζ), and Generalized Zetas (ζ1, 
, ζm)
  10. 3. Cognitive Modeling by Developing an Incidence Q Matrix
  11. 4. Knowledge Space Generated From a Q Matrix
  12. 5. A Classification Space: Rule Space as a Cartesian Product of the Person Parameter Ξ in Item Response Theory, ζ, and Generalized ζs
  13. 6. Classification Rules
  14. 7. Rule Space Decision Rules and Attribute Mastery Probabilities
  15. 8. Posterior Probabilities With Different Prior Probabilities and Their Effect on the Attribute Mastery Probabilities
  16. 9. Reliabilities of Item Score, Person’s Score, and Attribute Mastery Probability, and Their Relationship to the Classical Test Theory
  17. 10. Validation of Attributes, a Q Matrix Coded by the Involvement of Attributes to Items and a Test
  18. References
  19. Author Index
  20. Subject Index