YOUNG CHILDREN’S STATISTICAL THINKING: A TEACHING EXPERIMENT

 

Arsalan  Wares

Graham A. Jones

Cynthia W. Langrall

Carol A. Thornton

Illinois State University

Illinois State University

Illinois State University

Illinois State University

wares@ilstu.edu

jones@ilstu.edu

langrall@ilstu.edu

thornton@ilstu.edu

This study designed and evaluated a teaching experiment in data exploration with two grade 1 classes one of which collected the data used in instruction. The teaching experiment was informed by a cognitive framework that described elementary students’ statistical thinking. Following the teaching experiment, the children showed significant gains on some but not all of the statistical thinking processes associated with the framework. While the two classes (collection and noncollection groups) changed in different ways, the evidence did not support stronger overall growth for the collection group. Case-study analysis revealed that: experience with the data context reduced children’s idiosyncratic descriptions, data values of zero were problematic for these children; children possess intuitive knowledge of center and spread; and making meaningful predictions from data was difficult for these children.

In response to the critical role that data play in our technological society, there have been ongoing calls for reform in statistical education beginning in the primary grades (NCTM, 2000). Notwithstanding these recommendations there has been relatively little research on primary children's statistical thinking and even less research on the efficacy of instructional programs in data exploration (Shaughnessy, Garfield, & Greer, 1996). Moreover, the studies that have been undertaken have not developed and used the kind of cognitive models that researchers like Fennema et al. (1996) deem necessary to guide the design and implementation of instruction.

This study addressed the above-mentioned void in the research literature by developing and evaluating a teaching experiment on data handling with young children. More specifically, the study sought to: (a) use a cognitive framework that describes students’ statistical thinking to design and implement a teaching experiment with two grade 1 classes, and (b) to evaluate the effect of the teaching experiment on children’s learning.

Theoretical Perspectives

The conceptualization of this study drew on two theoretical perspectives. First, it was grounded in teaching experiment theory (Cobb, 1999). Second, the teaching experiment was informed by a cognitive Framework (Jones et al., 2000) (Figure 1) that describes students’ statistical thinking. In making the link between these two perspectives the Statistical Thinking Framework served as the research base for designing a hypothetical learning trajectory (Simon,1995) for students as they engaged in the teaching experiment. We also used the Framework to interpret classroom events and

 


 

Process/ Level

 

Level 1

Idiosyncratic

 

Level 2

Transitional

 

Level 3

Quantitative

 

Level 4

Analytical

 

 

Describing Data

Displays

 

(D)

 

•gives a description that is unfocused and includes idiosyncratic/irrelevant information; has no awareness of graphing conventions [e.g., title, axis labels] of the display

 

•does not recognize when two displays represent the same data OR indicates some recognition but uses idiosyncratic/ irrelevant reasoning

 

•considers irrelevant/subjective features when evaluating the effectiveness of two different displays of the same data

•gives a description that is hesitant and incomplete, but demonstrates some awareness of graphing conventions

 

 

•recognizes when two different displays represent the same data, but uses a justification based purely on conventions

 

 

•focuses only on one aspect when evaluating the effectiveness of two different displays of the same data

•gives a confident and complete description and demonstrates awareness of graphing conventions

 

 

•recognizes when two different displays represent the same data by establishing partial correspondences between data elements in the displays

 

•focuses on more than one aspect when evaluating the effectiveness of two different displays of the same data

 

 

 

 

•recognizes when two different displays represent the same data by establishing precise numerical correspondences between data elements in the displays

 

•provides a coherent and comprehensive explanation when evaluating the pros and cons of two different displays of the same data

 

 

Organizing

and

Reducing Data

 

(O)

•does not group or order the data or gives an idiosyncratic/

irrelevant grouping

 

•does not recognize when  information is lost in reduction process

 

•is not able to describe data in terms of representativeness or "typicality"

 

•cannot describe data in terms of spread; gives idiosyncratic / irrelevant responses

•gives a grouping or ordering that is not consistent OR groups data into classes using criteria they cannot explain

 

•recognizes when data reduction occurs, but gives a vague/irrelevant explanation

 

•gives hesitant and incomplete descriptions of data in terms of "typicality"

 

•invents a measure--usually invalid--in an effort to make sense of spread

•groups or orders data into classes and can explain the basis for grouping

 

 

•recognizes when data reduction occurs and explains  reasons for the reduction

 

•gives valid measures of "typicality" that begin to approximate one of the centers (mode, median, mean); reasoning is incomplete

 

•uses an invented measure or description  which is valid, but the explanation is incomplete

•groups or orders data into classes in more than one way and can explain the bases for these different groupings

 

•recognizes that data reduction can occur in different ways and gives complete explanations for the different reductions

 

•gives valid measures of typicality that reflect one or more of the centers; reasoning is essentially complete

 

•uses the range or invented measure that has the same meaning as range

 

 

Representing Data

 

(R)

 

•constructs an idiosyncratic or invalid display when asked to complete a partially constructed graph associated with a given data set

 

•produces an idiosyncratic or invalid display that does not represent or reorganize the data set

•constructs a display that is valid in some aspects when asked to complete a partially constructed graph associated with a given data set

 

•produces a display that is partially  valid,  but does not attempt to reorganize the data

•constructs a display that is valid when asked to complete a partially constructed graph associated with a given data set; may have difficulty with ideas like scale or zero categories

 

•produces a valid display that shows some attempt to reorganize the data

•constructs a valid display when asked to complete a partially constructed graph associated with a given data set; works effectively with scale, zero categories, 

 

•produces multiple valid displays, some of which reorganize the data

 

 

Analyzing and

Interpreting Data

 

(A)

 

•makes no response or an invalid/irrelevant response to the question, "What does the display not say about the data?"

 

•makes no response or gives an invalid/incomplete response when asked to "read between the data"

 

•makes no response or gives an invalid/ incomplete response when asked to "read beyond the data"

•makes a relevant but incomplete response to the question, "What does the display not say about the data?"

 

•gives a valid response  to some aspects of "reading between the data" but is imprecise when asked to make comparisons

 

•gives a vague or inconsistent response when asked to "read beyond the data"

•makes multiple relevant responses to the question, "What does the display not say about the data?"

 

•gives multiple valid responses when asked to "read between the data" and can make some global comparisons

 

•tries to use the data and make sense of the situation when asked to "read beyond the data;" reasoning is incomplete

•makes a comprehensive contextual response  to the question, "What does the display not say about the data?"

 

•gives multiple valid responses when asked to "read between the data" and can make coherent and comprehensive comparisons

 

•gives a response that is valid, complete, and consistent when asked to "read beyond the data"

Figure 1: Statistical Thinking Framework


examine changes in children’s statistical thinking.

The Statistical Thinking Framework incorporates four key data handling processes adapted from Shaughnessy et al.(1996): describing, organizing and reducing, representing, analyzing and interpreting data. Describing data involves the explicit reading of data contained in visual displays. Organizing and reducing data involves ordering, grouping, and summarizing data using measures of center and spread. Representing data incorporates the construction of visual displays and analyzing and interpreting data includes what Curcio and Artzt (1997) refer to as “reading between the data” and “reading beyond the data” (p. 124). The Framework descriptors for each statistical process built on previous research (e.g. Beaton et al., 1996; Bright & Friel, 1998; Zawojewski & Heckman, 1997; Curcio & Artzt, 1997). For each of these processes, four levels of thinking were hypothesized and validated (Jones et al., in press). Level 1 is associated with idiosyncratic thinking, Level 2 is transitional between idiosyncratic and quantitative thinking, Level 3 involves the use of informal quantitative thinking, and Level 4 incorporates analytical and numerical thinking. These levels of thinking are consistent with neo-Piagetian theories that postulate the existence of thinking levels that recycle during developmental stages (e.g., Biggs & Collis, 1991).

Method

Children from 2 intact grade 1 classes in a midwest school participated in the teaching experiment on data exploration. One class collected the data on number of missing teeth from its own members (Collection Group, n=20) and the other class also used this data (NonCollection Group, n= 18). We hypothesized that this collection activity might work in favor of the Collection Group by building a better contextual base for data exploration. In addition to the overall analysis involving all children, 3 children from each class were purposefully sampled as target for more detailed case-study analysis. For each class,1 student was chosen from each of the upper and lower quarters and 1 from the middle 50% on mathematics achievement.

The intervention phase of the teaching experiment for these grade 1 children comprised four 40-minute sessions spread over a 3-week period. Sessions opened with a whole-class exploration posed by the third author. Ten teacher education undergraduate mentors facilitated children’s solving of data exploration problems that were based on the missing-teeth data and linked to the Framework. To ensure that the inquiry orientation of the intervention phase was met, mentors participated in weekly seminars that explored strategies for fostering children’s statistical thinking.

Data were gathered from three sources: (a) researcher-designed interview assessment protocols administered in the weeks preceding and following the intervention, (b) mentor evaluations from each instructional session, and (c) researcher field notes on the 6 target students. The assessment protocol (Jones et al., in press) based on the Framework comprised 37 items (7 on describing, 13 on organizing and reducing, 2 on representing data, and 15 on analyzing and interpreting data) in three contexts: How Many Friends Came to Visit? Beanie Babies and The Beanbag Game. A double-coding procedure (Miles & Huberman, 1994) was used to establish pre and postintervention statistical thinking levels for all students in both classes. In this procedure, the first two authors independently coded, according to the levels of the framework, all questions on each child’s protocol. The modal response level for each statistical thinking process was used to determine children’s dominant thinking levels. The authors reached agreement on 82% of children’s dominant thinking levels. A Wilcoxon Signed Ranks Test (Siegel & Castellan, 1988) was used to compare pre and postintervention statistical levels for each of the two classes. Both “within” and “cross-case displays”(Miles & Huberman, 1994) were used to guide the analysis of qualitative data from the 6 target students. Mentor evaluations and researcher field notes were coded and synthesized to discern learning patterns exhibited by these students during the intervention.

Results and Conclusions

The effect of the teaching experiment: Quantitative analysis. The Wilcoxon Signed Ranks Test

 (Siegel & Castellan, 1988) revealed differences between the pre and postintervention thinking levels of the grade 1 children that were significant for some statistical thinking processes and not for others. For describing data, only the NonCollection Group showed a significant difference (Collection Group, p< .08; NonCollection Group, p< .01); for organizing and reducing data, both Groups showed significant differences (Collection Group, p< .04; NonCollection Group, p< .01); for representing data, neither group showed significant differences (Collection Group, p<.17; NonCollection Group, p< .42); and for analyzing and interpreting data only the Collection Group showed a significant difference (Collection Group, p< .01; NonCollection Group, p< .65). While the statistical thinking of the children in the two groups changed in slightly different ways, the evidence does not support a stronger overall growth in favor of the Collection Group.When the two classes were combined the differences between children's pre and postintervention thinking levels were significant for all statistical processes except representing data. For the three significant statistical thinking processes, the most salient feature of the data was that the number of children exhibiting Level 3 increased following the intervention and this was accompanied by a corresponding decrease in the number exhibiting Level 1 thinking.

The effect of the teaching experiment: Case-study analysis. A number of learning patterns and trends were discerned by examining the relationship between target students' thinking during instruction and their thinking at the pre and postintervention assessments. These patterns are described and interpreted for each of the four statistical thinking processes. With regard to describing data, students brought some prior knowledge to the classroom. For example, they recognized how categorical data such as "days of the week" were shown on a scale, how to find values on a line plot using counting, and how to read data in a table. Target students had more difficulty reading bar graphs than line plots and they made only cosmetic comparisons between a line plot and a bar graph of the same data. However, during instruction they gave less idiosyncratic responses when reading visual displays and became more facile in comparing different displays and organizations of the missing teeth data. With regard to organizing and reducing data, students’ informal knowledge prior to instruction was limited. However, during instruction and in the postassessment most target students demonstrated informal notions of mode or middle (median) in dealing with center. A smaller number showed some idea of clustering in discussing spread. Although most children were able to organize the familiar Beanie Baby data, they were unable to construct different organizations of the missing teeth data. Nevertheless, when the children were shown two different organizations of the missing teeth data they were able to recognize that they represented the same data. With regard to representing data, students were more capable at completing unfinished graphs based on a given set of data than they were at representing a given graph in a different way. Their limited skills in reorganizing data also constrained their representations of data. Interestingly more than 50% of the target students were not able to represent “zero” on a display and zero data values also caused problems when children analyzed and interpreted data. With regard to analyzing and interpreting data, our students, like those of Curcio and Artzt (1997), demonstrated more normative thinking in tasks that involved reading between the data than in tasks that involved reading beyond the data. However, even though students read zero values they ignored them when analyzing and interpreting data. The instructional sequence helped target students to gain better facility in dealing with zero values and in reading beyond the data particularly as they became very familiar with the missing teeth data. By the end of the intervention, more students in both classes were able to make and justify meaningful predictions of how many friends would visit Sam in the next month based on the data for the previous week.

Given the prior knowledge and growth that children showed on the four statistical processes, there is evidence that they can accommodate a broader approach to data exploration. However, if instruction is to reach its full potential in the elementary grades, further research is needed to build learning trajectories that link different levels of children's statistical thinking.

References

Beaton, A. E., Mullis, I. V. V., Martine, M. O., Gonzalez, E. J., Kelly, D. L., & Smith, T. A. (1996).  Mathematics achievement in the middle school years: IEA'S third international mathematics and

     science study (TIMSS).  Chestnut Hill, MA: Center for the Study of Testing, Evaluation, and

Educational Policy, Boston College.

Biggs, J. B. & Collis, K. F. (1991). Multimodal learning and intelligent behavior. In H. Rowe (Ed.) Intelligence: Reconceptualiztion and measurement (pp. 57-76). Hillsdale, NJ: Erlbaum.

Bright, G. W., & Friel, S. N. (1998). Graphical Representations: Helping students interpret data. In S. P. Lajoie (Ed.), Reflectioins on statistics: Learning, teaching, and assessment in grades K-12 (pp. 63-88). Mahwah, NJ: Erlbaum.

Cobb, P. (1999). Individual and collective mathematical development: The case of statistical data analysis. Mathematical Thinking and Learning, 1 (1), 5-43.

Curcio, F. R. & Artzt, A. F. (1997). Assessing students’ statistical problem-solving behaviors in a small-group setting. In I. Gal & J. Garfield (Eds.), The assessment challenge in statistics education (pp. 123-138). Amsterdam, The Netherlands: IOS Press.

Fennema, E., Carpenter, T. P., Franke, M. L., Levi, L, Jacobs, V. R. & Empson, S. B. (1996). A longitudinal study of research to use children's thinking in mathematics instruction. Journal for Research in Mathematics Education, 27, 403-434.

Jones, G. A., Thornton, C. A., Langrall, C. W., Mooney, E., Perry, B., & Putt, I. (in press). A framework for assessing students’ statistical thinking. Mathematics Thinking and Learning.

Miles, M. B. & Huberman, A. M. (1994). Qualitative data analysis. Thousand Oaks, CA: Sage.

National Council of Teachers of Mathematics (2000). Principles and standards for school mathematics. Reston, VA: Author.

Siegel, S. & Castellan N. J., Jr. (1988). Nonparametric statistics for the behavioral sciences. New York: Mcgraw-Hill.

 Shaughnessy, J. M., Garfield, J., & Greer, B.  (1996). Data handling.  In A J. Bishop, M. A. Clements, C. Keitel, J. Kilpatrick, & C. Laborde (Eds.)  International handbook of mathematics education,(Part One, pp. 205-238). Dordrecht, The Netherlands: Kluwer.

Simon, M. A. (1995). Reconstructing mathematics pedagogy from a constructivist perspective. Journal for Research in Mathematics Education, 26, 114-145.

Zawojewski, J. S., & Heckman, D. S.  (1997).What do students know about data analysis, statistics, and probability? In P. A. Kenney & E. A. Silver (Eds.), Results from the sixth mathematics assessment of the national assessment of educational progress (pp. 195-223). Reston, VA: National Council of Teachers of Mathematics.