An examination of alternate assessment durations when assessing multiple-skill computational fluency: The generalizability and dependability of curriculum-based outcomes within the context of educational decisions

Christ, Theodore J.; Johnson-Gros, Kristin N.; Hintze, John M.

doi:10.1002/pits.20107

Cited by 24 publications

(28 citation statements)

References 16 publications

“…Hence, the median score of three 2-min probes should be used when making decisions (Hintze et al, 2002). The results of some studies provide support for repeated 2-min administrations to derive the median level of performance (e.g., Hintze et al, 2002), whereas others support the use of extended administration durations (Christ, Johnson-Gros, & Hintze, 2005).…”

Section: Stimulus Materials and Probe Construction (mentioning)

confidence: 96%

Implications of Recent Research

Christ, Scullin, Tolbize, et al. 2008

Assessment for Effective Intervention

Self Cite

Curriculum-based measurement of mathematics (CBM-M) comprises a set of procedures and instrumentation to assess the level and trend of student achievement in early mathematics. The purpose of this article is to review the recent research and psychometric evidence for CBM-M. Although recent developments in CBM-M include procedures to assess early numeracy and application problems, this review focuses exclusively on computation assessment. The results of this review provide evidence that CBM-M is sufficiently reliable and valid for some applications; however, interpretation must be informed by the context and the scope of assessment domain. Mathematics computation is a subdomain of mathematics curriculum and assessment, and therefore, the validity of CBM-M is limited by its construct representation (i.e., stimulus set and task demands). Nevertheless, the review provides support for ongoing development and use of CBM-M as both a general outcome measure and subskill mastery measure for computation. Implications for research and practice are discussed.

“…We also provided evidence for concurrent validity by showing high correlations between students' scores on the probes and their scores on the criterion measure of the MAIT (Christ et al., 2008; Shapiro et al., 2006; Thurber et al., 2002). Using three methods for testing reliability, namely test–retest (Christ et al., 2005), alternate format (Burns & Vanderhyden, 2006; Petscher, Cummings, Biancarosa, & Fien, 2010; Thurber et al., 2002), and inter-rater reliability (Thurber et al., 2002), the results showed that the measures are reliable and adequate for use by teachers and practitioners in the Omani context.…”

Section: Discussion (mentioning)

confidence: 99%

“…There is evidence supporting the use of MC-CBM for monitoring students' progress (Stecker, Fuchs, & Fuchs, 2005), modifying instruction for students with varying abilities (Slavin & Lake, 2008), informing instructional groupings (McLeskey & Waldron, 2011), adapting instruction for students with disabilities (McMaster, Fuchs, Fuchs, & Compton, 2005), identifying students' academic strengths and weaknesses (Stecker, Fuchs, & Fuchs, 2005), supporting peer-assisted learning (Calhoon & Fuchs, 2003), and predicting students' performance on statewide assessments (Helwig, Anderson, & Tindal, 2002). The current study was informed by studies that examined performance indicators on MC-CBM to classify students (Burns, VanDerHeyden, & Jiban, 2006; VanDerHeyden & Burns, 2005), investigated the administration of MC-CBM on higher (Fuchs, Fuchs, Compton, Bryant, Hamlett, & Seethaler, 2007) compared to earlier grades (Clarke & Shinn, 2004; Hintze, Christ, & Keller, 2002; Jitendra, Dupuis, & Zaslofsky, 2014; Shapiro, Dennis, & Fu, 2015; Thurber et al., 2002), and used MC-CBM to screen for students with LD in mathematics (Christ, Johnson-Gros, & Hintze, 2005; Fuchs et al., 2007; Hintze et al., 2002).…”

Section: CBM in Mathematics (mentioning)

confidence: 99%

“…Although MC-CBM has gained robust empirical support, the findings concerning the psychometric properties of the MC-CBM used metric (i.e., digits correct per unit of time) and the assessment context (single-skill or multiple-skill) are inconclusive (Christ et al., 2005; Hintze et al., 2002; Methe, Briesch, and Hulac, 2015; Strait, Smith, Pender, Malone, Roberts, & Hall, 2015). Christ et al. (2005), for example, examined the generalizability and dependability of MC-CBM assessments across various assessment durations (1, 2, 3, 4, 5, and 6 minutes).…”

Section: CBM in Mathematics (mentioning)

confidence: 99%

Development of curriculum-based measurements in mathematical computations for Arab-speaking fourth grade students

Al-Shehhi, Emam, Al-Otaiba, et al. 2018

School Psychology International

In the Arab region, several assessments are available to evaluate student skills in mathematical computations. However, none of them uses formative evaluation to guide universal screening of struggling learners or students with learning disability (LD). The current study aimed to develop mathematical computation curriculum-based measurement (MC-CBM) for Arab-speaking fourth grade students, examine its psychometric properties, test its adequacy for use in an Arab context, namely Oman, determine an adequate time for its administration, and develop performance benchmarks. MC-CBM were administered to 528 fourth grade students. Results indicated that the developed measures were adequate for use in the Arab context. Receiver operating characteristic (ROC) curve analysis indicated good specificity and sensitivity estimates for the MC-CBM. Performance benchmarks were obtained using the 25th and 75th percentiles. Implications are discussed from a contextual perspective.

“…These can be conducted on various facets and are similar to the Spearman-Brown prophecy formula, which is used to determine how many items may be needed to obtain a reliable estimate of a trait (Cronbach et al., 1972). D studies have been employed to assess how dependability changes as a function of the length of probes (e.g., Christ, Johnson-Gros, & Hintze, 2005; Volpe et al., 2011) or the number of probes administered (e.g., Poncy, Skinner, & Axtell, 2005). Thus, a universal screening D study could examine the effect of increasing the number of occasions or ratings on the generalizability of the overall procedure (Webb, Shavelson, & Haertel, 2006).…”

Section: Method (mentioning)

confidence: 99%
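
The citation statement above compares D studies to the Spearman-Brown prophecy formula. As a rough illustration of that comparison, the sketch below implements the standard Spearman-Brown formula and the dependability (phi) coefficient for a simple one-facet person-by-occasion design. The function names are hypothetical, the one-facet design is an assumption made for brevity, and the variance components in the usage lines are made-up numbers, not values from any of the cited studies.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability when a measure is lengthened by `length_factor`."""
    return length_factor * reliability / (1 + (length_factor - 1) * reliability)


def length_factor_needed(reliability: float, target: float) -> float:
    """Lengthening factor required to raise `reliability` to `target`."""
    return target * (1 - reliability) / (reliability * (1 - target))


def dependability(var_person: float, var_occasion: float,
                  var_residual: float, n_occasions: int) -> float:
    """Phi coefficient for a one-facet person x occasion D study.

    Absolute error variance shrinks as scores are averaged over
    more occasions (assumed design; real studies may use more facets).
    """
    absolute_error = (var_occasion + var_residual) / n_occasions
    return var_person / (var_person + absolute_error)


# Illustrative values only (not taken from any cited study).
doubled = spearman_brown(0.5, 2)          # reliability after doubling length
factor = length_factor_needed(0.5, 0.8)   # how much longer to reach 0.80
phi_one = dependability(4.0, 1.0, 3.0, 1)
phi_four = dependability(4.0, 1.0, 3.0, 4)
```

In both functions the logic is the same: averaging over more items or occasions divides the error variance while leaving the person variance untouched, so the coefficient rises with diminishing returns.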