2003
DOI: 10.2307/3345847

Faculty Interjudge Reliability of Music Performance Evaluation

Abstract: Assessment of music performance in authentic contexts remains an underinvestigated area of research. This study examined one such context: the interjudge reliability of faculty evaluations of end-of-semester applied music performances. Brass (n = 4), percussion (n = 2), woodwind (n = 5), voice (n = 5), piano (n = 3), and string (n = 5) instructors evaluating a recent semester's applied music juries at a large university participated in the study. Each evaluator completed a criterion-specific rating…

Cited by 64 publications (113 citation statements)
References 11 publications (17 reference statements)
“…Research in musical adjudication has spawned a number of facet-factorial studies, elegantly summarized in Bergee (2003), as well as additional research. Studies have almost always dealt with intra- and interjudge reliability and issues of validity, as well as with correlations between test items.…”
Section: JRME 163
mentioning
confidence: 99%
“…With the general consensus on the importance of sound in the domain of music, as "an art of sound" (40), it follows that experts and key decision makers would privilege auditory-related ratings in professional evaluation and assessment, even when such items show insufficient reliability (41-45). However, despite all that is invested in the auditory domain, low interrater correlations suggest that this basis of evaluation is an unreliable process.…”
mentioning
confidence: 99%
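The low interrater correlations referred to above are typically summarized as pairwise Pearson correlations (or an intraclass correlation) across judges rating the same performances. A minimal illustrative sketch in Python, assuming a small hypothetical judges-by-performances rating matrix (the data are invented, not from the study):

import numpy as np

# Hypothetical ratings: rows = judges, columns = performances (1 = best score).
ratings = np.array([
    [1, 2, 3, 2, 4],
    [2, 2, 4, 1, 5],
    [1, 3, 3, 3, 4],
])

# Pairwise Pearson correlations between judges; the mean of the off-diagonal
# values is one simple summary of interjudge (interrater) agreement.
corr = np.corrcoef(ratings)
off_diag = corr[np.triu_indices_from(corr, k=1)]
print("pairwise correlations:", np.round(off_diag, 2))
print("mean interrater correlation:", round(float(off_diag.mean()), 2))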
“…Each dimension on Form B was rated lower than its counterpart on Form A (since the number "1" is considered the "best" score, lower scores or ratings are indicated by higher numbers). Paired-samples t-tests revealed significant differences between forms at the .05 level or lower in the following dimensions: tone (t = -2.27, p = .027), diction (t = -2.40, p = .02), blend (t = -3.36, p = .001), intonation (t = -2.34, p = .023), rhythm (t = -2.80, p = .007), balance (t = -4.09, p < .001), total score (t = -3.94, p < .001), and rating (…) (Garman, Barry, & DeCarbo, 1991; Bergee, 1988, 1989, 1993, 1997, 2003). The additional analysis of the means of dimensions, total score, and overall ratings corroborates the above comments (see Table 1 and the t-test results above). Form B yielded significantly different ratings in every dimension except interpretation, suggesting that the adjudicators in this setting rated the choirs more severely when using Form B.…”
mentioning
confidence: 99%
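The paired-samples t-tests reported in this excerpt compare each ensemble's score on Form A with its score on Form B for the same dimension, with each group acting as its own control. A minimal sketch of such a comparison, using scipy.stats.ttest_rel and invented tone scores (the data are hypothetical, not the study's):

import numpy as np
from scipy import stats

# Hypothetical tone ratings for the same choirs on two adjudication forms
# (1 = best, so higher numbers indicate lower ratings).
form_a_tone = np.array([2, 1, 3, 2, 2, 1, 3, 2])
form_b_tone = np.array([3, 2, 3, 3, 2, 2, 4, 3])

# Paired-samples t-test: differences are taken within each choir across forms.
t_stat, p_value = stats.ttest_rel(form_a_tone, form_b_tone)
print(f"t = {t_stat:.2f}, p = {p_value:.3f}")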