2024
DOI: 10.1037/aca0000470

Reexamining subjective creativity assessments in science tasks: An application of the rater-mediated assessment framework and many-facet Rasch model.

Abstract: Subjective creativity assessments, originally developed as the Consensual Assessment Technique (CAT), rely on human raters to score the creativity of products. Several controversial issues in this approach relate to tasks, subjects, raters’ qualifications and performance, and methods for analyzing rating scores. This study addressed these issues under the theoretical framework of rater-mediated assessment and Rasch measurement theory. Data were collected from three groups of raters with different lev…

Cited by 7 publications (5 citation statements)
References 75 publications (142 reference statements)
“…The application of different criteria seems to be a more serious issue than that of different scoring procedures or scales, because consensus can hardly be reached over these criteria even among judges with similar backgrounds (Long, 2014b; Long & Pang, 2015). Therefore, given that variations in rating criteria and procedures are mainly related to raters, creativity researchers must pay more attention to issues such as rater effects, rater cognition, and how they affect the validity of the assessment (Kaufman et al., 2008; Long, 2014b; Long & Pang, 2015; Primi et al., 2019; Wang & Long, in press).…”
Section: Results (mentioning)
confidence: 99%
“…Primi et al. (2019) used Many-Facet Rasch Modeling (MFRM) to model rater effects and to investigate the effect of missingness on product-based creativity assessment. New methodological theories and tools have also been proposed, such as Myszkowski and Storme’s (2019) Judge Response Theory and Myszkowski’s (2019) jrt package for R. Wang and Long (in press) recently examined rater effects under the theoretical framework of rater-mediated assessment, which refers to “assessments in which raters evaluate test-taker performances and use rating scale categories to describe the level of performance on one or more domains” (Engelhard & Wind, 2019, p. 475).…”
Section: Results (mentioning)
confidence: 99%
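
For readers unfamiliar with MFRM, the model referenced above is usually written as a rating-scale extension of the Rasch model with an added rater facet. A minimal sketch in standard notation (the notation is generic, not copied from Primi et al. or Wang and Long) is:

```latex
% Many-facet Rasch model, rating-scale form: the log-odds that
% rater j assigns person n's product i category k rather than
% category k-1 decompose additively into the facet parameters.
\[
  \ln\!\left(\frac{P_{nijk}}{P_{nij(k-1)}}\right)
  = \theta_n - \delta_i - \lambda_j - \tau_k
\]
% theta_n  : latent creativity of person n
% delta_i  : difficulty of task/product i
% lambda_j : severity of rater j
% tau_k    : threshold between adjacent rating categories k-1 and k
```

The rater-severity term is what makes rater effects directly estimable and comparable, which is the point of applying MFRM to rater-mediated creativity assessment.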
“…In the field of creativity research, the process of selecting and training judges, enacting the coding scheme, and calculating the reliability of the ratings is most often seen as a means to an end: the main intention in the field is to study originality or creativity itself, and therefore, we aim for reliable codes in order to methodologically support our substantive work. However, a small but notable body of research has focused on the process of human rating eo ipso , empirically investigating and making recommendations on the types of tasks, participants, coders, and trainings that best support rater agreement and reliability (see Wang & Long, 2022 for a recent example). In this paper, we build on that existing literature to develop a model to inform the field's understanding of rater variance when coding the originality of children's responses to creativity assessments.…”
Section: Summary of Past Work on Judgment Quality in Creativity Assessments (mentioning)
confidence: 99%
“…In accordance with standard scoring rubrics, 30 ships were randomly selected and scored by two external senior teachers, each with more than fifteen years of teaching experience and more than five years of related research experience. A consensual assessment technique was used in this research (Wang & Long, 2022; Weiss et al., 2021). The correlation coefficients for design concept variation, novelty, and feasibility were 0.969, 0.853, and 0.698, respectively, indicating a high level of reliability (Mugenda & Mugenda, 2003).…”
Section: Research Design and Implementation (mentioning)
confidence: 99%
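
The two-rater reliability check quoted above amounts to a per-dimension Pearson correlation between the raters’ scores. A minimal sketch with hypothetical data (the variable names and simulated scores are illustrative, not from the cited study):

```python
# Per-dimension Pearson correlation between two raters' scores,
# mirroring the two-rater CAT reliability check quoted above.
import numpy as np

rng = np.random.default_rng(0)
dimensions = ["variation", "novelty", "feasibility"]

# Hypothetical data: 30 products, 2 raters, 3 dimensions on a 1-5 scale.
rater_a = rng.integers(1, 6, size=(30, 3)).astype(float)
rater_b = np.clip(rater_a + rng.normal(0.0, 0.8, size=(30, 3)), 1.0, 5.0)

for d, name in enumerate(dimensions):
    r = np.corrcoef(rater_a[:, d], rater_b[:, d])[0, 1]
    print(f"{name}: r = {r:.3f}")
```

The cited study reported coefficients of 0.969, 0.853, and 0.698; agreement indices such as the intraclass correlation would be a common alternative for this kind of check.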