Catherine Trapani scite author profile

Automated scoring models for the e-rater ® scoring engine were built and evaluated for the GRE ® argument and issue-writing tasks. Prompt-specific, generic, and generic with prompt-specific intercept scoring models were built and evaluation statistics such as weighted kappas, Pearson correlations, standardized difference in mean scores, and correlations with external measures were examined to evaluate the e-rater model performance against human scores. Performance was also evaluated across different demographic subgroups. Additional analyses were performed to establish appropriate agreement thresholds between human and e-rater scores for unusual essays and the impact of using e-rater on operational scores. The generic e-rater scoring model with operational prompt-specific intercept for the issue-writing task and prompt-specific e-rater scoring model for the argument writing task were recommended for operational use. The two automated scoring models were implemented to produce check scores at a discrepancy threshold of 0.5 with human scores.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Catherine Trapani

Validity of DSM‐IV Attention‐Deficit/Hyperactivity Disorder for Younger Children

Comparison of Human and Machine Scoring of Essays: Differences by Gender, Ethnicity, and Country

EVALUATION OF THE E‐RATER^® SCORING ENGINE FOR THE GRE^® ISSUE AND ARGUMENT PROMPTS

Contact Info

Product

Resources

About

Catherine Trapani

Validity of DSM‐IV Attention‐Deficit/Hyperactivity Disorder for Younger Children

Comparison of Human and Machine Scoring of Essays: Differences by Gender, Ethnicity, and Country

EVALUATION OF THE E‐RATER® SCORING ENGINE FOR THE GRE® ISSUE AND ARGUMENT PROMPTS

Contact Info

Product

Resources

About

EVALUATION OF THE E‐RATER^® SCORING ENGINE FOR THE GRE^® ISSUE AND ARGUMENT PROMPTS