2022
DOI: 10.48550/arxiv.2201.03425
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers

Abstract: Autograding short textual answers has become much more feasible due to the rise of NLP and the increased availability of question-answer pairs brought about by a shift to online education. Autograding performance is still inferior to human grading. The statistical and black-box nature of state-of-the-art machine learning models makes them untrustworthy, raising ethical concerns and limiting their practical utility. Furthermore, the evaluation of autograding is typically confined to small, monolingual datasets … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 40 publications
0
1
0
Order By: Relevance
“…Gomaa and Fahmy (2019) use pretrained skip-thought vectors and learn a logistic classifier over the component-wise product and absolute difference vectors. Schneider et al (2022) report promising results on a notpublicly-available dataset by learning embeddings for question-answer-pairs and utilize cosine similarity as distance metric.…”
Section: Introductionmentioning
confidence: 99%
“…Gomaa and Fahmy (2019) use pretrained skip-thought vectors and learn a logistic classifier over the component-wise product and absolute difference vectors. Schneider et al (2022) report promising results on a notpublicly-available dataset by learning embeddings for question-answer-pairs and utilize cosine similarity as distance metric.…”
Section: Introductionmentioning
confidence: 99%