“…To annotate the summary content, we use Pyramid annotation (Nenkova et al, 2007), a summary content annotation that has been shown to correlate with a main ideas rubric used in an educational intervention with community college students (Passonneau et al, 2018). As in that study, we collect five reference summaries written by more advanced students, referred to as a wise crowd.…”
Section: Annotation Of Essay Contentmentioning
confidence: 99%
“…To follow the principles of pyramid annotation applied to education (Passonneau et al, 2018), we collected wise crowd essays written by sophomores who took the academic skills course in the previous year and by the trained raters on the project (advanced undergraduates), to constitute 2 Currently in submission to another venue. five references per topic.…”
We present a unique dataset of student sourcebased argument essays to facilitate research on the relations between content, argumentation skills, and assessment. Two classroom writing assignments were given to college students in a STEM major, accompanied by a carefully designed rubric. The paper presents a reliability study of the rubric, showing it to be highly reliable, and initial annotation on content and argumentation annotation of the essays.
“…To annotate the summary content, we use Pyramid annotation (Nenkova et al, 2007), a summary content annotation that has been shown to correlate with a main ideas rubric used in an educational intervention with community college students (Passonneau et al, 2018). As in that study, we collect five reference summaries written by more advanced students, referred to as a wise crowd.…”
Section: Annotation Of Essay Contentmentioning
confidence: 99%
“…To follow the principles of pyramid annotation applied to education (Passonneau et al, 2018), we collected wise crowd essays written by sophomores who took the academic skills course in the previous year and by the trained raters on the project (advanced undergraduates), to constitute 2 Currently in submission to another venue. five references per topic.…”
We present a unique dataset of student sourcebased argument essays to facilitate research on the relations between content, argumentation skills, and assessment. Two classroom writing assignments were given to college students in a STEM major, accompanied by a carefully designed rubric. The paper presents a reliability study of the rubric, showing it to be highly reliable, and initial annotation on content and argumentation annotation of the essays.
“…PyrEval differs from other automated pyramid tools in its focus on accurately isolating and weighting the distinct SCUs in the reference summaries. Three previous semi-automated pyramid tools used dynamic programming to score summaries, given a manual pyramid (Harnly et al, 2005;Passonneau et al, 2013Passonneau et al, , 2018. The first of these used unigram overlap to compare summaries to a pyramid.…”
Section: Related Workmentioning
confidence: 99%
“…A subsequent extension that used cosine similarity of latent vector representations of ngrams and SCUs, based on (Guo and Diab, 2012), had much better performance (Passonneau et al, 2013). This was extended further through use of a weighted set cover algorithm for scoring (Passonneau et al, 2018). PEAK was the first fully automated approach to construct a pyramid and score summaries (Yang et al, 2016).…”
Section: Related Workmentioning
confidence: 99%
“…Evaluation summaries (ESUM) are preprocessed in a similar fashion to convert them to segments represented as vectors. As in (Passonneau et al, 2018), PyrEval applies WMIN (Sakai et al, 2003) to find the optimal set of matches with pyramid SCUs. The remainder of this section describes each step.…”
Pyramid evaluation was developed to assess the content of paragraph length summaries of source texts. A pyramid lists the distinct units of content found in several reference summaries, weights content units by how many reference summaries they occur in, and produces three scores based on the weighted content of new summaries. We present an automated method that is more efficient, more transparent, and more complete than previous automated pyramid methods. It is tested on a new dataset of student summaries, and historical NIST data from extractive summarizers.
The explosive growth of online education environments is generating a massive volume of data, specially in text format from forums, chats, social networks, assessments, essays, among others. It produces exciting challenges on how to mine text data in order to find useful knowledge for educational stakeholders. Despite the increasing number of educational applications of text mining published recently, we have not found any paper surveying them. In this line, this work presents a systematic overview of the current status of the Educational Text Mining field. Our final goal is to answer three main research questions: Which are the text mining techniques most used in educational environments? Which are the most used educational resources? And which are the main applications or educational goals? Finally, we outline the conclusions and the more interesting future trends.
This article is categorized under:
Application Areas > Education and Learning
Ensemble Methods > Text Mining
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.