Vincent Ng scite author profile

We present a noun phrase coreference system that extends the work of Soon et al. (2001) and, to our knowledge, produces the best results to date on the MUC-6 and MUC-7 coreference resolution data sets -F-measures of 70.4 and 63.4, respectively. Improvements arise from two sources: extra-linguistic changes to the learning framework and a large-scale expansion of the feature set to include more sophisticated linguistic knowledge.

show abstract

Automatic Keyphrase Extraction: A Survey of the State of the Art

Hasan¹,

Ng²

2014

359

186

View full text Add to dashboard Cite

show abstract

Identifying anaphoric and non-anaphoric noun phrases to improve coreference resolution

Cardie

2002

103

108

View full text Add to dashboard Cite

show abstract

Modeling Argument Strength in Student Essays

Persing¹,

Ng²

2015

107

101

View full text Add to dashboard Cite

While recent years have seen a surge of interest in automated essay grading, including work on grading essays with respect to particular dimensions such as prompt adherence, coherence, and technical quality, there has been relatively little work on grading the essay dimension of argument strength, which is arguably the most important aspect of argumentative essays. We introduce a new corpus of argumentative student essays annotated with argument strength scores and propose a supervised, feature-rich approach to automatically scoring the essays along this dimension. Our approach significantly outperforms a baseline that relies solely on heuristically applied sentence argument function labels by up to 16.1%.

show abstract

Modeling Stance in Student Essays

Persing¹,

Ng²

2016

View full text Add to dashboard Cite

Essay stance classification, the task of determining how much an essay's author agrees with a given proposition, is an important yet under-investigated subtask in understanding an argumentative essay's overall content. We introduce a new corpus of argumentative student essays annotated with stance information and propose a computational model for automatically predicting essay stance. In an evaluation on 826 essays, our approach significantly outperforms four baselines, one of which relies on features previously developed specifically for stance classification in student essays, yielding relative error reductions of at least 11.3% and 5.3%, in micro and macro F-score, respectively. 1 Previous approaches to stance classification have focused on three discussion/debate settings, namely congressional floor debates (

show abstract

Examining the role of linguistic knowledge sources in the automatic identification and classification of reviews

Dasgupta

Arifin

2006

187

View full text Add to dashboard Cite

This paper examines two problems in document-level sentiment analysis: (1) determining whether a given document is a review or not, and (2) classifying the polarity of a review as positive or negative. We first demonstrate that review identification can be performed with high accuracy using only unigrams as features. We then examine the role of four types of simple linguistic knowledge sources in a polarity classification system.

show abstract

End-to-End Argumentation Mining in Student Essays

Persing¹,

Ng²

2016

View full text Add to dashboard Cite

State-of-the-art systems for argumentation mining are supervised, thus relying on training data containing manually annotated argument components and the relationships between them. To eliminate the reliance on annotated data, we present a novel approach to unsupervised argument mining. The key idea is to bootstrap from a small set of argument components automatically identified using simple heuristics in combination with reliable contextual cues. Results on a Stab and Gurevych's corpus of 402 essays show that our unsupervised approach rivals two supervised baselines in performance and achieves 73.5−83.7% of the performance of a state-of-the-art neural approach.

show abstract

Efficient mining of association rules in distributed databases

Cheung

et al. 1996

IEEE Trans. Knowl. Data Eng.

277

View full text Add to dashboard Cite

Many sequential algorithms have been proposed for mining of association rules. However, very little work has been done in mining association rules in distributed databases. A direct application of sequential algorithms to distributed databases is not effective, because it requires a large amount of communication overhead. In this study, an efficient algorithm, DMA, is proposed. It generates a small number of candidate sets and requires only O(n) messages for support count exchange for each candidate set, where n is the number of sites in a distributed database. The algorithm has been implemented on an experimental test bed and its performance is studied. The results show that DMA has superior performance when comparing with the direct application of a popular sequential algorithm in distributed databases.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Vincent Ng

Improving machine learning approaches to coreference resolution

Automatic Keyphrase Extraction: A Survey of the State of the Art

Identifying anaphoric and non-anaphoric noun phrases to improve coreference resolution

Modeling Argument Strength in Student Essays

Modeling Stance in Student Essays

Examining the role of linguistic knowledge sources in the automatic identification and classification of reviews

End-to-End Argumentation Mining in Student Essays

Efficient mining of association rules in distributed databases

Contact Info

Product

Resources

About