We analyze some of the fundamental design challenges and misconceptions that underlie the development of an efficient and robust NER system. In particular, we address issues such as the representation of text chunks, the inference approach needed to combine local NER decisions, the sources of prior knowledge and how to use them within an NER system. In the process of comparing several solutions to these challenges we reach some surprising conclusions, as well as develop an NER system that achieves 90.8 F 1 score on the CoNLL-2003 NER shared task, the best reported result for this dataset.
Making complex decisions in real world problems often involves assigning values to sets of interdependent variables where an expressive dependency structure among these can influence, or even dictate, what assignments are possible. Commonly used models typically ignore expressive dependencies since the traditional way of incorporating non-local dependencies is inefficient and hence leads to expensive training and inference.The contribution of this paper is two-fold. First, this paper presents Constrained Conditional Models (CCMs), a framework that augments linear models with declarative constraints as a way to support decisions in an expressive output space while maintaining modularity and tractability of training. The paper develops, analyzes and compares novel algorithms for CCMs based on Hidden Markov Models and Structured Perceptron. The proposed CCM framework is also compared to task-tailored models, such as semi-CRFs.Second, we propose CoDL, a constraint-driven learning algorithm, which makes use of constraints to guide semi-supervised learning. We provide theoretical justification for CoDL along with empirical results which show the advantage of using declarative constraints in the context of semi-supervised training of probabilistic models.
Schema rriotrhirrg is the pmblem of Jiiidirrg a correspondence bemeen schenras -relariorial darabase schernas or web reposirories. The od-hoc op/Jroaclm perform poorly on real-world schenm dire to the nbiirrdarice of rroii-dicrionan abbreviurioiis. Our opiiroadt 10 schenia inarching is based or1 deep iiridersrondiirg of the sclierna, 11s-irig an imegmtive probobilistir generative rnodel as a f i w i enark for conipariiig condidare irirerpretatiorrs. In ernpiriral ei~alirorioii, the p~p o s e d model perfonned well in interprerirrg relorioiral scl~e,,ms. and e.rhibired his/, resistmire to iioise 61 the model prarrreters.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.