Rebecca Stone scite author profile

In this paper, we analyze the performance of name finding in the context of a variety of automatic speech recognition (ASR) systems and in the context of one optical character recognition (OCR) system. We explore the effects of word error rate from ASR and OCR, performance as a function of the amount of training data, and for speech, the effect of out-of-vocabulary errors and the loss of punctuation and mixed case I

show abstract

Audio Indexing of Arabic broadcast news

Billa¹,

Noamany²,

Srivastava³

et al. 2002

View full text Add to dashboard Cite

Algorithms that learn to extract information

Miller¹,

Crystal²,

Fox³

et al. 1996

View full text Add to dashboard Cite

All of BBN's research under the TIPSTER III program has focused on doing extraction by applying statistical models trained on annotated data, rather than by using programs that execute handwritten rules. Within the context of MUC-7, the SIFT system for extraction of template entities (TE) and template relations (TR) used a novel, integrated syntactic/semantic language model to extract sentence level information, and then synthesized information across sentences using in part a trained model for cross-sentence relations. At the named entity (NE) level as well, in both MET-1 and MUC-7, BBN employed a trained, HMM-based model. The results in these TIPSTER evaluations are evidence that such trained systems, even at their current level of development, can perform roughly on a par with those based on rules handtailored by experts. In addition, such trained systems have some significant advantages: • They can be easily ported to new domains by simply annotating fresh data. • The complex interactions that make rulebased systems difficult to develop and maintain can here be learned automatically from the training data. We believe that improved and extended versions of such trained models have the potential for significant further progress toward practical systems for information extraction.

show abstract

Experiments in multi-modal automatic content extraction

Ramshaw¹,

Boschee²,

Bratus³

et al. 2001

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Rebecca Stone

Demonstration of cue recruitment: Change in visual appearance by means of Pavlovian conditioning

Named entity extraction from noisy input

Audio Indexing of Arabic broadcast news

Algorithms that learn to extract information

Experiments in multi-modal automatic content extraction

Contact Info

Product

Resources

About