1994
DOI: 10.1145/183422.183425
|View full text |Cite
|
Sign up to set email alerts
|

Text categorization for multiple users based on semantic features from a machine-readable dictionary

Abstract: The text categorization module described here provides a front-end filtering function for the larger DR-LINK text retrieval system [Liddy and Myaeing 1993]. The model evaluates a large incoming stream of documents to determine which documents are sufficiently similar to a profile at the broad subject level to warrant more refined representation and matching. To accomplish this task, each substantive word in a text is first categorized using a feature set based on the semantic Subject Field Codes (SFCs) assigne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

1997
1997
2010
2010

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(12 citation statements)
references
References 2 publications
0
12
0
Order By: Relevance
“…All these research endeavours have encouraged other researchers to continue and expand this work by synthesizing term level statistical techniques with epiphanic semantic processing in order to improve efficiency and effectiveness of automatic text categorization systems (Liddy, Paik & Yu, 1994). All these approaches are noteworthy efforts to address the differences between term expressions and term meanings however machine understanding and learning still rudimentarily remains an approximation of the anthropologic ability to read and understand.…”
Section: Information Retrieval Systemsmentioning
confidence: 99%
“…All these research endeavours have encouraged other researchers to continue and expand this work by synthesizing term level statistical techniques with epiphanic semantic processing in order to improve efficiency and effectiveness of automatic text categorization systems (Liddy, Paik & Yu, 1994). All these approaches are noteworthy efforts to address the differences between term expressions and term meanings however machine understanding and learning still rudimentarily remains an approximation of the anthropologic ability to read and understand.…”
Section: Information Retrieval Systemsmentioning
confidence: 99%
“…Standard statistical approaches exist for automated thesaurus generation [Salton and McGill 1983]. Several other approaches based on machine learning and NLP techniques have also been reported in the literature for automatic term discovery [Futrelle et al 1994;Guntzer et al 1988] and refinement [Liddy et al 1994]. Of course, users should also have the option to introduce new terms to suit their individual needs.…”
Section: Future Extensions Of Siftermentioning
confidence: 99%
“…Many text classification problems [4], [5], [6] run the learning algorithm on words from a standard dictionary. The use of a dictionary allows the use of only standard words and thus reduces unwanted noise that can come in the forms described above.…”
Section: Introductionmentioning
confidence: 99%