Text categorization for multiple users based on semantic features from a machine-readable dictionary

Liddy, Elizabeth D.; Paik, Woojin; Yu, Edmund S.

doi:10.1145/183422.183425

Cited by 24 publications

(12 citation statements)

References 2 publications

“…All these research endeavours have encouraged other researchers to continue and expand this work by synthesizing term level statistical techniques with epiphanic semantic processing in order to improve efficiency and effectiveness of automatic text categorization systems (Liddy, Paik & Yu, 1994). All these approaches are noteworthy efforts to address the differences between term expressions and term meanings however machine understanding and learning still rudimentarily remains an approximation of the anthropologic ability to read and understand.…”

Section: Information Retrieval Systems (mentioning)

confidence: 99%

Information Retrieval Systems: A Perspective on Human Computer Interaction

Petratos

2006

IISIT

Traditional information systems design and development methodologies tend to overly focus on the technical details of the system such as memory management, system internals, algorithms and modules. It is not unusual for system designers and developers to often completely omit from the thought process the human element. This article offers a new information systems perspective particularly for information retrieval systems with a focus on human computer interaction.

“…Standard statistical approaches exist for automated thesaurus generation [Salton and McGill 1983]. Several other approaches based on machine learning and NLP techniques have also been reported in the literature for automatic term discovery [Futrelle et al 1994;Guntzer et al 1988] and refinement [Liddy et al 1994]. Of course, users should also have the option to introduce new terms to suit their individual needs.…”

Section: Future Extensions of SIFTER (mentioning)

confidence: 99%

A multilevel approach to intelligent information filtering

Mostafa, Mukhopadhyay, Palakal, et al.

1997

ACM Trans. Inf. Syst.

139

In information-filtering environments, uncertainties associated with changing interests of the user and the dynamic document stream must be handled efficiently. In this article, a filtering model is proposed that decomposes the overall task into subsystem functionalities and highlights the need for multiple adaptation techniques to cope with uncertainties. A filtering system, SIFTER, has been implemented based on the model, using established techniques in information retrieval and artificial intelligence. These techniques include document representation by a vector-space model, document classification by unsupervised learning, and user modeling by reinforcement learning. The system can filter information based on content and a user's specific interests. The user's interests are automatically learned with only limited user intervention in the form of optional relevance feedback for documents. We also describe experimental studies conducted with SIFTER to filter computer and information science documents collected from the Internet and commercial database services. The experimental results demonstrate that the system performs very well in filtering documents in a realistic problem setting.

“…Many text classification problems [4], [5], [6] run the learning algorithm on words from a standard dictionary. The use of a dictionary allows the use of only standard words and thus reduces unwanted noise that can come in the forms described above.…”

Section: Introduction (mentioning)

confidence: 99%

Use of a visual word dictionary for topic discovery in images

Kandasamy, Rodrigo

2010

2010 Fifth International Conference on Information and Automation for Sustainability

The bag of visual words model has seen immense success in addressing the problem of image classification. Algorithms using this model generate the codebook of visual words by vector quantizing the features (such as SIFT) of the images to be classified. However, a codebook so formed tends to get biased by the nature of the dataset. In this paper we propose an alternative method to create the codebook for the dataset of images to be classified. Instead of directly using the dataset itself we first create a visual word dictionary by studying the SIFT features of a universal set of images. The codebook for the images to be classified is derived from this dictionary. To assess the effectiveness of the codebook thus derived, we classify the images using Probabilistic Latent Semantic Analysis in an unsupervised setting and Naive Bayes’ classification in a supervised setting. The use of a dictionary achieves results comparable to those obtained via a codebook formed from the dataset itself in much less computational time. We also use the dictionary to demonstrate how analogies can be drawn between visual words and linguistic words and present an analysis on one such analogy: that of polysemy.
