2012
DOI: 10.5815/ijmecs.2012.04.04
|View full text |Cite
|
Sign up to set email alerts
|

Domain Based Ontology and Automated Text Categorization Based on Improved Term Frequency – Inverse Document Frequency

Abstract: Abstract-In recent years there has been a massive growth in textual information in textual information especially in the internet. People now tend to read more e-books than hard copies of the books. While searching for some topic especially some new topic in the internet it will be easier if someone knows the prerequisites and post-requisites of that topic. It will be easier for someone searching a new topic. Often the topics are found without any proper title and it becomes difficult later on to find which do… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
3
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 4 publications
0
3
0
Order By: Relevance
“…For semantic detection, the candidate semantic representation should be obtained first, there are some state-of-art methods such as keyword-based representation [29] and semantic-based representation [30]. Our ground-truth labels are some short sentences, and those sentences of a video might be composed of different words.…”
Section: Semantic Reconstruction Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…For semantic detection, the candidate semantic representation should be obtained first, there are some state-of-art methods such as keyword-based representation [29] and semantic-based representation [30]. Our ground-truth labels are some short sentences, and those sentences of a video might be composed of different words.…”
Section: Semantic Reconstruction Networkmentioning
confidence: 99%
“…To avoid the influence of semantic represents methods, we built the semantic representation tags by the manual selection which are K most common words in the training set. Manual selection is not feasible if the dataset is extremely large, then the semantic representation tags should build by semantic represent methods such as TF-IDF [29]. The task of semantic detection can be seen as a multi-label classification task.…”
Section: Semantic Reconstruction Networkmentioning
confidence: 99%
“…Based on these representations, they compute multiple clustering results using KMeans. Ray and Chandra [4] proposed an automated text categorization technique that will categorize the uncategorized documents. The idea was based on the Term Frequency -Inverse Document Frequency (tf-idf) method and a dependency graph is also provided in the domain based ontology so that the users can visualize the relations among the terms.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, the shallow visual feature is extracted based on the statistical pixel values of each category block. In addition, Term Frequency -Inverse Document Frequency (TF-IDF) [20] is also applied as a weighting method to reinforce the distinction between text format category blocks.…”
Section: Introductionmentioning
confidence: 99%