2012
DOI: 10.7232/iems.2012.11.1.087
|View full text |Cite
|
Sign up to set email alerts
|

An Optimal Weighting Method in Supervised Learning of Linguistic Model for Text Classification

Abstract: This paper discusses a new weighting method for text analyzing from the view point of supervised learning. The term frequency and inverse term frequency measure (tf-idf measure) is famous weighting method for information retrieval, and this method can be used for text analyzing either. However, it is an experimental weighting method for information retrieval whose effectiveness is not clarified from the theoretical viewpoints. Therefore, other effective weighting measure may be obtained for document classifica… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2015
2015
2017
2017

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…Text mining is a knowledge discovery technology that enables researchers to discern patterns and trends based on unstructured text. It is possible to extract hidden knowledge using approaches such as natural language analysis, information retrieval, information extraction, and data mining [17,[39][40][41][42][43][44][45][46]. Patent documents contain lengthy and rich explanations in technical and legal terminologies [47,48].…”
Section: Topic Modeling Of Patentsmentioning
confidence: 99%
“…Text mining is a knowledge discovery technology that enables researchers to discern patterns and trends based on unstructured text. It is possible to extract hidden knowledge using approaches such as natural language analysis, information retrieval, information extraction, and data mining [17,[39][40][41][42][43][44][45][46]. Patent documents contain lengthy and rich explanations in technical and legal terminologies [47,48].…”
Section: Topic Modeling Of Patentsmentioning
confidence: 99%
“…det.M / D 1:In order to capture the specificities of each category c 1 ; : : : ; c N of documents,Mikawa et al [2012] extend the above work by learning N individual matrices M i . det.M / D 1:In order to capture the specificities of each category c 1 ; : : : ; c N of documents,Mikawa et al [2012] extend the above work by learning N individual matrices M i .…”
mentioning
confidence: 99%