Proceedings of the 2nd International Workshop on Patent Information Retrieval 2009
DOI: 10.1145/1651343.1651357
|View full text |Cite
|
Sign up to set email alerts
|

Phrase-based document categorization revisited

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…We used the Balanced Winnow classifier (Dagan et al, 1997;Littlestone, 1988) implemented in the Linguistic Classification System (LCS; Koster et al, 2003;Koster and Beney, 2009). This algorithm assigns two weights (w + and w − )…”
Section: Classification Algorithmmentioning
confidence: 99%
“…We used the Balanced Winnow classifier (Dagan et al, 1997;Littlestone, 1988) implemented in the Linguistic Classification System (LCS; Koster et al, 2003;Koster and Beney, 2009). This algorithm assigns two weights (w + and w − )…”
Section: Classification Algorithmmentioning
confidence: 99%
“…As a first step, there is a need to impose some matrix structure on the unstructured data so that it can be accessible to the existing mining algorithms. The most common approach is to create a term document matrix by extracting terms that lead the columns and rows led by documents (Koster and Beney, 2009). Extracting all terms can lead to dimension curse and affect the algorithm efficiency; hence, terms are selected based on the frequency of occurrence.…”
Section: Introductionmentioning
confidence: 99%
“…Textual reviews should be converted into the matrix format before applying any clustering algorithm. Many text mining [12,13] methods use TF-IDF approach, to represent documents [14], but it assumes all words are independent while words usually occur in contextual groups or phrases [15,16]. Table I specifies the significant mile stones in the evolution of context-based hotel recommender system.…”
Section: Introductionmentioning
confidence: 99%