The 2013 International Joint Conference on Neural Networks (IJCNN) 2013
DOI: 10.1109/ijcnn.2013.6707082
|View full text |Cite
|
Sign up to set email alerts
|

Automatic text categorization by a Granular Computing approach: Facing unbalanced data sets

Abstract: Text categorization is an interesting application of machine learning covering a wide range of possible applications, from document management systems to web mining. In designing such a system it is mandatory to correctly define both a suited preprocessing procedure and an effective document representation as closely related as possible to the semantic nature of document categories. To this aim, relying on a Granular Computing approach and considering a document as an ordered sequence of words, we propose a sy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…We generate N pat = 5 sequential patterns of mean length l sec = 8. The Training Set S tr is balanced (Possemato and Rizzi, 2013) and contains 100 patterned sequences (50 sequences per class) and 100 labeling sequences, while the Validation and Test Sets (S vs and S ts ) contain 50 patterned sequences (25 sequences per class) and 50 labeling sequences. The feature-based classifier adopted on the embedding space is based on the k-NN rule equipped with the Euclidean metric.…”
Section: Ecta2014-internationalconferenceonevolutionarycomputationthementioning
confidence: 99%
“…We generate N pat = 5 sequential patterns of mean length l sec = 8. The Training Set S tr is balanced (Possemato and Rizzi, 2013) and contains 100 patterned sequences (50 sequences per class) and 100 labeling sequences, while the Validation and Test Sets (S vs and S ts ) contain 50 patterned sequences (25 sequences per class) and 50 labeling sequences. The feature-based classifier adopted on the embedding space is based on the k-NN rule equipped with the Euclidean metric.…”
Section: Ecta2014-internationalconferenceonevolutionarycomputationthementioning
confidence: 99%