IEEE International Workshop on Semantic Computing and Systems 2008
DOI: 10.1109/wscs.2008.36
|View full text |Cite
|
Sign up to set email alerts
|

MSVM-kNN: Combining SVM and k-NN for Multi-class Text Classification

Abstract: Text objects occurring in image can provide much useful information for content based information retrieval and counting applications, because they contain much minute information related to the documents contents. However, extracting text from images and videos is a very difficult task due to the varying font, size, color, orientation, and malformation of text objects. Although a large number of text extraction approaches have been reported in the past work, no specific designed text model and character featu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 33 publications
(15 citation statements)
references
References 13 publications
0
15
0
Order By: Relevance
“…The performance measures used are Recall, Precision, F1 -measure and Accuracy [28]. These can be calculated in following equations 7, 8 and 9 respectively:…”
Section: Results and Performance Evaluatonmentioning
confidence: 99%
“…The performance measures used are Recall, Precision, F1 -measure and Accuracy [28]. These can be calculated in following equations 7, 8 and 9 respectively:…”
Section: Results and Performance Evaluatonmentioning
confidence: 99%
“…In another approach [6], four different accuracy measurements are utilized to compare three different algorithms (Shereen Khoja Stemmer, Tim Buckwalter Morphological analyzer and Tri-literal Root Extraction Algorithm) with gold standard. The methods of each stemmer to remove affixes are different, for example, Khoja extracted the word to get the stem by removing the longest affixes whilst Buckwalter used all prefixes to compile only one lexicon and Tri-literal used weighting of word depending on their position.…”
Section: Related Workmentioning
confidence: 99%
“…In the best state, the registered accuracy algorithm was 75%. According to the results, the accuracy of the Khoja stemmer had the highest ranked place, then tri-literal algorithm and then followed by Buckwalter morphological analyzer [6].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Various supervised machine learning techniques have been proposed in literature for the automatic classification of text documents such as Naïve Bayes [1] [17], Neural Networks [20], SVM (Support Vector Machine) [22] [23] [24], Decision Tree and also by combining approaches [12] [21] [25].…”
Section: Modeling: Selection Of Appropriate Machine Learning Techniqumentioning
confidence: 99%