Xiaopeng Tao scite author profile

kNN classifier is widely used in text categorization, however, kNN has the large computational and store requirements, and its performance also suffers from uneven distribution of training data. Usually, condensing technique is resorted to reducing the noises of training data and decreasing the cost of time and space. Traditional condensing technique picks up samples in a random manner when initialization. Though random sampling is one means to reduce outliers, the extremely stochastic may lead to bad performance sometimes, that is, advantages of sampling may be suppressed. To avoid such a misfortune, we propose a variation of traditional condensing technique. Experiment results illustrate this strategy can solve above problems effectively.

show abstract

An Effective Method To Improve kNN Text Classifier

Hao

Tao

Zhang

et al. 2007

View full text Add to dashboard Cite

Time-Based Slope One Algorithm

Lin¹,

Tao²,

Yu³

View full text Add to dashboard Cite

Comma restoration using constituency information

Shieber

Tao

2003

View full text Add to dashboard Cite

Automatic restoration of punctuation from unpunctuated text has application in improving the fluency and applicability of speech recognition systems. We explore the possibility that syntactic information can be used to improve the performance of an HMM-based system for restoring punctuation (specifically, commas) in text. Our best methods reduce sentence error rate substantially-by some 20%, with an additional 8% reduction possible given improvements in extraction of the requisite syntactic information. 1 In an old unattributed joke, an English professor asks some students to punctuate the word sequence "Woman without her man is nothing". The male students preferred "Woman, without her man, is nothing." whereas the female proposed "Woman! Without her, man is nothing." No, it's not funny, but it does make the point.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xiaopeng Tao

Retrospective analysis of therapeutic effect and prognostic factors on early glottic carcinoma

An Improved Condensing Algorithm

An Effective Method To Improve kNN Text Classifier

Time-Based Slope One Algorithm

Comma restoration using constituency information

Contact Info

Product

Resources

About