Classifier subset selection for biomedical named entity recognition

Dimililer, Nazife; Varoğlu, Ekrem; Altınçay, Hakan

doi:10.1007/s10489-008-0124-0

Cited by 14 publications

(13 citation statements)

References 40 publications

(93 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Methods using (8) are referred as Max-Dependency (MD) approaches. Regardless of the searching algorithm, MD faces difficulties in estimating the multivariate density functions, which requires not only a high computational cost but also a large number of samples.…”

Section: Related Workmentioning

confidence: 99%

“…Therefore, removing irrelevant features helps speed up the learning process and alleviates the effect of the curse of dimensionality. Due to the capabilities, feature selection has been largely applied in many applications, including text classification [6,12], bio-informatics [8,24,32], intrusion detection [18,27], and image retrieval [5,9]. Furthermore, feature selection facilitates the data visualization and understanding [14,17,31].…”

mentioning

confidence: 99%

See 1 more Smart Citation

A novel feature selection method based on normalized mutual information

2011

View full text Add to dashboard Cite

In this paper, a novel feature selection method based on the normalization of the well-known mutual information measurement is presented. Our method is derived from an existing approach, the max-relevance and minredundancy (mRMR) approach. We, however, propose to normalize the mutual information used in the method so that the domination of the relevance or of the redundancy can be eliminated. We borrow some commonly used recognition models including Support Vector Machine (SVM), k-Nearest-Neighbor (kNN), and Linear Discriminant Analysis (LDA) to compare our algorithm with the original (mRMR) and a recently improved version of the mRMR, the Normalized Mutual Information Feature Selection (NMIFS) algorithm. To avoid data-specific statements, we conduct our classification experiments using various datasets from the UCI machine learning repository. The results confirm that our feature selection method is more robust than the others with regard to classification accuracy.

show abstract

Section: Related Workmentioning

confidence: 99%

mentioning

confidence: 99%

A novel feature selection method based on normalized mutual information

2011

View full text Add to dashboard Cite

show abstract

“…The recall value is identical in most of the cases but, when last quartile contains valid definitions, the value is lower. Considering the reduced amount of definitions available in Acronym Finder (10)(11)(12)(13)(14)(15)(16)(17)(18)(19)(20), this fact affects significantly the final performance (lower FMeasure). Even though, in most cases, results are slightly better due to the improvement in selection accuracy.…”

Section: Web-based Reliability Evaluationmentioning

confidence: 99%

“…New acronyms are defined every day for almost every possible domain of knowledge. This is especially evident in domains such as biomedicine [15,39]. • They are highly polysemic.…”

Section: Introductionmentioning

confidence: 99%

Automatic extraction of acronym definitions from the Web

Sánchez

Isern

2009

Appl Intell

View full text Add to dashboard Cite

Acronyms are widely used to abbreviate and stress important concepts. The discovery of the definitions associated to an acronym is an important matter in order to support language processing and knowledge-related tasks as information retrieval, ontology mapping or question answering. Acronyms represent a very dynamic and unbounded topic that is constantly evolving. Manual attempts to compose a global scale dictionary of acronym-definition pairs result in an overwhelming amount of work and limited results. Attending these shortcomings, this paper presents an automatic and unsupervised methodology to generate acronyms and extract their potential definitions from the Web. The method has been designed to minimise the set of constraints, offering a domain and -partially-language independent solution, and to exploit the Web in order to create large and general acronym-definition sets. Results have been manually evaluated against the largest manually built acronym repository: Acronym Finder. The evaluation shows that the proposed approach is able to improve the coverage of manual attempts maintaining a high precision.

show abstract

“…A huge amount of available online textual documents in the field of biomedicine leads to great difficulties for building question answering, or information retrieval systems [DVA09]. Luckily, multi-document summarization can assist extracting the essential information from those documents and hereby benefit those systems.…”

Section: Overviewmentioning

confidence: 99%

Mining the Online Social Network Data: Influence, Summarization, and Organization

Li¹

View full text Add to dashboard Cite

Classifier subset selection for biomedical named entity recognition

Cited by 14 publications

References 40 publications

A novel feature selection method based on normalized mutual information

A novel feature selection method based on normalized mutual information

Automatic extraction of acronym definitions from the Web

Mining the Online Social Network Data: Influence, Summarization, and Organization

Contact Info

Product

Resources

About