Learning-Free Text Categorization

Ruch, Patrick; Baud, Robert H.; Geissbühler, Antoine

doi:10.1007/978-3-540-39907-0_28

Cited by 12 publications

(8 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Although the automatic assignment of MeSH indexing terms to a body of biomedical text has been extensively studied in the literature (see for example [1], [5], [6], [7], [8], [9], [10]), several major aspects of the task are often misunderstood or understated. Most issues pertain to the following topics:

multi-label assignment

scalability

compliance with indexing policies

…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A recent advance in the automatic indexing of the biomedical literature

Névéol

Shooshan

Humphrey

et al. 2009

Journal of Biomedical Informatics

View full text Add to dashboard Cite

The volume of biomedical literature has experienced explosive growth in recent years. This is reflected in the corresponding increase in the size of MEDLINE®, the largest bibliographic database of biomedical citations. Indexers at the U.S. National Library of Medicine (NLM) need efficient tools to help them accommodate the ensuing workload. After reviewing issues in the automatic assignment of Medical Subject Headings (MeSH® terms) to biomedical text, we focus more specifically on the new subheading attachment feature for NLM’s Medical Text Indexer (MTI). Natural Language Processing, statistical, and machine learning methods of producing automatic MeSH main heading/subheading pair recommendations were assessed independently and combined. The best combination achieves 48% precision and 30% recall. After validation by NLM indexers, a suitable combination of the methods presented in this paper was integrated into MTI as a subheading attachment feature producing MeSH indexing recommendations compliant with current state-of-the-art indexing practice.

show abstract

multi-label assignment

scalability

compliance with indexing policies

…”

Section: Introductionmentioning

confidence: 99%

“…Note that in [6], MeSH main headings are referred to as “MeSH categories”. Most efforts addressing MeSH indexing attempt to tackle indexing by solely using main headings which involves about 24,000 categories [1], [7]. However, in practice, MeSH indexing terms also include main heading/subheading pairs.…”

Section: Introductionmentioning

confidence: 99%

A recent advance in the automatic indexing of the biomedical literature

Névéol

Shooshan

Humphrey

et al. 2009

Journal of Biomedical Informatics

View full text Add to dashboard Cite

show abstract

“…For a short introduction on automatic text categorization in MEDLINE, the reader is referred to the NLM's indexing initiative [ 9 ]; for a detailed presentation of our vector space engine and a comparison with state-of-the-art systems, including NLM's tools, see [ 3 ](in this joint evaluation between four retrieval systems, our engine showed competitive performances) [ 10 ]. For a complete overview and evaluation of our categorization system applied on Medical Subject Headings and on the Gene Ontology, see [ 11 ].…”

Section: Methodsmentioning

confidence: 99%

Automatic medical encoding with SNOMED categories

Ruch

Gobeill

Lovis

et al. 2008

BMC Med Inform Decis Mak

Self Cite

View full text Add to dashboard Cite

show abstract

“…Il faut signaler que l'équipe CISMeF a réalisé en 2007 une évaluation de l'indexation automatique [26] [29] et NomIndex [14], cela en utilisant un corpus en français « misc » et les ressources en français du corpus « ENFR » [30]. Pour un rang égal à 10 (ce rang représente le nombre de mots clés classés selon un score calculé), les valeurs de rappel Cette comparaison entre les méthodes d'indexation automatique reste approximative, car les corpus de test sont différents et indexés manuellement par des indexeurs différents.…”

Section: Catégories Des Erreurs Fréquences Relatives Des Catégories Dunclassified