This paper reports on a shared task involving the assignment of ICD-9-CM codes to radiology reports. Two features distinguished this task from previous shared tasks in the biomedical domain. One is that it resulted in the first freely distributable corpus of fully anonymized clinical text. This resource is permanently available and will (we hope) facilitate future research. The other key feature of the task is that it required categorization with respect to a large and commercially significant set of labels. The number of participants was larger than in any previous biomedical challenge task. We describe the data production process and the evaluation measures, and give a preliminary analysis of the results. Many systems performed at levels approaching the inter-coder agreement, suggesting that human-like performance on this task is within the reach of currently available technologies.
Limited configuration interaction methods suffer from size-extensivity errors. The origin and behavior of these errors is discussed and new versions of single and multireference corrections are presented. Accuracy of the new and various other size-extensivity corrections used in the literature is discussed and compared in a series of model calculations and calculations on small molecules. None of the commonly used multireference corrections restores the size extensivity of multireference configuration interaction calculations. Our correction behaves correctly for the special case of a reference state composed from all singly and doubly excited configurations. Formulas for size extensivity corrections in the variational-perturbation method are given and discussed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.