ISBN 978-1-945626-22-7 ACL anthology W16-31xxii
PrefaceThe fourth BioASQ workshop on biomedical semantic indexing and question answering took place in Berlin, Germany on August 13th, 2016 and was hosted by the Humboldt University. The workshop was supported by the BioASQ project 1 , which organizes the corresponding annual challenge. The goals of the workshop were to present the results of the fourth BioASQ challenge and further the interaction with the wider community of biomedical semantic indexing and question answering. The presenters represented research teams from different parts of the globe and with different viewpoints to the problem. This contributed to a very lively and interesting discussion among the participants of the workshop. Six papers were presented during the workshop. All were selected by peer review for presentation. This volume includes 7 papers and one abstract: The first paper gives an overview of the challenge, including especially the datasets that were used throughout the challenges and the overall results achieved by the participants. The remaining six papers are those presented at the workshop. The first of these papers is a new extension of the MTI system. In particular, Learning to Rank methodology is used as a boosting component of the MTI system. The second paper present a system which includes several multi-label classifiers (MLC) that are combined in ensembles. Elastic-Search for indexing is the object of discourse of the third workshop paper. The fourth paper presents a system which uses TmTool in addition to MetaMap, to identify possible biomedical named entities, especially out-of-vocabulary concepts. They also introduced a unified classification interface for judging the relevance of each retrieved concept, document, and snippet, which can combine the relevant scores evidenced by various sources. The system presented in the fifth paper relies on the Hana Database for text processing. It uses the Stanford CoreNLP package for tokenizing the questions. Each of the tokens is then sent to the BioPortal and to the Hana database for concept retrieval. The concepts retrieved from the two stores are finally merged to a single list that is used to retrieve relevant text passages from the documents at hand. The last paper focuses on the retrieval of relevant documents and snippets. The proposed system uses a cluster-based language model. Then, it reranks the retrieved top-n sentences using five independent similarity models based on shallow semantic analysis.Finally, the proceedings also include the abstract of one paper that was presented in the poster session only, which describes an approach for extending the web services in order to retrieve the relevant documents, concepts, snippets and triples for the question-answering task.We wish to thank all who participated to the success of this workshop, especially the authors, reviewers, speakers and participants.
AbstractThe goal of this task is to push the research frontier towards hybrid information systems. We aim to promote syste...