2017
DOI: 10.18547/gcb.2017.vol3.iss3.e58
|View full text |Cite
|
Sign up to set email alerts
|

Reducing the Search Space in Literature-Based Discovery by Exploring Outlier Documents: a Case Study in Finding Links Between Gut Microbiome and Alzheimer’s Disease

Abstract: Literature-based discovery tools have been often used to overcome the problem of fragmentation of science and to assist researchers in their process of cross-domain knowledge discovery.In this paper we propose a methodology for cross-domain literature-based discovery that focuses on outlier documents to reduce the search space of potential cross-domain links and to improve search efficiency. In a previous study, literature mining tools OntoGen for document clustering and CrossBee for cross-domain bridging term… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 30 publications
(60 reference statements)
0
6
0
Order By: Relevance
“…In terms of representation learning, our past LBD research that led to the lessons learned described in "Past LBD results and lessons learned" was based on using the standard TF-IDF weighted BoW vector representations of text documents [7,17,31,36]. On the other hand, the novel LBD methodology proposed in this paper in "Towards creative embeddings-based bisociative LBD" exploits contemporary representations of text documents using embeddings, given that current research in natural language processing demonstrates that representation learning using embeddings is much more effective than using the standard TF-IDF BoW vector representation.…”
Section: Embeddingsmentioning
confidence: 99%
See 3 more Smart Citations
“…In terms of representation learning, our past LBD research that led to the lessons learned described in "Past LBD results and lessons learned" was based on using the standard TF-IDF weighted BoW vector representations of text documents [7,17,31,36]. On the other hand, the novel LBD methodology proposed in this paper in "Towards creative embeddings-based bisociative LBD" exploits contemporary representations of text documents using embeddings, given that current research in natural language processing demonstrates that representation learning using embeddings is much more effective than using the standard TF-IDF BoW vector representation.…”
Section: Embeddingsmentioning
confidence: 99%
“…These experimental results indicate that it is justified that the search for b-terms can be focused on outlier documents, which contain a large majority of b-terms. Consequently, by focusing the exploration on outlier documents, the effort needed for finding cross-domain links is substantially reduced, as it requires to explore a When applying OntoGen on the documents of the new application domain using the Alzheimer's disease-gut microbiome domain pair [7], the OntoGen method uses domains A and C, and builds a joint document set A ∪ C . With this intention, two individual sets of documents (e.g., titles, abstracts, or full texts of scientific articles), one for each domain under research (namely, literature A on Alzheimer's disease and literature C on gut microbiome), were automatically retrieved from the PubMed database.…”
Section: Lesson Learned 1: Potential Of Outlier Documentsmentioning
confidence: 99%
See 2 more Smart Citations
“…For this reason, text mining and literature-based tools have become indispensable to assist researchers finding articles of interest. In this regard, a cross-domain knowledge discovery tool is described in this special issue by Cestnik et al [9]. This tool implements a search space reduction by eliminating potential outlier documents and, thus, improving search efficiency.…”
Section: Highlightsmentioning
confidence: 99%