2021
DOI: 10.3390/genes12070998
|View full text |Cite
|
Sign up to set email alerts
|

Toward a Coronavirus Knowledge Graph

Abstract: This study builds a coronavirus knowledge graph (KG) by merging two information sources. The first source is Analytical Graph (AG), which integrates more than 20 different public datasets related to drug discovery. The second source is CORD-19, a collection of published scientific articles related to COVID-19. We combined both chemo genomic entities in AG with entities extracted from CORD-19 to expand knowledge in the COVID-19 domain. Before populating KG with those entities, we perform entity disambiguation o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 55 publications
(59 reference statements)
0
11
0
Order By: Relevance
“…Knowledge graphs have been used for many applications, including drug discovery and repurposing, target detection, and prediction (Alshahrani and Hoehndorf 2018;Moon et al 2021;Zheng et al 2021;Alves et al 2021). Other applications include integration and analysis of heterogeneous COVID-19 data (Steenwinckel et al 2020;Cernile et al 2021;Reese et al 2021;Domingo-Fernández et al 2021;Zhang et al 2021;Ostaszewski et al 2021;Chen et al 2021), oncology research (Zhu et al 2023;Zhao et al 2023;Jha et al), and gene-disease associations (Choi and Lee 2021;Alves et al 2021) among many others. The unifying intent in these projects is the meaningful The data loading pipeline contains four major steps (Figure 2): (1) Data selection and modeling, (2) Cleaning and preparing data, (3) Executing the UBKG "OWLNETS" pythons scripts and (4) Import using the Neo4j Bulk Import tool.…”
Section: Introductionmentioning
confidence: 99%
“…Knowledge graphs have been used for many applications, including drug discovery and repurposing, target detection, and prediction (Alshahrani and Hoehndorf 2018;Moon et al 2021;Zheng et al 2021;Alves et al 2021). Other applications include integration and analysis of heterogeneous COVID-19 data (Steenwinckel et al 2020;Cernile et al 2021;Reese et al 2021;Domingo-Fernández et al 2021;Zhang et al 2021;Ostaszewski et al 2021;Chen et al 2021), oncology research (Zhu et al 2023;Zhao et al 2023;Jha et al), and gene-disease associations (Choi and Lee 2021;Alves et al 2021) among many others. The unifying intent in these projects is the meaningful The data loading pipeline contains four major steps (Figure 2): (1) Data selection and modeling, (2) Cleaning and preparing data, (3) Executing the UBKG "OWLNETS" pythons scripts and (4) Import using the Neo4j Bulk Import tool.…”
Section: Introductionmentioning
confidence: 99%
“…We implement the COVID-19 KG as a case study as a COVID-19 KG could potentially play an important role for doctors, policymakers, epidemiologists and other domain experts currently trying to gain deeper insight into the crisis (Kejriwal, 2020). Even though several COVID-19 KGs have been constructed based on different resources (Wang et al , 2021a; Wise et al , 2020; Domingo-Fernández et al , 2021; Chen et al , 2020; Chatterjee et al , 2021), each of them is developed with a different purpose. For example, the COVID-19 literature KGs built by Wise et al (2020), Wang et al (2021a), Domingo-Fernández et al (2021) and Chen et al (2020) were applied for literature search, drug repurposing, pathophysiology discovery and multipurpose, respectively.…”
Section: Introductionmentioning
confidence: 99%
“…Even though several COVID-19 KGs have been constructed based on different resources (Wang et al , 2021a; Wise et al , 2020; Domingo-Fernández et al , 2021; Chen et al , 2020; Chatterjee et al , 2021), each of them is developed with a different purpose. For example, the COVID-19 literature KGs built by Wise et al (2020), Wang et al (2021a), Domingo-Fernández et al (2021) and Chen et al (2020) were applied for literature search, drug repurposing, pathophysiology discovery and multipurpose, respectively. In addition, some of them contained a limited number of entities and relations (Domingo-Fernández et al , 2021; Chen et al , 2020), others were not rigorously validated (Wang et al , 2021a; Chen et al , 2020), raising a data-quality issue.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, there are only 10 entity types and 9484 facts within this KG. Coronavirus Knowledge Graph [ 22 ] only has 27 relation types. The CovidGraph project [ 23 ] built a COVID-19 graph that stores publications, case statistics, and molecular data in a Neo4j database, which enables exploring the underlying knowledge for finding specific genes, authors, articles, patents, proteins, existing treatments, and medications relevant to the entire family of coronaviruses.…”
Section: Introductionmentioning
confidence: 99%