the Coronavirus Infectious Disease Ontology (CIDO) is a community-based ontology that supports coronavirus disease knowledge and data standardization, integration, sharing, and analysis. O ntologies, as the term is used in informatics, are structured vocabularies comprised of human-and computer-interpretable terms and relations that represent entities and relationships. Within informatics fields, ontologies play an important role in knowledge and data standardization, representation, integration, sharing and analysis. They have also become a foundation of artificial intelligence (AI) research. In what follows, we outline the Coronavirus Infectious Disease Ontology (CIDO), which covers multiple areas in the domain of coronavirus diseases, including etiology, transmission, epidemiology, pathogenesis, diagnosis, prevention, and treatment. We emphasize CIDO development relevant to COVID-19. Human coronaviruses have given rise to a series of major crises in global public health. Severe acute respiratory syndrome (SARS) emerged in China in November 2002, lasted for eight months and resulted in 8,098 confirmed human cases in 29 countries with 774 deaths (case-fatality rate: 9.6%) 1. Approximately ten years later in June 2012, the Middle East Respiratory Syndrome (MERS), another highly pathogenic coronavirus disease, was identified in Saudi Arabia. The MERS outbreak has caused 2,260 cases in 27 countries and 803 deaths (35.5%) 2. More recently, the World Health Organization (WHO) declared the Coronavirus Disease 2019 (COVID-19) outbreak as a pandemic on March 11, 2020, when there were 118,326 confirmed cases and 4,292 deaths. As of May 13, there have been over 4.4 million confirmed cases and over 295,000 deaths globally. Unfortunately, we still do not have available effective drugs and vaccines against these highly pathological coronaviruses. Extensive studies have been conducted on coronaviruses, the results of many of which exist in publicly available data repositories such as GEO 3. Publications concerning COVID-19 have exploded in recent months, and new clinical trials have been and are being conducted to develop drugs and vaccines against COVID-19, 1,430 of which have been registered in ClinicalTrials.
Coronavirus-infected diseases have posed great threats to human health. In past years, highly infectious coronavirus-induced diseases, including COVID-19, SARS, and MERS, have resulted in world-wide severe infections. Our literature annotations identified 72 chemical drugs and 27 antibodies effective against at least one human coronavirus infection in vitro or in vivo. Many of these drugs inhibit viral entry to cells and viral replication inside cells or modulate host immune responses. Many antimicrobial drugs, including antimalarial (e.g., chloroquine and mefloquine) and antifungal (e.g., terconazole and rapamycin) drugs as well as antibiotics (e.g., teicoplanin and azithromycin) were associated with anti-coronavirus activity. A few drugs, including remdesivir, chloroquine phosphate, favipiravir, and tocilizumab, have already been reported to be effective in treating COVID-19. After mapping our identified drugs to three ontologies ChEBI, NDF-RT, and DrON, many features such as roles and mechanisms of action (MoAs) of these drugs were identified and categorized. For example, out of 35 drugs with MoA annotations in NDF-RT, 34 have MoAs of different types of inhibitors and antagonists. Two clustering analyses, one based on ChEBI-based semantic similarity, the other based on drug chemical similarity, were performed to cluster over 60 drugs to new categories. Moreover, PCA analysis of anti-coronavirus drugs found differences in physicochemical properties between those inhibiting viral entry and viral replication. A total of 137 host genes were identified as the targets of 47 anti-coronavirus drugs, resulting in a network of 370 interactions among these drugs and targets. Chlorpromazine, dasatinib, and anisomycin are the hubs of the drug-target network with the highest number of connected target proteins. Many enriched pathways such as calcium signaling and neuroactive ligand-receptor interaction pathways were identified. These findings may be used to facilitate drug repurposing against COVID-19.
Our systematic literature collection and annotation identified 106 chemical drugs and 31 antibodies effective against the infection of at least one human coronavirus (including SARS-CoV, SAR-CoV-2, and MERS-CoV) in vitro or in vivo in an experimental or clinical setting. A total of 163 drug protein targets were identified, and 125 biological processes involving the drug targets were significantly enriched based on a Gene Ontology (GO) enrichment analysis. The Coronavirus Infectious Disease Ontology (CIDO) was used as an ontological platform to represent the anti-coronaviral drugs, chemical compounds, drug targets, biological processes, viruses, and the relations among these entities. In addition to new term generation, CIDO also adopted various terms from existing ontologies and developed new relations and axioms to semantically represent our annotated knowledge. The CIDO knowledgebase was systematically analyzed for scientific insights. To support rational drug design, a “Host-coronavirus interaction (HCI) checkpoint cocktail” strategy was proposed to interrupt the important checkpoints in the dynamic HCI network, and ontologies would greatly support the design process with interoperable knowledge representation and reasoning.
Background The current COVID-19 pandemic and the previous SARS/MERS outbreaks of 2003 and 2012 have resulted in a series of major global public health crises. We argue that in the interest of developing effective and safe vaccines and drugs and to better understand coronaviruses and associated disease mechenisms it is necessary to integrate the large and exponentially growing body of heterogeneous coronavirus data. Ontologies play an important role in standard-based knowledge and data representation, integration, sharing, and analysis. Accordingly, we initiated the development of the community-based Coronavirus Infectious Disease Ontology (CIDO) in early 2020. Results As an Open Biomedical Ontology (OBO) library ontology, CIDO is open source and interoperable with other existing OBO ontologies. CIDO is aligned with the Basic Formal Ontology and Viral Infectious Disease Ontology. CIDO has imported terms from over 30 OBO ontologies. For example, CIDO imports all SARS-CoV-2 protein terms from the Protein Ontology, COVID-19-related phenotype terms from the Human Phenotype Ontology, and over 100 COVID-19 terms for vaccines (both authorized and in clinical trial) from the Vaccine Ontology. CIDO systematically represents variants of SARS-CoV-2 viruses and over 300 amino acid substitutions therein, along with over 300 diagnostic kits and methods. CIDO also describes hundreds of host-coronavirus protein-protein interactions (PPIs) and the drugs that target proteins in these PPIs. CIDO has been used to model COVID-19 related phenomena in areas such as epidemiology. The scope of CIDO was evaluated by visual analysis supported by a summarization network method. CIDO has been used in various applications such as term standardization, inference, natural language processing (NLP) and clinical data integration. We have applied the amino acid variant knowledge present in CIDO to analyze differences between SARS-CoV-2 Delta and Omicron variants. CIDO's integrative host-coronavirus PPIs and drug-target knowledge has also been used to support drug repurposing for COVID-19 treatment. Conclusion CIDO represents entities and relations in the domain of coronavirus diseases with a special focus on COVID-19. It supports shared knowledge representation, data and metadata standardization and integration, and has been used in a range of applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.