The Human Phenotype Ontology (HPO) project, available at http://www.human-phenotype-ontology.org, provides a structured, comprehensive and well-defined set of 10,088 classes (terms) describing human phenotypic abnormalities and 13,326 subclass relations between the HPO classes. In addition we have developed logical definitions for 46% of all HPO classes using terms from ontologies for anatomy, cell types, function, embryology, pathology and other domains. This allows interoperability with several resources, especially those containing phenotype information on model organisms such as mouse and zebrafish. Here we describe the updated HPO database, which provides annotations of 7,278 human hereditary syndromes listed in OMIM, Orphanet and DECIPHER to classes of the HPO. Various meta-attributes such as frequency, references and negations are associated with each annotation. Several large-scale projects worldwide utilize the HPO for describing phenotype information in their datasets. We have therefore generated equivalence mappings to other phenotype vocabularies such as LDDB, Orphanet, MedDRA, UMLS and phenoDB, allowing integration of existing datasets and interoperability with multiple biomedical resources. We have created various ways to access the HPO database content using flat files, a MySQL database, and Web-based tools. All data and documentation on the HPO project can be found online.
Less than half of patients with suspected genetic disease receive a molecular diagnosis. We have therefore integrated next-generation sequencing (NGS), bioinformatics, and clinical data into an effective diagnostic workflow. We used variants in the 2741 established Mendelian disease genes [the disease-associated genome (DAG)] to develop a targeted enrichment DAG panel (7.1 Mb), which achieves a coverage of 20-fold or better for 98% of bases. Furthermore, we established a computational method [Phenotypic Interpretation of eXomes (PhenIX)] that evaluated and ranked variants based on pathogenicity and semantic similarity of patients’ phenotype described by Human Phenotype Ontology (HPO) terms to those of 3991 Mendelian diseases. In computer simulations, ranking genes based on the variant score put the true gene in first place less than 5% of the time; PhenIX placed the correct gene in first place more than 86% of the time. In a retrospective test of PhenIX on 52 patients with previously identified mutations and known diagnoses, the correct gene achieved a mean rank of 2.1. In a prospective study on 40 individuals without a diagnosis, PhenIX analysis enabled a diagnosis in 11 cases (28%, at a mean rank of 2.4). Thus, the NGS of the DAG followed by phenotype-driven bioinformatic analysis allows quick and effective differential diagnostics in medical genetics.
Hyperphosphatasia mental retardation (HPMR) syndrome is an autosomal recessive form of mental retardation with distinct facial features and elevated serum alkaline phosphatase. We performed whole-exome sequencing in three siblings of a nonconsanguineous union with HPMR and performed computational inference of regions identical by descent in all siblings to establish PIGV, encoding a member of the GPI-anchor biosynthesis pathway, as the gene mutated in HPMR. We identified homozygous or compound heterozygous mutations in PIGV in three additional families.
Genomic duplications encompassing BHLHA9 are associated with SHFLD and non-Mendelian inheritance characterised by a high degree of non-penetrance with sex bias. Knock-down of bhlha9 in zebrafish causes severe reduction defects of the pectoral fin, indicating a role for this gene in limb development.
Laurin-Sandrow syndrome (LSS) is a rare autosomal dominant disorder characterized by polysyndactyly of hands and/or feet, mirror image duplication of the feet, nasal defects, and loss of identity between fibula and tibia. The genetic basis of LSS is currently unknown. LSS shows phenotypic overlap with Haas-type polysyndactyly (HTS) regarding the digital phenotype. Here we report on five unrelated families with overlapping microduplications encompassing the Sonic hedgehog (SHH) limb enhancer ZPA regulatory sequence (ZRS) on chromosome 7q36. Clinically, the patients show polysyndactyly phenotypes and various types of lower limb malformations ranging from syndactyly to mirror image polydactyly with duplications of the fibulae. We show that larger duplications of the ZRS region (>80 kb) are associated with HTS, whereas smaller duplications (<80 kb) result in the LSS phenotype. On the basis of our data, the latter can be clearly distinguished from HTS by the presence of mirror image polysyndactyly of the feet with duplication of the fibula. Our results expand the clinical phenotype of the ZRS-associated syndromes and suggest that smaller duplications (<80 kb) are associated with a more severe phenotype. In addition, we show that these small microduplications within the ZRS region are the underlying genetic cause of Laurin-Sandrow syndrome.
Mouse phenotype data represents a valuable resource for the identification of disease-associated genes, especially where the molecular basis is unknown and there is no clue to the candidate gene’s function, pathway involvement or expression pattern. However, until recently these data have not been systematically used due to difficulties in mapping between clinical features observed in humans and mouse phenotype annotations. Here, we describe a semantic approach to solve this problem and demonstrate highly significant recall of known disease-gene associations and orthology relationships. A web application (MouseFinder; www.mousemodels.org) has been developed to allow users to search the results of our whole-phenome comparison of human and mouse. We demonstrate its use in identifying ARTN as a strong candidate gene within the 1p34.1-p32 mapped locus for a hereditary form of ptosis.
Growth and Differentiation Factor 5 (GDF5) is a secreted growth factor that belongs to the Bone Morphogenetic Protein (BMP) family and plays a pivotal role during limb development. GDF5 is a susceptibility gene for osteoarthritis (OA) and mutations in GDF5 are associated with a wide variety of skeletal malformations ranging from complex syndromes such as acromesomelic chondrodysplasias to isolated forms of brachydactylies or multiple synostoses syndrome 2 (SYNS2). Here, we report on a family with an autosomal dominant inherited combination of SYNS2 and additional brachydactyly type A1 (BDA1) caused by a single point mutation in GDF5 (p.W414R). Functional studies, including chondrogenesis assays with primary mesenchymal cells, luciferase reporter gene assays and Surface Plasmon Resonance analysis, of the GDF5W414R variant in comparison to other GDF5 mutations associated with isolated BDA1 (p.R399C) or SYNS2 (p.E491K) revealed a dual pathomechanism characterized by a gain- and loss-of-function at the same time. On the one hand insensitivity to the main GDF5 antagonist NOGGIN (NOG) leads to a GDF5 gain of function and subsequent SYNS2 phenotype. Whereas on the other hand, a reduced signaling activity, specifically via the BMP receptor type IA (BMPR1A), is likely responsible for the BDA1 phenotype. These results demonstrate that one mutation in the overlapping interface of antagonist and receptor binding site in GDF5 can lead to a GDF5 variant with pathophysiological relevance for both, BDA1 and SYNS2 development. Consequently, our study assembles another part of the molecular puzzle of how loss and gain of function mutations in GDF5 affect bone development in hands and feet resulting in specific types of brachydactyly and SYNS2. These novel insights into the biology of GDF5 might also provide further clues on the pathophysiology of OA.
Concept recognition tools rely on the availability of textual corpora to assess their performance and enable the identification of areas for improvement. Typically, corpora are developed for specific purposes, such as gene name recognition. Gene and protein name identification are longstanding goals of biomedical text mining, and therefore a number of different corpora exist. However, phenotypes only recently became an entity of interest for specialized concept recognition systems, and hardly any annotated text is available for performance testing and training. Here, we present a unique corpus, capturing text spans from 228 abstracts manually annotated with Human Phenotype Ontology (HPO) concepts and harmonized by three curators, which can be used as a reference standard for free text annotation of human phenotypes. Furthermore, we developed a test suite for standardized concept recognition error analysis, incorporating 32 different types of test cases corresponding to 2164 HPO concepts. Finally, three established phenotype concept recognizers (NCBO Annotator, OBO Annotator and Bio-LarK CR) were comprehensively evaluated, and results are reported against both the text corpus and the test suites. The gold standard and test suites corpora are available from http://bio-lark.org/hpo_res.html.Database URL: http://bio-lark.org/hpo_res.html
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.