Coronavirus disease 2019 (COVID-19) is known to cause multi-organ dysfunction 1 – 3 during acute infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), with some patients experiencing prolonged symptoms, termed post-acute sequelae of SARS-CoV-2 (refs. 4 , 5 ). However, the burden of infection outside the respiratory tract and time to viral clearance are not well characterized, particularly in the brain 3 , 6 – 14 . Here we carried out complete autopsies on 44 patients who died with COVID-19, with extensive sampling of the central nervous system in 11 of these patients, to map and quantify the distribution, replication and cell-type specificity of SARS-CoV-2 across the human body, including the brain, from acute infection to more than seven months following symptom onset. We show that SARS-CoV-2 is widely distributed, predominantly among patients who died with severe COVID-19, and that virus replication is present in multiple respiratory and non-respiratory tissues, including the brain, early in infection. Further, we detected persistent SARS-CoV-2 RNA in multiple anatomic sites, including throughout the brain, as late as 230 days following symptom onset in one case. Despite extensive distribution of SARS-CoV-2 RNA throughout the body, we observed little evidence of inflammation or direct viral cytopathology outside the respiratory tract. Our data indicate that in some patients SARS-CoV-2 can cause systemic infection and persist in the body for months.
The study of biological pathways is key to a large number of systems analyses. However, many relevant tools consider a limited number of pathway sources, missing out on many genes and gene-to-gene connections. Simply pooling several pathways sources would result in redundancy and the lack of systematic pathway interrelations. To address this, we exercised a combination of hierarchical clustering and nearest neighbor graph representation, with judiciously selected cutoff values, thereby consolidating 3215 human pathways from 12 sources into a set of 1073 SuperPaths. Our unification algorithm finds a balance between reducing redundancy and optimizing the level of pathway-related informativeness for individual genes. We show a substantial enhancement of the SuperPaths’ capacity to infer gene-to-gene relationships when compared with individual pathway sources, separately or taken together. Further, we demonstrate that the chosen 12 sources entail nearly exhaustive gene coverage. The computed SuperPaths are presented in a new online database, PathCards, showing each SuperPath, its constituent network of pathways, and its contained genes. This provides researchers with a rich, searchable systems analysis resource.Database URL: http://pathcards.genecards.org/
Alu exonization, which is an evolutionary pathway that creates primate-specific transcriptomic diversity, is a powerful tool for studying alternative-splicing regulation. Through bioinformatic analyses combined with experimental methodology, we identified the mutational changes needed to create functional 5' splice sites in Alu. We revealed a complex mechanism by which the sequence composition of the 5' splice site and its base pairing with the small nuclear RNA U1 govern alternative splicing. We show that in Alu-derived GC introns the strength of the base pairing between U1 snRNA and the 5' splice site controls the skipping/inclusion ratio of alternative splicing. Based on these findings, we identified 7810 Alus within the human genome that are prone to exonization. Mutations in these Alus may cause genetic disorders or contribute to human-specific protein diversity.
Comprehensive disease classification, integration and annotation are crucial for biomedical discovery. At present, disease compilation is incomplete, heterogeneous and often lacking systematic inquiry mechanisms. We introduce MalaCards, an integrated database of human maladies and their annotations, modeled on the architecture and strategy of the GeneCards database of human genes. MalaCards mines and merges 44 data sources to generate a computerized card for each of 16 919 human diseases. Each MalaCard contains disease-specific prioritized annotations, as well as inter-disease connections, empowered by the GeneCards relational database, its searches and GeneDecks set analyses. First, we generate a disease list from 15 ranked sources, using disease-name unification heuristics. Next, we use four schemes to populate MalaCards sections: (i) directly interrogating disease resources, to establish integrated disease names, synonyms, summaries, drugs/therapeutics, clinical features, genetic tests and anatomical context; (ii) searching GeneCards for related publications, and for associated genes with corresponding relevance scores; (iii) analyzing disease-associated gene sets in GeneDecks to yield affiliated pathways, phenotypes, compounds and GO terms, sorted by a composite relevance score and presented with GeneCards links; and (iv) searching within MalaCards itself, e.g. for additional related diseases and anatomical context. The latter forms the basis for the construction of a disease network, based on shared MalaCards annotations, embodying associations based on etiology, clinical features and clinical conditions. This broadly disposed network has a power-law degree distribution, suggesting that this might be an inherent property of such networks. Work in progress includes hierarchical malady classification, ontological mapping and disease set analyses, striving to make MalaCards an even more effective tool for biomedical research.Database URL: http://www.malacards.org/
Since 1998, the bioinformatics, systems biology, genomics and medical communities have enjoyed a synergistic relationship with the GeneCards database of human genes (http://www.genecards.org). This human gene compendium was created to help to introduce order into the increasing chaos of information flow. As a consequence of viewing details and deep links related to specific genes, users have often requested enhanced capabilities, such that, over time, GeneCards has blossomed into a suite of tools (including GeneDecks, GeneALaCart, GeneLoc, GeneNote and GeneAnnot) for a variety of analyses of both single human genes and sets thereof. In this paper, we focus on inhouse and external research activities which have been enabled, enhanced, complemented and, in some cases, motivated by GeneCards. In turn, such interactions have often inspired and propelled improvements in GeneCards. We describe here the evolution and architecture of this project, including examples of synergistic applications in diverse areas such as synthetic lethality in cancer, the annotation of genetic variations in disease, omics integration in a systems biology approach to kidney disease, and bioinformatics tools.
BackgroundNext generation sequencing (NGS) provides a key technology for deciphering the genetic underpinnings of human diseases. Typical NGS analyses of a patient depict tens of thousands non-reference coding variants, but only one or very few are expected to be significant for the relevant disorder. In a filtering stage, one employs family segregation, rarity in the population, predicted protein impact and evolutionary conservation as a means for shortening the variation list. However, narrowing down further towards culprit disease genes usually entails laborious seeking of gene-phenotype relationships, consulting numerous separate databases. Thus, a major challenge is to transition from the few hundred shortlisted genes to the most viable disease-causing candidates.ResultsWe describe a novel tool, VarElect (http://ve.genecards.org), a comprehensive phenotype-dependent variant/gene prioritizer, based on the widely-used GeneCards, which helps rapidly identify causal mutations with extensive evidence. The GeneCards suite offers an effective and speedy alternative, whereby >120 gene-centric automatically-mined data sources are jointly available for the task. VarElect cashes on this wealth of information, as well as on GeneCards’ powerful free-text Boolean search and scoring capabilities, proficiently matching variant-containing genes to submitted disease/symptom keywords. The tool also leverages the rich disease and pathway information of MalaCards, the human disease database, and PathCards, the unified pathway (SuperPaths) database, both within the GeneCards Suite. The VarElect algorithm infers direct as well as indirect links between genes and phenotypes, the latter benefitting from GeneCards’ diverse gene-to-gene data links in GenesLikeMe. Finally, our tool offers an extensive gene-phenotype evidence portrayal (“MiniCards”) and hyperlinks to the parent databases.ConclusionsWe demonstrate that VarElect compares favorably with several often-used NGS phenotyping tools, thus providing a robust facility for ranking genes, pointing out their likelihood to be related to a patient’s disease. VarElect’s capacity to automatically process numerous NGS cases, either in stand-alone format or in VCF-analyzer mode (TGex and VarAnnot), is indispensable for emerging clinical projects that involve thousands of whole exome/genome NGS analyses.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-2722-2) contains supplementary material, which is available to authorized users.
The RecQ DNA helicase WRN is a synthetic lethal target for cancers with microsatellite instability (MSI), a form of genetic hypermutability arising from impaired mismatch repair 1-4 . WRN depletion induces widespread DNA double strand breaks (DSBs) in MSI cells, leading to cell cycle arrest and/or apoptosis. However, the mechanism by which WRN protects MSI cancers from DSBs remains unclear. Here, we demonstrate that TAdinucleotide repeats are highly unstable in MSI cells and exhibit surprisingly large-scale expansions, distinct from previously described insertion/deletion mutations of a few nucleotides 5 . We show that expanded TA repeats form non-B DNA secondary structures that stall replication forks, activate the ATR checkpoint kinase, and necessitate unwinding by the WRN helicase. In the absence of WRN, the expanded TA-dinucleotide repeats are susceptible to MUS81 nuclease cleavage, leading to massive chromosome shattering. Thus, our study uncovers a distinct biomarker within MSI tumors that underlies the synthetic lethal dependence on WRN, thereby supporting the development of WRN-based therapeutics.
Freely available for use at http://gloome.tau.ac.il/.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.