This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
Reactome, located at http://www.reactome.org, is a curated, peer-reviewed resource of human biological processes. Given the genetic makeup of an organism, the complete set of possible reactions constitutes its reactome. The basic unit of the Reactome database is a reaction; reactions are then grouped into causal chains to form pathways. The Reactome data model allows us to represent many diverse processes in the human system, including the pathways of intermediary metabolism, regulatory pathways, and signal transduction, and high-level processes, such as the cell cycle. Reactome provides a qualitative framework, on which quantitative data can be superimposed. Tools have been developed to facilitate custom data entry and annotation by expert biologists, and to allow visualization and exploration of the finished dataset as an interactive process map. Although our primary curational domain is pathways from Homo sapiens, we regularly create electronic projections of human pathways onto other organisms via putative orthologs, thus making Reactome relevant to model organism research communities. The database is publicly available under open source terms, which allows both its content and its software infrastructure to be freely used and redistributed.
Reactome (http://www.reactome.org) is an expert-authored, peer-reviewed knowledgebase of human reactions and pathways that functions as a data mining resource and electronic textbook. Its current release includes 2975 human proteins, 2907 reactions and 4455 literature citations. A new entity-level pathway viewer and improved search and data mining tools facilitate searching and visualizing pathway data and the analysis of user-supplied high-throughput data sets. Reactome has increased its utility to the model organism communities with improved orthology prediction methods allowing pathway inference for 22 species and through collaborations to create manually curated Reactome pathway datasets for species including Arabidopsis, Oryza sativa (rice), Drosophila and Gallus gallus (chicken). Reactome's data content and software can all be freely used and redistributed under open source terms.
Reactome, an online curated resource for human pathway data, can be used to infer equivalent reactions in non-human species and as a tool to aid in the interpretation of microarrays and other high-throughput data sets.
Blood cells derive from hematopoietic stem cells through stepwise fating events. To characterize gene expression programs driving lineage choice we sequenced RNA from eight primary human hematopoietic progenitor populations representing the major myeloid commitment stages and the main lymphoid stage. We identify extensive cell-type-specific expression changes: 6,711 genes and 10,724 transcripts, enriched in non-protein-coding elements at early stages of differentiation. In addition, we discovered 7,881 novel splice junctions and 2,301 differentially used alternative splicing events, enriched in genes involved in regulatory processes. We demonstrate experimentally cell-specific isoform usage, identifying NFIB as a regulator of maturation of the megakaryocyte, the platelet precursor. Our data highlight the complexity of fating events in closely related progenitor populations, the understanding of which is essential for the advancement of transplantation and regenerative medicine.
Hematopoiesis is a carefully controlled process that is regulated by complex networks of transcription factors that are, in part, controlled by signals resulting from ligand binding to cell-surface receptors. To further understand hematopoiesis, we have compared gene expression profiles of human erythroblasts, megakaryocytes, B cells, cytotoxic and helper T cells, natural killer cells, granulocytes, and monocytes using whole genome microarrays. A bioinformatics analysis of these data was performed focusing on transcription factors, immunoglobulin superfamily members, and lineage-specific transcripts. We observed that the number of lineage-specific genes varies by 2 orders of magnitude, ranging from 5 for cytotoxic T cells to 878 for granulocytes. In addition, we have identified novel coexpression patterns for key transcription factors involved in hematopoiesis (eg, GATA3-GFI1 and GATA2-KLF1). This study represents the most comprehensive analysis of gene expression in hematopoietic cells to date and has identified genes that play key roles in lineage commitment and cell function. The data, which are freely accessible, will be invaluable for future studies on hematopoiesis and the role of specific genes and will also aid the understanding of the recent genome-wide association studies. (Blood. 2009;113:e1-e9)
Introduction
The hematopoietic system represents one of the best-studied cellular differentiation processes in mammals. The differentiation of the hematopoietic stem cell (HSC) into the blood cell lineages, which is depicted as a stepwise process, generates diverse types of cells that perform many different functions. Historical observations of the blood, made in the late 18th century using some of the first microscopes, revealed that blood is composed of a heterogeneous population of cells that are distinct in number, morphology, and function.
Since these early studies, the application of both technologic and methodologic advances to the investigation of blood has led to an ever-increasing understanding of the nature and function of the different types of blood cells. For example, the use of monoclonal antibodies (mAbs) and the designation of the cluster of differentiation (CD) markers, of which there are now more than 300, 1 allows hematologists to assign detailed phenotypes to malignant blood cells, which form the basis of decisions on therapeutic intervention.
The value of the current understanding of the hematopoietic system to patient care is perhaps best illustrated in the field of malignancy where gene and protein expression profiles permit rapid and routine patient stratification. It is now possible to stratify patients with leukemia and lymphoma with unprecedented accuracy using gene expression profiles. Signature gene expression profiles may be used for diagnosis and predicting disease prognosis. In addition to studies in patients, gene expression profiles are available for a wide range of healthy tissue types. However, many of these resources, although broad in tissue coverage, are limited in the nu...
Summary
SPRY and B30.2 are homologous domains which can be identified in 11 protein families encoded in the human genome. These include cell surface receptors of the immunoglobulin super‐family (BTNs), negative regulators of the JAK/STAT pathway (SOCS‐box SSB1–4) and proteins encoded by the numerous TRIM genes. Collectively, proteins containing SPRY and B30.2 domains cover a wide range of functions, including regulation of cytokine signalling (SOCS), RNA metabolism (DDX1, hnRNPs), intracellular calcium release (RyR receptors), immunity to retroviruses (TRIM5alpha) as well as regulatory and developmental processes (HERC1, Ash2L). In order to clarify the evolutionary relationship between the two domains, we compiled a curated database of SPRY and B30.2‐domain sequences. We show that while SPRY domains are evolutionarily ancient, B30.2 domains, found in BTN and TRIM proteins, are a more recent evolutionary adaptation, comprising the combination of SPRY with an additional domain, PRY. The combination of SPRY and PRY to produce B30.2 domains may have been selected and maintained as a component of immune defence.