The CSS sets substantially outperformed other prognostic predictors of stage 2 CRC. They are more accurate and robust for prognostic predictions and facilitate the identification of patients with stage II disease who could gain survival benefit from fluorouracil-based adjuvant chemotherapy.
Based on a statistical model, we show that even in technical replicate tests using identical samples, it is highly likely that the selected DEG lists will be very inconsistent in the presence of small measurement variations. Therefore, the apparently low reproducibility of DEG detection from current technical replicate tests does not indicate low quality of microarray technology. We also demonstrate that heterogeneous biological variations existing in real cancer data will further reduce the overall reproducibility of DEG detection. Nevertheless, in small subsamples from both simulated and real data, the actual false discovery rate (FDR) for each DEG list tends to be low, suggesting that each separately determined list may comprise mostly true DEGs. Rather than simply counting the overlaps of the discovery lists from different studies for a complex disease, novel metrics are needed for evaluating the reproducibility of discoveries characterized with correlated molecular changes. Supplementaty information: Supplementary data are available at Bioinformatics online.
Motivation: According to current consistency metrics such as percentage of overlapping genes (POG), lists of differentially expressed genes (DEGs) detected from different microarray studies for a complex disease are often highly inconsistent. This irreproducibility problem also exists in other high-throughput post-genomic areas such as proteomics and metabolism. A complex disease is often characterized with many coordinated molecular changes, which should be considered when evaluating the reproducibility of discovery lists from different studies.Results: We proposed metrics percentage of overlapping genes-related (POGR) and normalized POGR (nPOGR) to evaluate the consistency between two DEG lists for a complex disease, considering correlated molecular changes rather than only counting gene overlaps between the lists. Based on microarray datasets of three diseases, we showed that though the POG scores for DEG lists from different studies for each disease are extremely low, the POGR and nPOGR scores can be rather high, suggesting that the apparently inconsistent DEG lists may be highly reproducible in the sense that they are actually significantly correlated. Observing different discovery results for a disease by the POGR and nPOGR scores will obviously reduce the uncertainty of the microarray studies. The proposed metrics could also be applicable in many other high-throughput post-genomic areas.Contact: guoz@ems.hrbmu.edu.cnSupplementary information: Supplementary data are available at Bioinformatics online.
Background: The chromatin regulator BRPF1 recognizes different epigenetic marks and activates multiple histone acetyltransferases, but little is known about its biological functions in mammals. Results: Forebrain-specific inactivation of the mouse gene causes neocortical abnormalities and partial agenesis of the corpus callosum. Conclusion: Mammalian BRPF1 is important for forebrain development. Significance: This study links a unique chromatin regulator to mammalian brain development.
In high-throughput studies of diseases, terms enriched with disease-related genes based on Gene Ontology (GO) are routinely found. However, most current algorithms used to find significant GO terms cannot handle the redundancy that results from the dependencies of GO terms. Simply based on some numerical considerations, current algorithms developed for reducing this redundancy may produce results that do not account for biologically interesting cases. In this article, we present several rules used to design a tool called GO-function for extracting biologically relevant terms from statistically significant GO terms for a disease. Using one gene expression profile for colorectal cancer, we compared GO-function with four algorithms designed to treat redundancy. Then, we validated results obtained in this data set by GO-function using another data set for colorectal cancer. Our analysis showed that GO-function can identify disease-related terms that are more statistically and biologically meaningful than those found by the other four algorithms.
Lysine acetylation has recently emerged as an important post-translational modification in diverse organisms, but relatively little is known about its roles in mammalian development and stem cells. Bromodomain- and PHD finger-containing protein 1 (BRPF1) is a multidomain histone binder and a master activator of three lysine acetyltransferases, MOZ, MORF and HBO1, which are also known as KAT6A, KAT6B and KAT7, respectively. While the MOZ and MORF genes are rearranged in leukemia, the MORF gene is also mutated in prostate and other cancers and in four genetic disorders with intellectual disability. Here we show that forebrain-specific inactivation of the mouse Brpf1 gene causes hypoplasia in the dentate gyrus, including underdevelopment of the suprapyramidal blade and complete loss of the infrapyramidal blade. We trace the developmental origin to compromised Sox2+ neural stem cells and Tbr2+ intermediate neuronal progenitors. We further demonstrate that Brpf1 loss deregulates neuronal migration, cell cycle progression and transcriptional control, thereby causing abnormal morphogenesis of the hippocampus. These results link histone binding and acetylation control to hippocampus development and identify an important epigenetic regulator for patterning the dentate gyrus, a brain structure critical for learning, memory and adult neurogenesis.
Background: Deletion of the mouse Brpf1 gene causes embryonic lethality, but the resulting defects await characterization. Results: The vasculature, neural tube, and cell proliferation are abnormal in the mutant. Conclusion: Brpf1 is important for embryo development and cell cycle control. Significance: This study identifies a critical role of a multivalent chromatin regulator in embryogenesis and cell proliferation.
Recent tumor genome sequencing confirmed that one tumor often consists of multiple cell subpopulations (clones) which bear different, but related, genetic profiles such as mutation and copy number variation profiles. Thus far, one tumor has been viewed as a whole entity in cancer functional studies. With the advances of genome sequencing and computational analysis, we are able to quantify and computationally dissect clones from tumors, and then conduct clone-based analysis. Emerging technologies such as single-cell genome sequencing and RNA-Seq could profile tumor clones. Thus, we should reconsider how to conduct cancer systems biology studies in the genome sequencing era. We will outline new directions for conducting cancer systems biology by considering that genome sequencing technology can be used for dissecting, quantifying and genetically characterizing clones from tumors. Topics discussed in Part 1 of this review include computationally quantifying of tumor subpopulations; clone-based network modeling, cancer hallmark-based networks and their high-order rewiring principles and the principles of cell survival networks of fast-growing clones.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.