BackgroundSince the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results.ResultsGOrilla is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). GOrilla employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the top of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, GOrilla computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms.ConclusionGOrilla is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. GOrilla's unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. GOrilla is publicly available at:
CRISPR-Cas-mediated genome editing relies on guide RNAs that direct site-specific DNA cleavage facilitated by the Cas endonuclease. Here we report that chemical alterations to synthesized single guide RNAs (sgRNAs) enhance genome editing efficiency in human primary T cells and CD34+ hematopoietic stem and progenitor cells. Co-delivering chemically modified sgRNAs with Cas9 mRNA or protein is an efficient RNA- or ribonucleoprotein (RNP)-based delivery method for the CRISPR-Cas system, without the toxicity associated with DNA delivery. This approach is a simple and effective way to streamline the development of genome editing with the potential to accelerate a wide array of biotechnological and therapeutic applications of the CRISPR-Cas technology.
CpG island-like sequences are commonly thought to provide the sole signals for designating constitutively unmethylated regions in the genome, thus generating open chromatin domains within a sea of global repression. Using a new database obtained from comprehensive microarray analysis, we show that unmethylated regions (UMRs) seem to be formed during early embryogenesis, not as a result of CpG-ness, but rather through the recognition of specific sequence motifs closely associated with transcription start sites. This same system probably brings about the resetting of pluripotency genes during somatic cell reprogramming. The data also reveal a new class of nonpromoter UMRs that become de novo methylated in a tissue-specific manner during development, and this process may be involved in gene regulation. In short, we show that UMRs are an important aspect of genome structure that have a dynamic role in development.
Despite considerable excitement over the potential functional significance of copy-number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at approximately 1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1020 of 1153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of currently known common human CNVs is likely smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with interindividual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously uncharacterized complexity.
IntroductionFew studies have performed expression profiling of both miRNA and mRNA from the same primary breast carcinomas. In this study we present and analyze data derived from expression profiling of 799 miRNAs in 101 primary human breast tumors, along with genome-wide mRNA profiles and extensive clinical information.MethodsWe investigate the relationship between these molecular components, in terms of their correlation with each other and with clinical characteristics. We use a systems biology approach to examine the correlative relationship between miRNA and mRNAs using statistical enrichment methods.ResultsWe identify statistical significant differential expression of miRNAs between molecular intrinsic subtypes, and between samples with different levels of proliferation. Specifically, we point to miRNAs significantly associated with TP53 and ER status. We also show that several cellular processes, such as proliferation, cell adhesion and immune response, are strongly associated with certain miRNAs. We validate the role of miRNAs in regulating proliferation using high-throughput lysate-microarrays on cell lines and point to potential drivers of this process.ConclusionThis study provides a comprehensive dataset as well as methods and system-level results that jointly form a basis for further work on understanding the role of miRNA in primary breast cancer.
An improved separation of the human serum N-glycome using hydrophilic interaction chromatography technology with UPLC is described, where more than 140 N-glycans were assigned. Using this technique, serum samples from 107 healthy controls and 62 newly diagnosed breast cancer patients were profiled. The most statistically significant alterations were observed in cancer patients compared with healthy controls: an increase in sialylation, branching, and outer-arm fucosylation and a decrease in high-mannosylated and biantennary core-fucosylated glycans. In the controls and cases combined systemic features were analyzed; serum estradiol was associated with increase in digalactosylated glycans, and higher mammographic density was associated with increase in biantennary digalactosylated glycans and with decrease in trisialylated and in outer-arm fucosylated glycans. Furthermore, particular glycans were altered in some features of the breast carcinomas; bisected biantennary nonfucosylated glycans were decreased in patients with progesterone receptor positive tumors, and core-fucosylated biantennary bisected monogalactosylated glycans were decreased in patients with the TP53 mutation. Systemic features show more significant associations with the serum N-glycome than do the features of the breast carcinomas. In conclusion, the UPLC-based glycan analysis technique described here reveals highly significant differences between healthy women and breast cancer patients. Significant associations with breast carcinoma and systemic features are described.
CRISPR systems have emerged as transformative tools for altering genomes in living cells with unprecedented ease, inspiring keen interest in increasing their specificity for perfectly matched targets. We have developed a novel approach for improving specificity by incorporating chemical modifications in guide RNAs (gRNAs) at specific sites in their DNA recognition sequence (‘guide sequence’) and systematically evaluating their on-target and off-target activities in biochemical DNA cleavage assays and cell-based assays. Our results show that a chemical modification (2′-O-methyl-3′-phosphonoacetate, or ‘MP’) incorporated at select sites in the ribose-phosphate backbone of gRNAs can dramatically reduce off-target cleavage activities while maintaining high on-target performance, as demonstrated in clinically relevant genes. These findings reveal a unique method for enhancing specificity by chemically modifying the guide sequence in gRNAs. Our approach introduces a versatile tool for augmenting the performance of CRISPR systems for research, industrial and therapeutic applications.
BackgroundmicroRNAs (miRNAs) regulate target genes at the post-transcriptional level and play important roles in cancer pathogenesis and development. Variation amongst individuals is a significant confounding factor in miRNA (or other) expression studies. The true character of biologically or clinically meaningful differential expression can be obscured by inter-patient variation. In this study we aim to identify miRNAs with consistent differential expression in multiple tumor types using a novel data analysis approach.MethodsUsing microarrays we profiled the expression of more than 700 miRNAs in 28 matched tumor/normal samples from 8 different tumor types (breast, colon, liver, lung, lymphoma, ovary, prostate and testis). This set is unique in putting emphasis on minimizing tissue type and patient related variability using normal and tumor samples from the same patient. We develop scores for comparing miRNA expression in the above matched sample data based on a rigorous characterization of the distribution of order statistics over a discrete state set, including exact p-values. Specifically, we compute a Rank Consistency Score (RCoS) for every miRNA measured in our data. Our methods are also applicable in various other contexts. We compare our methods, as applied to matched samples, to paired t-test and to the Wilcoxon Signed Rank test.ResultsWe identify consistent (across the cancer types measured) differentially expressed miRNAs. 41 miRNAs are under-expressed in cancer compared to normal, at FDR (False Discovery Rate) of 0.05 and 17 are over-expressed at the same FDR level. Differentially expressed miRNAs include known oncomiRs (e.g miR-96) as well as miRNAs that were not previously universally associated with cancer. Specific examples include miR-133b and miR-486-5p, which are consistently down regulated and mir-629* which is consistently up regulated in cancer, in the context of our cohort. Data is available in GEO. Software is available at: http://bioinfo.cs.technion.ac.il/people/zohar/RCoS/
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.