Although accumulating evidence has provided insight into the various functions of long-non-coding RNAs (lncRNAs), the exact functions of the majority of such transcripts are still unknown. Here, we report the first computational annotation of lncRNA functions based on public microarray expression profiles. A coding–non-coding gene co-expression (CNC) network was constructed from re-annotated Affymetrix Mouse Genome Array data. Probable functions for altogether 340 lncRNAs were predicted based on topological or other network characteristics, such as module sharing, association with network hubs and combinations of co-expression and genomic adjacency. The functions annotated to the lncRNAs mainly involve organ or tissue development (e.g. neuron, eye and muscle development), cellular transport (e.g. neuronal transport and sodium ion, acid or lipid transport) or metabolic processes (e.g. involving macromolecules, phosphocreatine and tyrosine).
Facilitated by the rapid progress of high-throughput sequencing technology, a large number of long noncoding RNAs (lncRNAs) have been identified in mammalian transcriptomes over the past few years. LncRNAs have been shown to play key roles in various biological processes such as imprinting control, circuitry controlling pluripotency and differentiation, immune responses and chromosome dynamics. Notably, a growing number of lncRNAs have been implicated in disease etiology. With the increasing number of published lncRNA studies, the experimental data on lncRNAs (e.g. expression profiles, molecular features and biological functions) have accumulated rapidly. In order to enable a systematic compilation and integration of this information, we have updated the NONCODE database (http://www.noncode.org) to version 3.0 to include the first integrated collection of expression and functional lncRNA data obtained from re-annotated microarray studies in a single database. NONCODE has a user-friendly interface with a variety of search or browse options, a local Genome Browser for visualization and a BLAST server for sequence-alignment search. In addition, NONCODE provides a platform for the ongoing collation of ncRNAs reported in the literature. All data in NONCODE are open to users, and can be downloaded through the website or obtained through the SOAP API and DAS services.
NONCODE is an integrated knowledge database dedicated to non-coding RNAs (ncRNAs), that is to say, RNAs that function without being translated into proteins. All ncRNAs in NONCODE were filtered automatically from literature and GenBank, and were later manually curated. The distinctive features of NONCODE are as follows: (i) the ncRNAs in NONCODE include almost all the types of ncRNAs, except transfer RNAs and ribosomal RNAs. (ii) All ncRNA sequences and their related information (e.g. function, cellular role, cellular location, chromosomal information, etc.) in NONCODE have been confirmed manually by consulting relevant literature: more than 80% of the entries are based on experimental data. (iii) Based on the cellular process and function, which a given ncRNA is involved in, we introduced a novel classification system, labeled process function class, to integrate existing classification systems. (iv) In addition, some 1100 ncRNAs have been grouped into nine other classes according to whether they are specific to gender or tissue or associated with tumors and diseases, etc. (v) NONCODE provides a user-friendly interface, a visualization platform and a convenient search option, allowing efficient recovery of sequence, regulatory elements in the flanking sequences, secondary structure, related publications and other information. The first release of NONCODE (v1.0) contains 5339 non-redundant sequences from 861 organisms, including eukaryotes, eubacteria, archaebacteria, virus and viroids. Access is free for all users through a web interface at http://noncode.bioinfo.org.cn.
Recent evidence points to considerable transcription occurring in non-protein-coding regions of eukaryote genomes. However, their lack of conservation and demonstrated function have created controversy over whether these transcripts are functional. Applying a novel cloning strategy, we have cloned 100 novel and 61 known or predicted Caenorhabditis elegans full-length ncRNAs. Studying the genomic environment and transcriptional characteristics have shown that two-thirds of all ncRNAs, including many intronic snoRNAs, are independently transcribed under the control of ncRNA-specific upstream promoter elements. Furthermore, the transcription levels of at least 60% of the ncRNAs vary with developmental stages. We identified two new classes of ncRNAs, stem-bulge RNAs (sbRNAs) and snRNA-like RNAs (snlRNAs), both featuring distinct internal motifs, secondary structures, upstream elements, and high and developmentally variable expression. Most of the novel ncRNAs are conserved in Caenorhabditis briggsae, but only one homolog was found outside the nematodes. Preliminary estimates indicate that the C. elegans transcriptome contains ∼2700 small non-coding RNAs, potentially acting as regulatory elements in nematode development.
BackgroundMicroRNAs (miRNAs) are small noncoding RNAs about 22 nt long that negatively regulate gene expression at the post-transcriptional level. Their key effects on various biological processes, e.g., embryonic development, cell division, differentiation and apoptosis, are widely recognized. Evidence suggests that aberrant expression of miRNAs may contribute to many types of human diseases, including cancer. Here we present a database of differentially expressed miRNAs in human cancers (dbDEMC), to explore aberrantly expressed miRNAs among different cancers.ResultsWe collected the miRNA expression profiles of 14 cancer types, curated from 48 microarray data sets in peer-reviewed publications. The Significance Analysis of Microarrays method was used to retrieve the miRNAs that have dramatically different expression levels in cancers when compared to normal tissues. This database provides statistical results for differentially expressed miRNAs in each data set. A total of 607 differentially expressed miRNAs (590 mature miRNAs and 17 precursor miRNAs) were obtained in the current version of dbDEMC. Furthermore, low-throughput data from the same literature were also included in the database for validation. An easy-to-use web interface was designed for users. Annotations about each miRNA can be queried through miRNA ID or miRBase accession numbers, or can be browsed by different cancer types.ConclusionsThis database is expected to be a valuable source for identification of cancer-related miRNAs, thereby helping with the improvement of classification, diagnosis and treatment of human cancers. All the information is freely available through http://159.226.118.44/dbDEMC/index.html.
Long non-coding RNAs (lncRNAs) are a heterogeneous class of RNAs that are generally defined as non-protein-coding transcripts longer than 200 nucleotides. Recently, an increasing number of studies have shown that lncRNAs can be involved in various critical biological processes, such as chromatin remodeling, gene transcription, and protein transport and trafficking. Moreover, lncRNAs are dysregulated in a number of complex human diseases, including coronary artery diseases, autoimmune diseases, neurological disorders, and various cancers, which indicates their important roles in these diseases. Here, we reviewed the current understanding of lncRNAs, including their definition and subclassification, regulatory functions, and potential roles in different types of complex human diseases.
Background: MicroRNAs (miRNAs), a growing class of small RNAs with crucial regulatory roles at the post-transcriptional level, are usually found to be clustered on chromosomes. However, with the exception of a few individual cases, so far little is known about the functional consequence of this conserved clustering of miRNA loci. In animal genomes such clusters often contain nonhomologous miRNA genes. One hypothesis to explain this heterogeneity suggests that clustered miRNAs are functionally related by virtue of co-targeting downstream pathways.
Recent interest in the non-coding transcriptome has resulted in the identification of large numbers of long non-coding RNAs (lncRNAs) in mammalian genomes, most of which have not been functionally characterized. Computational exploration of the potential functions of these lncRNAs will therefore facilitate further work in this field of research. We have developed a practical and user-friendly web interface called ncFANs (non-coding RNA Function ANnotation server), which is the first web service for functional annotation of human and mouse lncRNAs. On the basis of the re-annotated Affymetrix microarray data, ncFANs provides two alternative strategies for lncRNA functional annotation: one utilizing three aspects of a coding-non-coding gene co-expression (CNC) network, the other identifying condition-related differentially expressed lncRNAs. ncFANs introduces a highly efficient way of re-using the abundant pre-existing microarray data. The present version of ncFANs includes re-annotated CDF files for 10 human and mouse Affymetrix microarrays, and the server will be continuously updated with more re-annotated microarray platforms and lncRNA data. ncFANs is freely accessible at http://www.ebiomed.org/ncFANs/ or http://www.noncode.org/ncFANs/.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.