The publication of a draft of the human genome and of large collections of transcribed sequences has made it possible to study the complex relationship between the transcriptome and the genome. In the work presented here, we have focused on mapping mRNA 3Ј ends onto the genome by use of the raw data generated by the expressed sequence tag (EST) sequencing projects. We find that at least half of the human genes encode multiple transcripts whose polyadenylation is driven by multiple signals. The corresponding transcript 3Ј ends are spread over distances in the kilobase range. This finding has profound implications for our understanding of gene expression regulation and of the diversity of human transcripts, for the design of cDNA microarray probes, and for the interpretation of gene expression profiling experiments.
DEAD-box proteins comprise a family of ATP-dependent RNA helicases involved in several aspects of RNA metabolism. Here we report the characterization of the human DEAD-box RNA helicase DDX26. The gene is composed of 14 exons distributed over an extension of 8,123 bp of genomic sequence and encodes a transcript of 1.8 kb that is expressed in all tissues evaluated. The predicted amino acid sequence shows a high similarity to a yeast DEAD-box RNA helicase (Dbp9b) involved in ribosome biogenesis. The new helicase maps to 7p12, a region of frequent chromosome amplifications in glioblastomas involving the epidermal growth factor receptor (EGFR) gene. Nevertheless, co-amplification of DDX26 with EGFR was not detected in nine tumors analyzed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.