The most important product of the sequencing of a genome is a complete, accurate catalogue of genes and their products, primarily messenger RNA transcripts and their cognate proteins. Such a catalogue cannot be constructed by computational annotation alone; it requires experimental validation on a genome scale. Using 'exon' and 'tiling' arrays fabricated by ink-jet oligonucleotide synthesis, we devised an experimental approach to validate and refine computational gene predictions and define full-length transcripts on the basis of co-regulated expression of their exons. These methods can provide more accurate gene numbers and allow the detection of mRNA splice variants and identification of the tissue- and disease-specific conditions under which genes are expressed. We apply our technique to chromosome 22q under 69 experimental condition pairs, and to the entire human genome under two experimental conditions. We discuss implications for more comprehensive, consistent and reliable genome annotation, more efficient, full-length complementary DNA cloning strategies and application to complex diseases.
The nematode Caenorhabditis elegans is the first animal whose genome is completely sequenced, providing a rich source of gene information relevant to metazoan biology and human disease. This abundant sequence information permits a broad-based gene inactivation approach in C. elegans, in which chemically mutagenized nematode populations are screened by PCR for deletion mutations in a specific targeted gene. By handling mutagenized worm growth, genomic DNA templates, PCR screens, and mutant recovery all in 96-well microtiter plates, we have scaled up this approach to isolate deletion mutations in >100 genes to date. Four chemical mutagens, including ethyl methane sulfonate, ethlynitrosourea, diepoxyoctane, and ultraviolet-activated trimethylpsoralen, induced detectable deletions at comparable frequencies. The deletions averaged ∼1400 bp in size when using a ∼3 kb screening window. The vast majority of detected deletions removed portions of one or more exons, likely resulting in loss of gene function. This approach requires only the knowledge of a target gene sequence and a suitable mutagen, and thus provides a scalable systematic approach to gene inactivation for any organism that can be handled in high density arrays.
Ribozymes use specific RNA-RNA interactions for substrate binding and active-site formation. Self-splicing group I introns have approximately 70 nucleotides constituting the core, a region containing sequences and structures indispensable for catalytic function. The catalytic core must interact with the substrates used for the two steps of the self-splicing reaction, that is, guanosine, the 5'-splice-site helix (P1) and the 3' splice site. Mutational evidence suggests that core sequences near segment J6/7 that joins the base-paired stems P6 and P7, and the bulged base of P7(5'), participate in binding guanosine substrate, but nothing is known about the interactions between the core, the 5'-splice-site helix and the 3' splice site. On the basis of comparative sequence data, it has been suggested that two specific bases in the catalytic core of group I introns might form a binding sequence for the 3' splice site. Here we present genetic evidence that such a binding site exists in the core of the Tetrahymena large subunit ribosomal RNA intron. We demonstrate that this pairing, termed P9.0, is functionally important in the exon ligation step of self-splicing, but is not itself responsible for 3'-splice-site selection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.