The genome of the japonica subspecies of rice, an important cereal and model monocot, was sequenced and assembled by whole-genome shotgun sequencing. The assembled sequence covers 93% of the 420-megabase genome. Gene predictions on the assembled sequence suggest that the genome contains 32,000 to 50,000 genes. Homologs of 98% of the known maize, wheat, and barley proteins are found in rice. Synteny and gene homology between rice and the other cereal genomes are extensive, whereas synteny with Arabidopsis is limited. Assignment of candidate rice orthologs to Arabidopsis genes is possible in many cases. The rice genome sequence provides a foundation for the improvement of cereals, our most important crops.
We have produced a draft sequence of the rice genome for the most widely cultivated subspecies in China, Oryza sativa L. ssp. indica, by whole-genome shotgun sequencing. The genome was 466 megabases in size, with an estimated 46,022 to 55,615 genes. Functional coverage in the assembled sequences was 92.0%. About 42.2% of the genome was in exact 20-nucleotide oligomer repeats, and most of the transposons were in the intergenic regions between genes. Although 80.6% of predicted Arabidopsis thaliana genes had a homolog in rice, only 49.4% of predicted rice genes had a homolog in A. thaliana. The large proportion of rice genes with no recognizable homologs is due to a gradient in the GC content of rice coding sequences.
We report here the molecular cloning of an Ϸ1-Mb region of recurrent amplification at 20q13.2 in breast cancer and other tumors and the delineation of a 260-kb common region of amplification. Analysis of the 1-Mb region produced evidence for five genes, ZNF217, ZNF218, and NABC1, PIC1L (PIC1-like), CYP24, and a pseudogene CRP (Cyclophillin Related Pseudogene). ZNF217 and NABC1 emerged as strong candidate oncogenes and were characterized in detail. NABC1 is predicted to encode a 585-aa protein of unknown function and is overexpressed in most but not all breast cancer cell lines in which it was amplified. ZNF217 is centrally located in the 260-kb common region of amplification, transcribed in multiple normal tissues, and overexpressed in all cell lines and tumors in which it is amplified and in two in which it is not. ZNF217 is predicted to encode alternately spliced, Kruppel-like transcription factors of 1,062 and 1,108 aa, each having a DNA-binding domain (eight C2H2 zinc fingers) and a proline-rich transcription activation domain.Studies in which comparative genomic hybridization was used have revealed Ϸ20 regions of recurrent increased DNA sequence copy number in breast tumors (1-3). These regions are predicted to encode dominantly acting genes that may play a role in tumor progression or response to therapy. To date, three of these regions have been associated with established oncogenes: ERBB2 at 17q12, MYC at 8q24, and CCND1 and EMS1 at 11q13. In breast cancer, ERBB2 and CCND1͞EMS1 amplification and overexpression are associated with decreased life expectancy (4, 5), whereas MYC amplification has been associated with lymph node involvement, advanced stage, and an increased rate of relapse (6, 7). Efforts are now underway in several laboratories to identify oncogenes in the other regions of amplification.Amplification at 20q13 is particularly interesting because this aberration occurs in a variety of tumor types and is associated with aggressive tumor behavior. The initial comparative genomic hybridization study showed increased copy number involving 20q13 in 40% of breast cancer cell lines and 18% of primary breast tumors (2). Since then, other comparative genomic hybridization studies have revealed copynumber gains at 20q13 in greater than 25% of cancers of the ovary (8), colon (9), head and neck (10
Identification of ciliated sensory neuron-expressed genes in Caenorhabditis elegans
An mRNA-tagging method was used to selectively isolate mRNA from a small number of cells for subsequent cDNA microarray analysis. The approach was used to identify genes specifically expressed in ciliated sensory neurons of Caenorhabditis elegans.
Abstract It is not always easy to apply microarray technology to small numbers of cells because of the difficulty in selectively isolating mRNA from such cells. We report here the preparation of mRNA from ciliated sensory neurons of Caenorhabditis elegans using the mRNA-tagging method, in which poly(A) RNA was co-immunoprecipitated with an epitope-tagged poly(A)-binding protein specifically expressed in sensory neurons. Subsequent cDNA microarray analyses led to the identification of a panel of sensory neuron-expressed genes.
Next generation drug screening could benefit greatly from in vivo studies, using small animal models such as Caenorhabditis elegans for hit identification and lead optimization. Current in vivo assays can operate either at low throughput with high resolution or with low resolution at high throughput. To enable both high-throughput and high-resolution imaging of C. elegans, we developed an automated microfluidic platform. This platform can image 15 z-stacks of ∼4,000 C. elegans from 96 different populations using a large-scale chip with a micron resolution in 16 min. Using this platform, we screened ∼100,000 animals of the poly-glutamine aggregation model on 25 chips. We tested the efficacy of ∼1,000 FDA-approved drugs in improving the aggregation phenotype of the model and identified four confirmed hits. This robust platform now enables high-content screening of various C. elegans disease models at the speed and cost of in vitro cell-based assays.
SJG-136 (3) is a novel pyrrolobenzodiazepine (PBD) dimer that is predicted from molecular models to bind in the minor groove of DNA and to form sequence-selective interstrand cross-links at 5'-Pu-GATC-Py-3' (Pu = purine; Py = pyrimidine) sites through covalent bonding between each PBD unit and guanines on opposing strands. Footprinting studies have confirmed that high-affinity adducts do form at 5'-G-GATC-C-3' sequences and that these can inhibit RNA polymerase in a sequence-selective manner. At higher concentrations of SJG-136, bands that migrate more slowly than one of the 5'-G-GATC-C-3' footprint sites show significantly reduced intensity, concomitant with the appearance of higher molecular weight material near the gel origin. This phenomenon is attributed to interstrand cross-linking at the 5'-G-GATC-C-3' site and is the first report of DNA footprinting being used to detect interstrand cross-linked adducts. The control dimer GD113 (4), of similar structure to SJG-136 but unable to cross-link DNA due to its C7/C7'-linkage rather than C8/C8'-linkage, neither produces footprints with the same DNA sequence nor blocks transcription at comparable concentrations. In addition to the two high-affinity 5'-G-GATC-C-3' footprints on the MS2 DNA sequence, other SJG-136 adducts of lower affinity are observed that can still block transcription but with lower efficiency. All these sites contain the 5'-GXXC-3' motif (where XX includes AG, TA, GC, CT, TT, GG, and TC) and represent less-favored cross-link sites. In time-course experiments, SJG-136 blocks transcription if incubated with a double-stranded DNA template before the transcription components are added; addition after transcription is initiated fails to elicit blockage. Single-strand ligation PCR studies on a sequence from the c-jun gene show that SJG-136 binds to 5'-GAAC-3'/5'-GTTC-3' (preferred) or 5'-GAGC-3'/5'-GCTC-3' sequences. Significantly, adducts are obtained at the same sequences following extraction of DNA from drug-treated K562 cells, confirming that the agent reaches the cellular genome and interacts with the DNA in a sequence-selective fashion. Finally, SJG-136 efficiently inhibits the action of restriction endonuclease BglII, which has a 5'-A-GATC-T-3' motif at its cleavage site.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.