REBASE is a comprehensive and fully curated database of information about the components of restriction-modification (RM) systems. It contains fully referenced information about recognition and cleavage sites for both restriction enzymes and methyltransferases, as well as commercial availability, methylation sensitivity, and crystal and sequence data. All completely sequenced genomes are analyzed for RM system components, and with the advent of PacBio sequencing, the recognition sequences of DNA methyltransferases (MTases) are appearing rapidly. Thus, Type I and Type III systems can now be characterized in terms of recognition specificity merely by DNA sequencing. The contents of REBASE may be browsed on the web (http://rebase.neb.com), and selected compilations can be downloaded by FTP (ftp.neb.com). Monthly updates are also available via email.
A nomenclature is described for restriction endonucleases, DNA methyltransferases, homing endonucleases and related genes and gene products. It provides explicit categories for the many different Type II enzymes now identified and provides a system for naming the putative genes found by sequence analysis of microbial genomes.
REBASE is a comprehensive database of information about restriction enzymes, DNA methyltransferases and related proteins involved in the biological process of restriction–modification (R–M). It contains fully referenced information about recognition and cleavage sites, isoschizomers, neoschizomers, commercial availability, methylation sensitivity, crystal and sequence data. Experimentally characterized homing endonucleases are also included. The fastest growing segment of REBASE contains the putative R–M systems found in the sequence databases. Comprehensive descriptions of the R–M content of all fully sequenced genomes are available including summary schematics. The contents of REBASE may be browsed from the web (http://rebase.neb.com) and selected compilations can be downloaded by ftp (ftp.neb.com). Additionally, monthly updates can be requested via email.
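As a rough illustration of what a recognition-site entry can be used for, the sketch below scans a DNA sequence for a recognition sequence written in IUPAC nucleotide codes and checks both strands. It is not part of REBASE or any REBASE tool; the function names (`find_sites`, `site_to_regex`, `reverse_complement`) and the test sequences are hypothetical, and the sites used in the usage note (GAATTC for EcoRI, GANTC for HinfI) are well-known textbook examples rather than data retrieved from the database.

```python
import re

# IUPAC nucleotide codes mapped to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

# Complement table that also covers the degenerate codes.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(site: str) -> str:
    """Return the reverse complement of a site written in IUPAC codes."""
    return site.translate(COMPLEMENT)[::-1]


def site_to_regex(site: str) -> "re.Pattern[str]":
    """Compile a site into a regex; the lookahead reports overlapping hits."""
    body = "".join(IUPAC[base] for base in site.upper())
    return re.compile(f"(?=({body}))")


def find_sites(sequence: str, site: str) -> list[tuple[int, str]]:
    """Return sorted (0-based start, strand) pairs for every occurrence.

    A palindromic site reads the same on both strands, so it is reported
    once, on the "+" strand only.
    """
    sequence = sequence.upper()
    site = site.upper()
    hits = {(m.start(), "+") for m in site_to_regex(site).finditer(sequence)}
    rc = reverse_complement(site)
    if rc != site:
        # Non-palindromic site: a match to the reverse complement on the
        # given strand means the site lies on the opposite strand.
        hits |= {(m.start(), "-") for m in site_to_regex(rc).finditer(sequence)}
    return sorted(hits)
```

For example, `find_sites("TTGAATTCAAGATC", "GAATTC")` returns `[(2, "+")]`, and the degenerate site in `find_sites("GACTCGATTC", "GANTC")` matches at positions 0 and 5. Cleavage positions, methylation sensitivity and isoschizomer relationships are not modelled here.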
Single-molecule real-time (SMRT) DNA sequencing allows the systematic detection of chemical modifications such as methylation but has not previously been applied on a genome-wide scale. We used this approach to detect 49,311 putative 6-methyladenine (m6A) residues and 1,407 putative 5-methylcytosine (m5C) residues in the genome of a pathogenic Escherichia coli strain. We obtained strand-specific information for methylation sites and a quantitative assessment of the frequency of methylation at each modified position. We deduced the sequence motifs recognized by the methyltransferase enzymes present in this strain without prior knowledge of their specificity. Furthermore, we found that deletion of a phage-encoded methyltransferase-endonuclease (restriction-modification; RM) system induced global transcriptional changes and led to gene amplification, suggesting that the role of RM systems extends beyond protecting host genomes from foreign DNA.
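The idea of deducing a recognition motif from modified positions alone can be sketched in a few lines. The example below is a deliberately simplified toy, not the method used in the study: it takes a sequence and a list of modified positions, collects the flanking bases around each position, and reports a per-column consensus, writing `N` where no base dominates. The function name `context_consensus`, its parameters and the toy sequence are all assumptions made for illustration; real motif calling works on per-base kinetic scores across a whole genome and must also handle several overlapping motifs.

```python
from collections import Counter


def context_consensus(genome: str, positions: list[int],
                      flank: int = 2, min_fraction: float = 0.9) -> str:
    """Consensus of the sequence context around modified positions.

    Each window spans `flank` bases on either side of a modified base.
    A column is reported as a specific base only if that base occurs in
    at least `min_fraction` of the windows; otherwise it is "N".
    """
    # Skip positions too close to either end to give a full window.
    windows = [
        genome[p - flank : p + flank + 1]
        for p in positions
        if p - flank >= 0 and p + flank < len(genome)
    ]
    if not windows:
        return ""
    consensus = []
    for column in zip(*windows):
        base, count = Counter(column).most_common(1)[0]
        consensus.append(base if count / len(windows) >= min_fraction else "N")
    return "".join(consensus)


# Toy input: a made-up sequence in which the A of every GATC is marked
# as modified (0-based positions 3, 9 and 15).
toy_genome = "TTGATCAAGATCCCGATCGG"
toy_modified = [3, 9, 15]
motif = context_consensus(toy_genome, toy_modified)
```

With this toy input, `motif` is `"NGATC"`: the first column varies between windows, while the remaining four columns are invariant and the modified base sits in the middle of the window.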
Twenty AdoMet-dependent methyltransferases (MTases) have been characterized structurally by X-ray crystallography and NMR. These include seven DNA MTases, five RNA MTases, four protein MTases and four small molecule MTases acting on the carbon, oxygen or nitrogen atoms of their substrates. The MTases share a common core structure of a mixed seven-stranded beta-sheet (6↓ 7↑ 5↓ 4↓ 1↓ 2↓ 3↓) referred to as an 'AdoMet-dependent MTase fold', with the exception of a protein arginine MTase which contains a compact consensus fold lacking the antiparallel hairpin strands (6↓ 7↑). The consensus fold is useful to identify hypothetical MTases during structural proteomics efforts on unannotated proteins. The same core structure works for very different classes of MTase including those that act on substrates differing in size from small molecules (catechol or glycine) to macromolecules (DNA, RNA and protein). DNA MTases use a 'base flipping' mechanism to deliver a specific base within a DNA molecule into a typically concave catalytic pocket. Base flipping involves rotation of backbone bonds in double-stranded DNA to expose an out-of-stack nucleotide, which can then be a substrate for an enzyme-catalyzed chemical reaction. The phenomenon is fully established for DNA MTases and for DNA base excision repair enzymes, and is likely to prove general for enzymes that require access to unpaired, mismatched or damaged nucleotides within base-paired regions in DNA and RNA. Several newly discovered MTase families in eukaryotes (DNA 5mC MTases and protein arginine and lysine MTases) offer new challenges in the MTase field.
Of the current next-generation sequencing technologies, SMRT sequencing is sometimes overlooked. However, attributes such as long reads, modified base detection and high accuracy make SMRT a useful technology and an ideal approach to the complete sequencing of small genomes.