We describe the core Protein Production Platform of the Northeast Structural Genomics Consortium (NESG) and outline the strategies used for producing high-quality protein samples. The platform is centered on the cloning, expression and purification of 6X-His-tagged proteins using T7-based Escherichia coli systems. The 6X-His tag allows for similar purification procedures for most targets and implementation of high-throughput (HTP) parallel methods. In most cases, the 6X-His-tagged proteins are sufficiently purified (> 97% homogeneity) using a HTP two-step purification protocol for most structural studies. Using this platform, the open reading frames of over 16,000 different targeted proteins (or domains) have been cloned as > 26,000 constructs. Over the past nine years, more than 16,000 of these expressed protein, and more than 4,400 proteins (or domains) have been purified to homogeneity in tens of milligram quantities (see Summary Statistics, http://nesg.org/statistics.html). Using these samples, the NESG has deposited more than 900 new protein structures to the Protein Data Bank (PDB). The methods described here are effective in producing eukaryotic and prokaryotic protein samples in E. coli. This paper summarizes some of the updates made to the protein production pipeline in the last five years, corresponding to phase 2 of the NIGMS Protein Structure Initiative (PSI-2) project. The NESG Protein Production Platform is suitable for implementation in a large individual laboratory or by a small group of collaborating investigators. These advanced automated and/or parallel cloning, expression, purification, and biophysical screening technologies are of broad value to the structural biology, functional proteomics, and structural genomics communities.
In this chapter, we concentrate on the production of high quality protein samples for NMR studies. In particular, we provide an in-depth description of recent advances in the production of NMR samples and their synergistic use with recent advancements in NMR hardware. We describe the protein production platform of the Northeast Structural Genomics Consortium, and outline our high-throughput strategies for producing high quality protein samples for nuclear magnetic resonance (NMR) studies. Our strategy is based on the cloning, expression and purification of 6X-His-tagged proteins using T7-based Escherichia coli systems and isotope enrichment in minimal media. We describe 96-well ligation-independent cloning and analytical expression systems, parallel preparative scale fermentation, and high-throughput purification protocols. The 6X-His affinity tag allows for a similar two-step purification procedure implemented in a parallel high-throughput fashion that routinely results in purity levels sufficient for NMR studies (> 97% homogeneity). Using this platform, the protein open reading frames of over 17,500 different targeted proteins (or domains) have been cloned as over 28,000 constructs. Nearly 5,000 of these proteins have been purified to homogeneity in tens of milligram quantities (see Summary Statistics, http://nesg.org/statistics.html), resulting in more than 950 new protein structures, including more than 400 NMR structures, deposited in the Protein Data Bank. The Northeast Structural Genomics Consortium pipeline has been effective in producing protein samples of both prokaryotic and eukaryotic origin. Although this paper describes our entire pipeline for producing isotope-enriched protein samples, it focuses on the major updates introduced during the last 5 years (Phase 2 of the National Institute of General Medical Sciences Protein Structure Initiative). Our advanced automated and/or parallel cloning, expression, purification, and biophysical screening technologies are suitable for implementation in a large individual laboratory or by a small group of collaborating investigators for structural biology, functional proteomics, ligand screening and structural genomics research.
Background: Although highly homologous to MBD2, the functional role of MBD3 remains in question. Results: MBD3 preferentially localizes to methylated and, to a lesser degree, unmethylated CpG dinucleotides. Conclusion: Dynamic distribution between methylated and unmethylated sites modifies the genomic localization of MBD3. Significance: Changes in the dynamic distribution on DNA dictate functional differences between MBD proteins.
Unlike other members of the methyl-cytosine binding domain (MBD) family, MBD4 serves as a potent DNA glycosylase in DNA mismatch repair specifically targeting mCpG/TpG mismatches arising from spontaneous deamination of methyl-cytosine. The protein contains an N-terminal MBD (MBD4MBD) and a C-terminal glycosylase domain (MBD4GD) separated by a long linker. This arrangement suggests that the MBD4MBD either directly augments enzymatic catalysis by the MBD4GD or targets the protein to regions enriched for mCpG/TpG mismatches. Here we present structural and dynamic studies of MBD4MBD bound to dsDNA. We show that MBD4MBD binds with a modest preference formCpG as compared to mismatch, unmethylated and hydroxymethylated DNA. We find that while MBD4MBD exhibits slow exchange between molecules of DNA (intermolecular exchange), the domain exhibits fast exchange between two sites in the same molecule of dsDNA (intramolecular exchange). Introducing a single-strand defect between binding sites does not greatly reduce the intramolecular exchange rate, consistent with a local hopping mechanism for moving along the DNA. These results support a model in which the MBD4MBD4 targets the intact protein to mCpG islands and promotes scanning by rapidly exchanging between successive mCpG sites which facilitates repair of nearby mCpG/TpG mismatches by the glycosylase domain.
The product of the meiosis-expressed gene 1 (MEIG1) is found in the cell bodies of spermatocytes and recruited to the manchette, a structure unique to elongating spermatids, by Parkin co-regulated gene (PACRG). This complex is essential for targeting cargo to the manchette during sperm flagellum assembly. Here we show that MEIG1 adopts a unique fold that provides a large surface for interacting with other proteins. We mutated 12 exposed and conserved amino acids and show that four of these mutations (W50A, K57E, F66A, Y68A) dramatically reduce binding to PACRG. These four amino acids form a contiguous hydrophobic patch on one end of the protein. Furthermore, each of these four mutations diminishes the ability of MEIG1 to stabilize PACRG when expressed in bacteria. Together these studies establish the unique structure and key interaction surface of MEIG1 and provide a framework to explore how MEIG1 recruits proteins to build the sperm tail.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.