Maize genetic diversity has been used to understand the molecular basis of phenotypic variation and to improve agricultural efficiency and sustainability. We crossed 25 diverse inbred maize lines to the B73 reference line, capturing a total of 136,000 recombination events. Variation for recombination frequencies was observed among families, influenced by local (cis) genetic variation. We identified evidence for numerous minor single-locus effects but little two-locus linkage disequilibrium or segregation distortion, which indicated a limited role for genes with large effects and epistatic interactions on fitness. We observed excess residual heterozygosity in pericentromeric regions, which suggested that selection in inbred lines has been less efficient in these regions because of reduced recombination frequency. This implies that pericentromeric regions may contribute disproportionally to heterosis.
The multifunctional Escherichia coli proline utilization A (PutA) flavoprotein functions both as a membrane-associated proline catabolic enzyme and as a transcriptional repressor of the proline utilization genes putA and putP. To better understand the mechanism of transcriptional regulation by PutA, we have mapped the put-regulatory region, determined a crystal structure of the PutA ribbon-helix-helix domain (PutA52, a polypeptide corresponding to residues 1-52 of E. coli PutA) complexed with DNA, and examined the thermodynamics of DNA binding to PutA52. Five operator sites, each containing the sequence motif 5′-GTTGCA-3′, were identified using gel-shift analysis. Three of the sites are shown to be critical for repression of putA, whereas the two other sites are important for repression of putP. The 2.25-Å-resolution crystal structure of PutA52 bound to one of the operators (operator 2; 21 bp) shows that the protein contacts a 9-bp fragment corresponding to the GTTGCA consensus motif plus three flanking base pairs. Since the operator sequences differ in flanking bases, the structure implies that PutA may have different affinities for the five operators. This hypothesis was explored using isothermal titration calorimetry. The binding of PutA52 to operator 2 is exothermic, with an enthalpy of −1.8 kcal/mol and a dissociation constant of 210 nM. Substitution of the flanking bases of operator 4 into operator 2 results in an unfavorable enthalpy of 0.2 kcal/mol and a 15-fold-lower affinity, showing that base pairs outside of the consensus motif impact binding. Structural and thermodynamic data suggest that hydrogen bonds between Lys9 and bases adjacent to the GTTGCA motif contribute to transcriptional regulation by fine-tuning the affinity of PutA for put control operators.
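The calorimetry figures quoted above can be related to one another with the standard relations ΔG° = RT ln(Kd) and ΔG° = ΔH − TΔS. The following is a minimal sketch, not the authors' analysis: the temperature (298.15 K) and the 1 M standard state are assumptions, since the abstract states neither, and the function names are hypothetical.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)
ASSUMED_TEMP_K = 298.15  # ASSUMPTION: the abstract does not give the ITC temperature


def binding_free_energy(kd_molar, temp_k=ASSUMED_TEMP_K):
    """Standard binding free energy (kcal/mol) from a dissociation constant.

    Uses dG = RT ln(Kd / 1 M), so tighter binding gives a more negative value.
    """
    return R_KCAL * temp_k * math.log(kd_molar)


def entropic_term(delta_g, delta_h):
    """Return -T*dS (kcal/mol), obtained from dG = dH - T*dS."""
    return delta_g - delta_h


def fold_change_penalty(fold, temp_k=ASSUMED_TEMP_K):
    """Free-energy cost (kcal/mol) of a `fold`-times weaker affinity."""
    return R_KCAL * temp_k * math.log(fold)


# Values taken from the abstract: Kd = 210 nM, dH = -1.8 kcal/mol for operator 2.
dg_op2 = binding_free_energy(210e-9)
minus_tds_op2 = entropic_term(dg_op2, -1.8)
# The flanking-base substitution lowers affinity 15-fold.
ddg_flank = fold_change_penalty(15.0)

print(f"dG (operator 2):  {dg_op2:.2f} kcal/mol")
print(f"-T*dS (operator 2): {minus_tds_op2:.2f} kcal/mol")
print(f"ddG for 15-fold weaker binding: {ddg_flank:.2f} kcal/mol")
```

Under these assumptions the binding free energy comes out near −9.1 kcal/mol, so the small measured enthalpy would account for only a minor share of it, and the 15-fold affinity loss corresponds to roughly 1.6 kcal/mol. These numbers shift slightly with the actual experimental temperature.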
Systematic evolution of ligands through exponential enrichment (SELEX) is a well-established method for generating nucleic acid populations that are enriched for specified functions. High-throughput sequencing (HTS) enhances the power of comparative sequence analysis to reveal details of how RNAs within these populations recognize their targets. We used HTS analysis to evaluate RNA populations selected to bind type I human immunodeficiency virus reverse transcriptase (RT). The populations are enriched in RNAs of independent lineages that converge on shared motifs and in clusters of RNAs with nearly identical sequences that share common ancestry. Both of these features informed inferences of the secondary structures of enriched RNAs, their minimal structural requirements and their stabilities in RT-aptamer complexes. Monitoring population dynamics in response to increasing selection pressure revealed RNA inhibitors of RT that are more potent than the previously identified pseudoknots. Improved potency was observed for inhibition of both purified RT in enzymatic assays and viral replication in cell-based assays. Structural and functional details of converged motifs that are obscured by simple consensus descriptions are also revealed by the HTS analysis. The approach presented here can readily be generalized for the efficient and systematic post-SELEX development of aptamers for down-stream applications.
A computational comparison of 102 high-resolution (≤1.90 Å) enzyme-dinucleotide (NAD, NADP, FAD) complexes was performed to investigate the role of solvent in dinucleotide recognition by Rossmann fold domains. The typical binding site contains about 9-12 water molecules, and about 30% of the hydrogen bonds between the protein and the dinucleotide are water mediated. Detailed inspection of the structures reveals a structurally conserved water molecule bridging dinucleotides with the well-known glycine-rich phosphate-binding loop. This water molecule displays a conserved hydrogen-bonding pattern. It forms hydrogen bonds to the dinucleotide pyrophosphate, two of the three conserved glycine residues of the phosphate-binding loop, and a residue at the C-terminus of strand four of the Rossmann fold. The conserved water molecule is also present in high-resolution structures of apo enzymes. However, the water molecule is not present in structures displaying significant deviations from the classic Rossmann fold motif, such as having nonstandard topology, containing a very short phosphate-binding loop, or having α-helix "A" oriented perpendicular to the β-sheet. Thus, the conserved water molecule appears to be an inherent structural feature of the classic Rossmann dinucleotide-binding domain.
Teosinte (Zea mays subsp. parviglumis H. H. Iltis & Doebley) has greater genetic diversity than maize inbreds and landraces (Z. mays subsp. mays). There are, however, limited genetic resources to efficiently evaluate and tap this diversity. To broaden resources for genetic diversity studies in maize, we developed and evaluated 928 near-isogenic introgression lines (NILs) from 10 teosinte accessions in the B73 background. Joint linkage analysis of the 10 introgression populations identified several large-effect quantitative trait loci (QTL) for days to anthesis (DTA), kernel row number (KRN), and 50-kernel weight (Wt50k). Our results confirm prior reports of kernel domestication loci and identify previously uncharacterized QTL with a range of allelic effects enabling future research into the genetic basis of these traits. Additionally, we used a targeted set of NILs to validate the effects of a KRN QTL located on chromosome 2. These introgression populations offer novel tools for QTL discovery and validation as well as a platform for initiating fine mapping.
Protein-bound water molecules are important components of protein structure, and therefore, protein function and energetics. Although structural conservation of solvent has been studied in a few protein families, a lack of suitable computational tools has hindered more comprehensive analyses. Herein we present a semiautomated computational approach for identifying solvent sites that are conserved among proteins sharing a common three-dimensional structure. This method is tested on six protein families: (1) monodomain cytochrome c, (2) fatty-acid binding protein, (3) lactate/malate dehydrogenase, (4) parvalbumin, (5) phospholipase A2, and (6) serine protease. For each family, the method successfully identified previously known conserved solvent sites. Moreover, the method discovered 22 novel conserved solvent sites, some of which have higher degrees of conservation than the previously known sites. All six families studied had solvent sites with more than 90% conservation and these sites were invariably located in regions of the protein with very high sequence conservation. These results suggest that highly conserved solvent sites, by virtue of their proximity to conserved residues, should be considered as one of the defining three-dimensional structural characteristics of protein families and folds.
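The abstract does not describe the algorithm itself, so the following is only an illustrative sketch of one common way to score solvent-site conservation, not the published method. It assumes the structures have already been superposed onto a common frame, clusters water oxygen positions greedily within a distance cutoff, and reports the fraction of structures contributing a water to each cluster. The function name `conserved_sites`, the 1.0 Å cutoff, and the input layout are all hypothetical.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float, float]


def conserved_sites(
    waters: Dict[str, List[Point]], cutoff: float = 1.0
) -> List[Tuple[Point, float]]:
    """Cluster water positions from pre-superposed structures.

    `waters` maps a structure ID to its water oxygen coordinates (in Angstroms).
    Returns (centroid, conservation) pairs sorted by decreasing conservation,
    where conservation is the fraction of structures with a water in the site.
    ASSUMPTION: the cutoff and the greedy assignment are illustrative choices.
    """
    clusters = []  # each cluster: {"points": [...], "ids": set, "centroid": Point}
    for structure_id, points in waters.items():
        for p in points:
            best, best_d = None, cutoff
            for c in clusters:
                d = math.dist(p, c["centroid"])
                # A structure contributes at most one water per site.
                if d <= best_d and structure_id not in c["ids"]:
                    best, best_d = c, d
            if best is None:
                clusters.append({"points": [p], "ids": {structure_id}, "centroid": p})
            else:
                best["points"].append(p)
                best["ids"].add(structure_id)
                n = len(best["points"])
                best["centroid"] = tuple(
                    sum(q[i] for q in best["points"]) / n for i in range(3)
                )
    total = len(waters)
    sites = [(c["centroid"], len(c["ids"]) / total) for c in clusters]
    sites.sort(key=lambda s: s[1], reverse=True)
    return sites


# Toy input: three superposed structures sharing one water near the origin.
demo = {
    "a": [(0.0, 0.0, 0.0), (5.0, 5.0, 5.0)],
    "b": [(0.3, 0.0, 0.0)],
    "c": [(0.0, 0.2, 0.0)],
}
demo_sites = conserved_sites(demo)
for centroid, conservation in demo_sites:
    print(centroid, f"{conservation:.0%}")
```

In the toy input, the water near the origin is found in all three structures while the water at (5, 5, 5) appears in only one, so the first site would be flagged as fully conserved. A real analysis would also need the structural superposition step and a way to handle crystallographic symmetry, both omitted here.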
The Kelch repeat is a common sequence motif in eukaryotic genomes and is approximately 50 amino acids in length. The structure of the Kelch domain of the human Keap1 protein has previously been determined at 1.85 Angstrom, showing that each Kelch repeat forms one blade of a six-bladed beta-propeller. Here, use of 1.35 Angstrom SAD data for de novo structure determination of the Kelch domain and for refinement at atomic resolution is described. The high quality and resolution of the diffraction data and phase information allows a detailed analysis of the role of solvent in the structure of the Kelch repeat. Ten structurally conserved water molecules are identified in each blade of the Kelch beta-propeller. These appear to play distinct structural roles that include lining the central channel of the propeller, interacting with residues in loops between strands of the blade and making contacts with conserved residues in the Kelch repeat. Furthermore, we identify a conserved C-H...pi hydrogen bond between two key residues in the consensus Kelch repeat. This analysis extends our understanding of the structural roles of conserved residues in the Kelch repeat and highlights the potential role of solvent in maintaining the fold of this common eukaryotic structural motif.
Premise: Whole‐genome duplications (WGDs) are prevalent throughout the evolutionary history of plants. For example, dozens of WGDs have been phylogenetically localized across the order Brassicales, specifically, within the family Brassicaceae. A WGD event has also been identified in the Cleomaceae, the sister family to Brassicaceae, yet its placement, as well as that of WGDs in other families in the order, remains unclear. Methods: Phylo‐transcriptomic data were generated and used to infer a nuclear phylogeny for 74 Brassicales taxa. Genome survey sequencing was also performed on 66 of those taxa to infer a chloroplast phylogeny. These phylogenies were used to assess and confirm relationships among the major families of the Brassicales and within Brassicaceae. Multiple WGD inference methods were then used to assess the placement of WGDs on the nuclear phylogeny. Results: Well‐supported chloroplast and nuclear phylogenies for the Brassicales and the putative placement of the Cleomaceae‐specific WGD event Th‐α are presented. This work also provides evidence for previously hypothesized WGDs, including a well‐supported event shared by at least two members of the Resedaceae family, and a possible event within the Capparaceae. Conclusions: Phylogenetics and the placement of WGDs within highly polyploid lineages continue to be major challenges. This study adds to the conversation on WGD inference difficulties by demonstrating that sampling is especially important for WGD identification and phylogenetic placement. Given its economic importance and genomic resources, the Brassicales continues to be an ideal group for assessing WGD inference methods.