Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations.
Plant adaptation to external pressures depends on functional diversity in cytochrome P450 (CYP) enzymes. CYPs contain structural domains necessary for the characteristic P450 fold that allows monooxygenation, but they also have great variation in substrate binding affinity. Plant genomes typically contain hundreds of CYPs that contribute to essential functions and species-specific metabolism. The CYP72A subfamily is conserved in angiosperms but its contribution to physiological functions is largely unknown. With genomic information available for many plants, a focused analysis of CYP subfamily diversity is important to understand the contributions of these enzymes to plant evolution. This study examines the extent to which independent gene duplication and evolution have contributed to structural diversification of CYP72A enzymes in different plant lineages. CYP72A genes are prevalent across angiosperms, but the number of genes within each genome varies greatly. The prevalence of CYP72As suggest that the last common ancestor of flowering plants contained a CYP72A sequence, but gene duplication and retention has varied greatly for this CYP subfamily. Sequence comparisons show that CYP72As are involved in species-specific metabolic functions in some plants while there is likely functional conservation between closely related species. Analysis of structural and functional domains within groups of CYP72As reveals clade-specific residues that contribute to functional constraints within subsets of CYP72As. This study provides a phylogenetic framework that allows comparisons of structural features within subsets of the CYP72A subfamily. We examined a large number of sequences from a broad collection of plant species to detect patterns of functional conservation across the subfamily. The evolutionary relationships between CYPs in plant genomes are an important component in understanding the evolution of biochemical diversity in plants.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.