Cytochrome P450 monooxygenases (CYPs) constitute a ubiquitous, highly divergent protein family. Nevertheless, all CYPs share a common fold and conserved catalytic machinery. Based on the electron donor system, 10 classes of CYPs have been described, but most CYPs are members of class I accepting electrons from ferredoxin which is being reduced by FAD-containing reductase, or class II accepting electrons from FAD- and FMN-containing CPR-type reductase. Because of the low sequence conservation inside the two classes, the conserved class-specific positions are expected to be involved in aspects of electron transfer that are specific to the two types of reductases. In this work we present results from a conservation analysis of 16,732 CYP sequences derived from an updated version of the Cytochrome P450 Engineering Database (CYPED), using two class-specific numbering schemes. While no position was conserved on the distal, substrate-binding surface of the CYPs, several class-specific residues were found on the proximal, reductase-interacting surface; two class I-specific residues that were negatively charged, and three class II-specific residues that were aromatic or charged. The class-specific conservation of glycine and proline residues in the cysteine pocket indicates that there are class-specific differences in the flexibility of this element. Four heme-interacting arginines were conserved differently in each class, and a class-specific substitution of a heme-interacting tyrosine by histidine was found, pointing to a link between heme stabilization and the reductase type.
Thiamine diphosphate-dependent enzymes are broadly distributed in all organisms, and they catalyse a broad range of C-C bond forming and breaking reactions. Enzymes belonging to the structural families of decarboxylases and transketolases have been particularly well investigated concerning their substrate range, mechanism of stereoselective carboligation and carbolyase reaction. Both structurally different enzyme families differ also in stereoselectivity: enzymes from the decarboxylase family are predominantly R-selective, whereas those from the transketolase family are S-selective. In recent years a key focus of our studies has been on stereoselective benzoin condensation-like 1,2-additions. Meanwhile, several S-selective variants of pyruvate decarboxylase, benzoylformate decarboxylase and 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate (SEPH-CHC) synthase as well as R-selective transketolase variants were created that allow access to a broad range of enantiocomplementary a-hydroxyketones and a,a′-dihydroxyketones. This review covers recent studies and the mechanistic understanding of stereoselective C-C bond forming thiamine diphosphate-dependent enzymes, which has been guided by structure-function analyses based on mutagenesis studies and from influences of different substrates and organic co-solvents on stereoselectivity.
BackgroundStandard numbering schemes for families of homologous proteins allow for the unambiguous identification of functionally and structurally relevant residues, to communicate results on mutations, and to systematically analyse sequence-function relationships in protein families. Standard numbering schemes have been successfully implemented for several protein families, including lactamases and antibodies, whereas a numbering scheme for the structural family of thiamine-diphosphate (ThDP) -dependent decarboxylases, a large subfamily of the class of ThDP-dependent enzymes encompassing pyruvate-, benzoylformate-, 2-oxo acid-, indolpyruvate- and phenylpyruvate decarboxylases, benzaldehyde lyase, acetohydroxyacid synthases and 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexadiene-1-carboxylate synthase (MenD) is still missing.Despite a high structural similarity between the members of the ThDP-dependent decarboxylases, their sequences are diverse and make a pairwise sequence comparison of protein family members difficult.ResultsWe developed and validated a standard numbering scheme for the family of ThDP-dependent decarboxylases. A profile hidden Markov model (HMM) was created using a set of representative sequences from the family of ThDP-dependent decarboxylases. The pyruvate decarboxylase from S. cerevisiae (PDB: 2VK8) was chosen as a reference because it is a well characterized enzyme. The crystal structure with the PDB identifier 2VK8 encompasses the structure of the ScPDC mutant E477Q, the cofactors ThDP and Mg2+ as well as the substrate analogue (2S)-2-hydroxypropanoic acid. The absolute numbering of this reference sequence was transferred to all members of the ThDP-dependent decarboxylase protein family. Subsequently, the numbering scheme was integrated into the already established Thiamine-diphosphate dependent Enzyme Engineering Database (TEED) and was used to systematically analyze functionally and structurally relevant positions in the superfamily of ThDP-dependent decarboxylases.ConclusionsThe numbering scheme serves as a tool for the reliable sequence alignment of ThDP-dependent decarboxylases and the unambiguous identification and communication of corresponding positions. Thus, it is the basis for the systematic and automated analysis of sequence-encoded properties such as structural and functional relevance of amino acid positions, because the analysis of conserved positions, the identification of correlated mutations and the determination of subfamily specific amino acid distributions depend on reliable multisequence alignments and the unambiguous identification of the alignment columns. The method is reliable and robust and can easily be adapted to further protein families.
Thiamine diphosphate (ThDP)-dependent enzymes form a diverse protein family which was classified into nine superfamilies. The cofactor ThDP is bound at the interface between two catalytic domains, the PYR and the PP domain. The nine superfamilies were assigned to five different structural architectures. Two superfamilies, the sulfopyruvate decarboxylases and α-ketoacid dehydrogenases 2, consist of separate PYR and PP domains. The oxidoreductase superfamily is of the intra-monomer/PYR-PP type with an N-terminal PYR and a subsequent PP domain. The active enzymes form homodimers with the ThDP cofactor bound at the interface between a PYR and a PP domain of the same monomer. Decarboxylases are of the inter-monomer/PYR-PP type with the cofactor bound between domains from different monomers. 1-Deoxy-d-xylulose-5-phosphate synthases are of the intra-monomer/PP-PYR type. The transketolases, α-ketoglutarate dehydrogenases, and α-ketoacid dehydrogenases 1 are of the inter-monomer/PP-PYR type. For the phosphonopyruvate decarboxylases, definitive assessment of the structural architecture is not possible due to lack of structure information. By applying a structure-based domain alignment method, sequences of more than 62,000 PYR and PP domains were identified and aligned. Although the sequence similarity of the catalytic domains is low between different superfamilies, seven positions were identified to be highly conserved, including the cofactor binding GDGX(24,27) N motif, the cofactor-activating glutamic acid, and two structurally equivalent glycines in both the PYR and the PP domain. An evolutionary pathway of ThDP-dependent enzymes is proposed which explains the sequence and structure diversity of this family by three basic evolutionary events: domain recruitment, domain linkage, and structural rearrangement of catalytic domains.
Cytochrome P450 monooxygenases (CYPs) are a large, highly diverse protein family with a common fold. The sequences, structures, and functions of CYPs have been extensively studied resulting in more than 53,000 scientific articles. A sequence-based literature mining algorithm was designed to systematically analyze this wealth of information on SNPs, designed mutations, structural interactions, or functional roles of individual residues. Structurally corresponding positions in different CYPs were compared and universal selectivity-determining positions were identified. Based on the Cytochrome P450 Engineering Database (www.CYPED.BioCatNet.de) and a standard numbering scheme for all CYPs, 4000 residues in 168 CYPs mentioned in 2400 articles could be assigned to 440 structurally corresponding standard positions of the CYP fold, covering 96% of all standard positions. Seventeen individual standard positions were mentioned in the context of more than 32 different CYPs. The majority of these most frequently mentioned positions are located on the six substrate recognition sites and are involved in control of selectivity, such as the well-studied position 87 in CYP102A1 (P450(BM-3)) which was mentioned in the articles on 63 different CYPs. The recurrent citation of the 17 frequently mentioned positions for different CYPs suggests their universal functional relevance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.