The benzyl alcohol O-acetyl transferase, anthocyanin O-hydroxycinnamoyl transferase, N-hydroxycinnamoyl anthranilate benzoyl transferase, and deacetylvindoline 4-O-acetyltransferase (BAHD) enzymes play a critical role in regulating plant metabolites and affecting cell stability. In the present study, members of the BAHD gene family were recognized in the genome of Theobroma cacao and characterized using various bioinformatics tools. We found 27 non-redundant putative tcBAHD genes in cacao for the first time. Our findings indicate that tcBAHD genes are diverse based on sequence structure, physiochemical properties, and function. When analyzed with BAHDs of Gossypium raimondii and Corchorus capsularis clustered into four main groups. According to phylogenetic analysis, BAHD genes probably evolved drastically after their divergence. The divergence time of duplication events with purifying selection pressure was predicted to range from 1.82 to 15.50 MYA. Pocket analysis revealed that serine amino acid is more common in the binding site than other residuals, reflecting its key role in regulating the activity of tcBAHDs. Furthermore, cis-acting elements related to the responsiveness of stress and hormone, particularly ABA and MeJA, were frequently observed in the promoter region of tcBAHD genes. RNA-seq analysis further illustrated that tcBAHD13 and tcBAHD26 are involved in response to Phytophthora megakarya fungi. In conclusion, it is likely that evolutionary processes, such as duplication events, have caused high diversity in the structure and function of tcBAHD genes.
Family Phyllanthaceae is one of the largest segregates of the eudicot order Malpighiales and its species are herb, shrub, and tree, which are mostly distributed in tropical regions. Certain taxonomic discrepancies exist at genus and family level. Here, we report chloroplast genomes of three Phyllanthaceae species—Phyllanthus emblica, Flueggea virosa, and Leptopus cordifolius— and compare them with six others previously reported Phyllanthaceae chloroplast genomes. The species of Phyllanthaceae displayed quadripartite structure, comprising inverted repeat regions (IRa and IRb) that separate large single copy (LSC) and small single copy (SSC) regions. The length of complete chloroplast genome ranged from 154,707 bp to 161,093 bp; LSC from 83,627 bp to 89,932 bp; IRs from 23,921 bp to 27,128 bp; and SSC from 17,424 bp to 19,441 bp. Chloroplast genomes contained 111 to 112 unique genes, including 77 to 78 protein-coding, 30 transfer RNA (tRNA), and 4 ribosomal RNA (rRNA) that showed similarities in arrangement. The number of protein-coding genes varied due to deletion/pseudogenization of rps16 genes in Baccaurea ramiflora and Leptopus cordifolius. High variability was seen in number of oligonucleotide repeats while analysis of guanine-cytosine (GC) content, codon usage, amino acid frequency, simple sequence repeats analysis, synonymous and non-synonymous substitutions, and transition and transversion substitutions showed similarities in all Phyllanthaceae species. We detected a higher number of transition substitutions in the coding sequences than non-coding sequences. Moreover, the high number of transition substitutions was determined among the distantly related species in comparison to closely related species. Phylogenetic analysis shows the polyphyletic nature of the genus Phyllanthus which requires further verification. We also determined suitable polymorphic coding genes, including rpl22, ycf1, matK, ndhF, and rps15 which may be helpful for the reconstruction of the high-resolution phylogenetic tree of the family Phyllanthaceae using a large number of species in the future. Overall, the current study provides insight into chloroplast genome evolution in Phyllanthaceae.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.