Adhesive secretion has a fundamental role in barnacles’ survival, keeping them in an adequate position on the substrate under a variety of hydrologic regimes. It arouses special interest for industrial applications, such as antifouling strategies, underwater industrial and surgical glues, and dental composites. This study was focused on the goose barnacle Pollicipes pollicipes adhesion system, a species that lives in the Eastern Atlantic strongly exposed intertidal rocky shores and cliffs. The protein composition of P. pollicipes cement multicomplex and cement gland was quantitatively studied using a label-free LC-MS high-throughput proteomic analysis, searched against a custom transcriptome-derived database. Overall, 11,755 peptide sequences were identified in the gland while 2880 peptide sequences were detected in the cement, clustered in 1616 and 1568 protein groups, respectively. The gland proteome was dominated by proteins of the muscle, cytoskeleton, and some uncharacterized proteins, while the cement was, for the first time, reported to be composed by nearly 50% of proteins that are not canonical cement proteins, mainly unannotated proteins, chemical cues, and protease inhibitors, among others. Bulk adhesive proteins accounted for one-third of the cement proteome, with CP52k being the most abundant. Some unannotated proteins highly expressed in the proteomes, as well as at the transcriptomic level, showed similar physicochemical properties to the known surface-coupling barnacle adhesive proteins while the function of the others remains to be discovered. New quantitative and qualitative clues are provided to understand the diversity and function of proteins in the cement of stalked barnacles, contributing to the whole adhesion model in Cirripedia.
The cytochrome P450 (CYP) superfamily defends organisms from endogenous and noxious environmental compounds, and thus is crucial for survival. However, beyond mammals the molecular evolution of CYP2 subfamilies is poorly understood. Here, we characterized the CYP2 family across 48 avian whole genomes representing all major extant bird clades. Overall, 12 CYP2 subfamilies were identified, including the first description of the CYP2F, CYP2G, and several CYP2AF genes in avian genomes. Some of the CYP2 genes previously described as being lineage-specific, such as CYP2K and CYP2W, are ubiquitous to all avian groups. Furthermore, we identified a large number of CYP2J copies, which have been associated previously with water reabsorption. We detected positive selection in the avian CYP2C, CYP2D, CYP2H, CYP2J, CYP2K, and CYP2AC subfamilies. Moreover, we identified new substrate recognition sites (SRS0, SRS2_SRS3, and SRS3.1) and heme binding areas that influence CYP2 structure and function of functional importance as under significant positive selection. Some of the positively selected sites in avian CYP2D are located within the same SRS1 region that was previously linked with the metabolism of plant toxins. Additionally, we find that selective constraint variations in some avian CYP2 subfamilies are consistently associated with different feeding habits (CYP2H and CYP2J), habitats (CYP2D, CYP2H, CYP2J, and CYP2K), and migratory behaviors (CYP2D, CYP2H, and CYP2J). Overall, our findings indicate that there has been active enzyme site selection on CYP2 subfamilies and differential selection associated with different life history traits among birds.
Mitochondrial protein-coding genes (mt genes) encode subunits forming complexes of crucial cellular pathways, including those involved in the vital process of oxidative phosphorylation (OXPHOS). Despite the vital role of the mitochondrial genome (mt genome) in the survival of organisms, little is known with respect to its adaptive implications within marine invertebrates. The molluscan Class Cephalopoda is represented by a marine group of species known to occupy contrasting environments ranging from the intertidal to the deep sea, having distinct metabolic requirements, varied body shapes and highly advanced visual and nervous systems that make them highly competitive and successful worldwide predators. Thus, cephalopods are valuable models for testing natural selection acting on their mitochondrial subunits (mt subunits). Here, we used concatenated mt genes from 17 fully sequenced mt genomes of diverse cephalopod species to generate a robust mitochondrial phylogeny for the Class Cephalopoda. We followed an integrative approach considering several branches of interest–covering cephalopods with distinct morphologies, metabolic rates and habitats–to identify sites under positive selection and localize them in the respective protein alignment and/or tridimensional structure of the mt subunits. Our results revealed significant adaptive variation in several mt subunits involved in the energy production pathway of cephalopods: ND5 and ND6 from Complex I, CYTB from Complex III, COX2 and COX3 from Complex IV, and in ATP8 from Complex V. Furthermore, we identified relevant sites involved in protein-interactions, lining proton translocation channels, as well as disease/deficiencies related sites in the aforementioned complexes. A particular case, revealed by this study, is the involvement of some positively selected sites, found in Octopoda lineage in lining proton translocation channels (site 74 from ND5) and in interactions between subunits (site 507 from ND5) of Complex I.
Among the major goals of research in evolutionary biology are the identification of genes targeted by natural selection and understanding how various regimes of evolution affect the fitness of an organism. In particular, adaptive evolution enables organisms to adapt to changing ecological factors such as diet, temperature, habitat, predatory pressures and prey abundance. An integrative approach is crucial for the identification of non-synonymous mutations that introduce radical changes in protein biochemistry and thus in turn influence the structure and function of proteins. Performing such analyses manually is often a time-consuming process, due to the large number of statistical files generated from multiple approaches, especially when assessing numerous taxa and/or large datasets. We present IMPACT_S, an easy-to-use Graphical User Interface (GUI) software, which rapidly and effectively integrates, filters and combines results from three widely used programs for assessing the influence of selection: Codeml (PAML package), Datamonkey and TreeSAAP. It enables the identification and tabulation of sites detected by these programs as evolving under the influence of positive, neutral and/or negative selection in protein-coding genes. IMPACT_S further facilitates the automatic mapping of these sites onto the three-dimensional structures of proteins. Other useful tools incorporated in IMPACT_S include Jmol, Archaeopteryx, Gnuplot, PhyML, a built-in Swiss-Model interface and a PDB downloader. The relevance and functionality of IMPACT_S is shown through a case study on the toxicoferan-reptilian Cysteine-rich Secretory Proteins (CRiSPs). IMPACT_S is a platform-independent software released under GPLv3 license, freely available online from http://impact-s.sourceforge.net.
The vertebrate toll-like receptor (TLRs) supergene family is a first-line immune defense against viral and non-viral pathogens. Here, comparative evolutionary-genomics of 79 vertebrate species (8 mammals, 48 birds, 11 reptiles, 1 amphibian, and 11 fishes) revealed differential gain/loss of 26 TLRs, including 6 (TLR3, TLR7, TLR8, TLR14, TLR21, and TLR22) that originated early in vertebrate evolution before the diversification of Agnatha and Gnathostomata. Subsequent dynamic gene gain/loss led to lineage-specific diversification with TLR repertoires ranging from 8 subfamilies in birds to 20 in fishes. Lineage-specific loss of TLR8-9 and TLR13 in birds and gains of TLR6 and TLR10-12 in mammals and TLR19-20 and TLR23-27 in fishes. Among avian species, 5–10% of the sites were under positive selection (PS) (omega 1.5–2.5) with radical amino-acid changes likely affecting TLR structure/functionality. In non-viral TLR4 the 20 PS sites (posterior probability PP > 0.99) likely increased ability to cope with diversified ligands (e.g., lipopolysaccharide and lipoteichoic). For viral TLR7, 23 PS sites (PP > 0.99) possibly improved recognition of highly variable viral ssRNAs. Rapid evolution of the TLR supergene family reflects the host–pathogen arms race and the coevolution of ligands/receptors, which follows the premise that birds have been important vectors of zoonotic pathogens and reservoirs for viruses.
BackgroundUncovering how phenotypic diversity arises and is maintained in nature has long been a major interest of evolutionary biologists. Recent advances in genome sequencing technologies have remarkably increased the efficiency to pinpoint genes involved in the adaptive evolution of phenotypes. Reliability of such findings is most often examined with statistical and computational methods using Maximum Likelihood codon-based models (i.e., site, branch, branch-site and clade models), such as those available in codeml from the Phylogenetic Analysis by Maximum Likelihood (PAML) package. While these models represent a well-defined workflow for documenting adaptive evolution, in practice they can be challenging for researchers having a vast amount of data, as multiple types of relevant codon-based datasets are generated, making the overall process hard and tedious to handle, error-prone and time-consuming.ResultsWe introduce LMAP (Lightweight Multigene Analyses in PAML), a user-friendly command-line and interactive package, designed to handle the codeml workflow, namely: directory organization, execution, results gathering and organization for Likelihood Ratio Test estimations with minimal manual user intervention. LMAP was developed for the workstation multi-core environment and provides a unique advantage for processing one, or more, if not all codeml codon-based models for multiple datasets at a time. Our software, proved efficiency throughout the codeml workflow, including, but not limited, to simultaneously handling more than 20 datasets.ConclusionsWe have developed a simple and versatile LMAP package, with outstanding performance, enabling researchers to analyze multiple different codon-based datasets in a high-throughput fashion. At minimum, two file types are required within a single input directory: one for the multiple sequence alignment and another for the phylogenetic tree. To our knowledge, no other software combines all codeml codon substitution models of adaptive evolution. LMAP has been developed as an open-source package, allowing its integration into more complex open-source bioinformatics pipelines. LMAP package is released under GPLv3 license and is freely available at http://lmapaml.sourceforge.net/.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-1204-5) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.