The rich fossil record of equids has made them a model for evolutionary processes. Here we present a 1.12-times coverage draft genome from a horse bone recovered from permafrost dated to approximately 560-780 thousand years before present (kyr BP). Our data represent the oldest full genome sequence determined so far by almost an order of magnitude. For comparison, we sequenced the genome of a Late Pleistocene horse (43 kyr BP), and modern genomes of five domestic horse breeds (Equus ferus caballus), a Przewalski's horse (E. f. przewalskii) and a donkey (E. asinus). Our analyses suggest that the Equus lineage giving rise to all contemporary horses, zebras and donkeys originated 4.0-4.5 million years before present (Myr BP), twice the conventionally accepted time to the most recent common ancestor of the genus Equus. We also find that horse population size fluctuated multiple times over the past 2 Myr, particularly during periods of severe climatic changes. We estimate that the Przewalski's and domestic horse populations diverged 38-72 kyr BP, and find no evidence of recent admixture between the domestic horse breeds and the Przewalski's horse investigated. This supports the contention that Przewalski's horses represent the last surviving wild horse population. We find similar levels of genetic variation among Przewalski's and domestic populations, indicating that the former are genetically viable and worthy of conservation efforts. We also find evidence for continuous selection on the immune system and olfaction throughout horse evolution. Finally, we identify 29 genomic regions among horse breeds that deviate from neutrality and show low levels of genetic variation compared to the Przewalski's horse. Such regions could correspond to loci selected early during domestication.
Calcified dental plaque (dental calculus) preserves for millennia and entraps biomolecules from all domains of life and viruses. We report the first high-resolution taxonomic and protein functional characterization of the ancient oral microbiome and demonstrate that the oral cavity has long served as a reservoir for bacteria implicated in both local and systemic disease. We characterize: (i) the ancient oral microbiome in a diseased state, (ii) 40 opportunistic pathogens, (iii) the first evidence of ancient human-associated putative antibiotic resistance genes, (iv) a genome reconstruction of the periodontal pathogen Tannerella forsythia, (v) 239 bacterial and 43 human proteins, allowing confirmation of a long-term association between host immune factors, “red-complex” pathogens, and periodontal disease, and (vi) DNA sequences matching dietary sources. Directly datable and nearly ubiquitous, dental calculus permits the simultaneous investigation of pathogen activity, host immunity, and diet, thereby extending the direct investigation of common diseases into the human evolutionary past.
Lysine acetylation is a major posttranslational modification involved in a broad array of physiological functions. Here, we provide an organ-wide map of lysine acetylation sites from 16 rat tissues analyzed by high-resolution tandem mass spectrometry. We quantify 15,474 modification sites on 4,541 proteins and provide the data set as a web-based database. We demonstrate that lysine acetylation displays site-specific sequence motifs that diverge between cellular compartments, with a significant fraction of nuclear sites conforming to the consensus motifs G-AcK and AcK-P. Our data set reveals that the subcellular acetylation distribution is tissue-type dependent and that acetylation targets tissue-specific pathways involved in fundamental physiological processes. We compare lysine acetylation patterns for rat as well as human skeletal muscle biopsies and demonstrate its general involvement in muscle contraction. Furthermore, we illustrate that acetylation of fructose-bisphosphate aldolase and glycerol-3-phosphate dehydrogenase serves as a cellular mechanism to switch off enzymatic activity.
This study investigates the challenge of comprehensively cataloging the complete human proteome from a single cell type using mass spectrometry (MS)-based shotgun proteomics. We modify a classical two-dimensional high-resolution reversed-phase peptide fractionation scheme and optimize a protocol that provides sufficient peak capacity to saturate the sequencing speed of modern MS instruments. This strategy enables the deepest proteome of a single human cell type to date, with the HeLa proteome sequenced to a depth of ∼584,000 unique peptide sequences and ∼14,200 protein isoforms (∼12,200 protein-coding genes). This depth is comparable with next-generation RNA sequencing and enables the identification of post-translational modifications, including ∼7,000 N-acetylation sites and ∼10,000 phosphorylation sites, without the need for enrichment. We further demonstrate the general applicability and clinical potential of this proteomics strategy by comprehensively quantifying global proteome expression in several different human cancer cell lines and patient tissue samples.
Quantitative phosphoproteomics has transformed investigations of cell signaling, but it remains challenging to scale the technology for high-throughput analyses. Here we report a rapid and reproducible approach to analyze hundreds of phosphoproteomes using data-independent acquisition (DIA) with an accurate site localization score incorporated into Spectronaut. DIA-based phosphoproteomics achieves an order of magnitude broader dynamic range, higher reproducibility of identification, and improved sensitivity and accuracy of quantification compared to state-of-the-art data-dependent acquisition (DDA)-based phosphoproteomics. Notably, direct DIA without the need for spectral libraries performs close to analyses using project-specific libraries, quantifying >20,000 phosphopeptides in a 15 min single-shot LC-MS analysis per condition. Adaptation of a 3D multiple regression model-based algorithm enables global determination of phosphorylation site stoichiometry in DIA. Scalability of the DIA approach is demonstrated by systematically analyzing the effects of thirty kinase inhibitors in the context of epidermal growth factor (EGF) signaling, showing that specific protein kinases mediate EGF-dependent phospho-regulation.
Ubiquitination is a post-translational modification (PTM) that is essential for balancing numerous physiological processes. To enable delineation of protein ubiquitination at a site-specific level, we generated an antibody, denoted UbiSite, recognizing the C-terminal 13 amino acids of ubiquitin, which remain attached to modified peptides after proteolytic digestion with the endoproteinase LysC. Notably, UbiSite is specific to ubiquitin. Furthermore, besides ubiquitination on lysine residues, protein N-terminal ubiquitination is readily detected as well. By combining UbiSite enrichment with sequential LysC and trypsin digestion and high-accuracy MS, we identified over 63,000 unique ubiquitination sites on 9,200 proteins in two human cell lines. In addition to uncovering widespread involvement of this PTM in all cellular aspects, the analyses reveal an inverse association between protein N-terminal ubiquitination and acetylation, as well as a complete lack of correlation between changes in protein abundance and alterations in ubiquitination sites upon proteasome inhibition.
Comprehensive mass spectrometry (MS)-based proteomics is now feasible, but reproducible quantification remains challenging, especially for post-translational modifications such as phosphorylation. Here, we compare the most popular quantification techniques for global phosphoproteomics: label-free quantification (LFQ), stable isotope labeling by amino acids in cell culture (SILAC) and MS2- and MS3-measured tandem mass tags (TMT). In a mixed species comparison with fixed phosphopeptide ratios, we find LFQ and SILAC to be the most accurate techniques. MS2-based TMT yields the highest precision but lowest accuracy due to ratio compression, which MS3-based TMT can partly rescue. However, MS2-based TMT outperforms MS3-based TMT when analyzing phosphoproteome changes in the DNA damage response, since its higher precision and larger identification numbers allow detection of a greater number of significantly regulated phosphopeptides. Finally, we utilize the TMT multiplexing capabilities to develop an algorithm for determining phosphorylation site stoichiometry, showing that such applications benefit from the high accuracy of MS3-based TMT.
No large group of recently extinct placental mammals remains as evolutionarily cryptic as the approximately 280 genera grouped as 'South American native ungulates'. To Charles Darwin, who first collected their remains, they included perhaps the 'strangest animal[s] ever discovered'. Today, much like 180 years ago, it is no clearer whether they had one origin or several, arose before or after the Cretaceous/Palaeogene transition 66.2 million years ago, or are more likely to belong with the elephants and sirenians of superorder Afrotheria than with the euungulates (cattle, horses, and allies) of superorder Laurasiatheria. Morphology-based analyses have proved unconvincing because convergences are pervasive among unrelated ungulate-like placentals. Approaches using ancient DNA have also been unsuccessful, probably because of rapid DNA degradation in semitropical and temperate deposits. Here we apply proteomic analysis to screen bone samples of the Late Quaternary South American native ungulate taxa Toxodon (Notoungulata) and Macrauchenia (Litopterna) for phylogenetically informative protein sequences. For each ungulate, we obtain approximately 90% direct sequence coverage of type I collagen α1- and α2-chains, representing approximately 900 of 1,140 amino-acid residues for each subunit. A phylogeny is estimated from an alignment of these fossil sequences with collagen (I) gene transcripts from available mammalian genomes or mass spectrometrically derived sequence data obtained for this study. The resulting consensus tree agrees well with recent higher-level mammalian phylogenies. Toxodon and Macrauchenia form a monophyletic group whose sister taxon is not Afrotheria or any of its constituent clades as recently claimed, but instead crown Perissodactyla (horses, tapirs, and rhinoceroses). These results are consistent with the origin of at least some South American native ungulates from 'condylarths', a paraphyletic assembly of archaic placentals. 
With ongoing improvements in instrumentation and analytical procedures, proteomics may produce a revolution in systematics such as that achieved by genomics, but with the possibility of reaching much further back in time.