The decrease in sequencing cost and increased sophistication of assembly algorithms for short-read platforms has resulted in a sharp increase in the number of species with genome assemblies. However, these assemblies are highly fragmented, with many gaps, ambiguities, and errors, impeding downstream applications. We demonstrate current state of the art for de novo assembly using the domestic goat (Capra hircus), based on long reads for contig formation, short reads for consensus validation, and scaffolding by optical and chromatin interaction mapping. These combined technologies produced the most continuous de novo mammalian assembly to date, with chromosome-length scaffolds and only 649 gaps. Our assembly represents a ~400-fold improvement in continuity due to properly assembled gaps compared to the previously published C. hircus assembly, and better resolves repetitive structures longer than 1 kb, representing the largest repeat family and immune gene complex ever produced for an individual of a ruminant species.
Background Major advances in selection progress for cattle have been made following the introduction of genomic tools over the past 10–12 years. These tools depend upon the Bos taurus reference genome (UMD3.1.1), which was created using now-outdated technologies and is hindered by a variety of deficiencies and inaccuracies. Results We present the new reference genome for cattle, ARS-UCD1.2, based on the same animal as the original to facilitate transfer and interpretation of results obtained from the earlier version, but applying a combination of modern technologies in a de novo assembly to increase continuity, accuracy, and completeness. The assembly includes 2.7 Gb and is >250× more continuous than the original assembly, with contig N50 >25 Mb and L50 of 32. We also greatly expanded supporting RNA-based data for annotation that identifies 30,396 total genes (21,039 protein coding). The new reference assembly is accessible in annotated form for public use. Conclusions We demonstrate that improved continuity of assembled sequence warrants the adoption of ARS-UCD1.2 as the new cattle reference genome and that increased assembly accuracy will benefit future research on this species.
Human killer cell immunoglobulin-like receptors (KIRs) are distinguished by expansion of activating KIR2DS, whose ligands and functions remain poorly understood. The oldest, most prevalent KIR2DS is KIR2DS4, which is represented by a variable balance between “full-length” and “deleted” forms. We find that full-length 2DS4 is a human histocompatibility leukocyte antigen (HLA) class I receptor that binds specifically to subsets of C1+ and C2+ HLA-C and to HLA-A*11, whereas deleted 2DS4 is nonfunctional. Activation of 2DS4+ NKL cells was achieved with A*1102 as ligand, which differs from A*1101 by unique substitution of lysine 19 for glutamate, but not with A*1101 or HLA-C. Distinguishing KIR2DS4 from other KIR2DS is the proline–valine motif at positions 71–72, which is shared with KIR3DL2 and was introduced by gene conversion before separation of the human and chimpanzee lineages. Site-directed swap mutagenesis shows that these two residues are largely responsible for the unique HLA class I specificity of KIR2DS4. Determination of the crystallographic structure of KIR2DS4 shows two major differences from KIR2DL: displacement of contact loop L2 and altered bonding potential because of the substitutions at positions 71 and 72. Correlation between the worldwide distributions of functional KIR2DS4 and HLA-A*11 points to the physiological importance of their mutual interaction.
SARS Coronavirus 2 (SARS-CoV-2) emerged in late 2019, leading to the Coronavirus Disease 2019 (COVID-19) pandemic that continues to cause significant global mortality in human populations. Given its sequence similarity to SARS-CoV, as well as related coronaviruses circulating in bats, SARS-CoV-2 is thought to have originated in Chiroptera species in China. However, whether the virus spread directly to humans or through an intermediate host is currently unclear, as is the potential for this virus to infect companion animals, livestock, and wildlife that could act as viral reservoirs. Using a combination of surrogate entry assays and live virus, we demonstrate that, in addition to human angiotensin-converting enzyme 2 (ACE2), the Spike glycoprotein of SARS-CoV-2 has a broad host tropism for mammalian ACE2 receptors, despite divergence in the amino acids at the Spike receptor binding site on these proteins. Of the 22 different hosts we investigated, ACE2 proteins from dog, cat, and cattle were the most permissive to SARS-CoV-2, while bat and bird ACE2 proteins were the least efficiently used receptors. The absence of a significant tropism for any of the 3 genetically distinct bat ACE2 proteins we examined indicates that SARS-CoV-2 receptor usage likely shifted during zoonotic transmission from bats into people, possibly in an intermediate reservoir. Comparison of SARS-CoV-2 receptor usage to the related coronaviruses SARS-CoV and RaTG13 identified distinct tropisms, with the 2 human viruses being more closely aligned. Finally, using bioinformatics, structural data, and targeted mutagenesis, we identified amino acid residues within the Spike–ACE2 interface, which may have played a pivotal role in the emergence of SARS-CoV-2 in humans. The apparently broad tropism of SARS-CoV-2 at the point of viral entry confirms the potential risk of infection to a wide range of companion animals, livestock, and wildlife.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.