A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified 83 Mb of sequence not found in the cattle reference genome, representing 3.1% novel sequence compared with the 2.71-Gb reference. A catalog of structural variants developed from this cattle population identified 3.3 million deletions, 0.12 million inversions, and 0.18 million duplications. Estimates of breed ancestry and hybridization between cattle breeds using insertion/deletions as markers were similar to those produced by single nucleotide polymorphism–based analysis. Hundreds of deletions were observed to have stratification based on subspecies and breed. For example, an insertion of a Bov-tA1 repeat element was identified in the first intron of the APPL2 gene and correlated with cattle breed geographic distribution. This insertion falls within a segment overlapping predicted enhancer and promoter regions of the gene, and could affect important traits such as immune response, olfactory functions, cell proliferation, and glucose metabolism in muscle. The results indicate that pangenomes are a valuable resource for studying diversity and evolutionary history, and help to delineate how domestication, trait-based breeding, and adaptive introgression have shaped the cattle genome.
As an important source of genomic variation, copy number variation (CNV) contributes to environmental adaptation in worldwide buffaloes. Despite this importance, CNV divergence between swamp buffaloes and river buffaloes has not been studied previously. Here, we report 21 152 CNV regions (CNVRs) in 141 buffaloes of 20 breeds detected through multiple CNV calling strategies. Only 248 CNVRs were shared between river buffalo and swamp buffalo, reflecting great variation of CNVRs between the two subspecies. Population structure analysis based on CNVs successfully separated the two buffalo subspecies. We further assessed CNV divergence by calculating FST for genome‐wide CNVs. Totally, we identified 110 significantly divergent CNV segments and 44 putatively selected genes between river buffaloes and swamp buffaloes. In particular, LALBA, a key gene controlling milk production in cattle, presented a highly differentiated CNV in the promoter region, which makes it a strong functional candidate gene for differences between swamp buffaloes and river buffaloes in traits related to milk production. Our study provides useful information of CNVs in buffaloes, which may help explain the genetic differences between the two subspecies.
Other than the commonly known linear DNA in the eukaryotic nucleus, DNA can have a circular form that is similar to mitochondrial DNA (Saccone et al., 2000;Sakamoto & Takami, 2018). This was named extrachromosomal circular DNA (eccDNA; Liao et al., 2020). The eccDNA was first reported in mammals and plants as early as 1965(Hotta & Bassel, 1965. To date, the existence of eccDNA has been observed in many eukaryotes (
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.