Chaochao Yan scite author profile

The transition of terrestrial snakes to marine life approximately 10 million years ago (Ma) is ideal for exploring adaptive evolution. Sea snakes possess phenotypic specializations including laterally compressed bodies, paddle-shaped tails, valvular nostrils, cutaneous respiration, elongated lungs and salt glands, yet knowledge of the genetic underpinnings of the transition remains limited. Herein, we report the first genome of Shaw’s sea snake (Hydrophis curtus) and use it to investigate secondary marine adaptation in sea snakes. A hybrid assembly strategy yields a high-quality genome. Gene family analyses date a pulsed coding-gene expansion to about 20 Ma, and these genes associate strongly with adaptations to marine environments. Analyses of selection pressure and convergent evolution reveal rapid evolution of protein-coding genes and some convergent features. Additionally, 108 conserved non-coding elements appear to have evolved quickly, and these may underpin the phenotypic changes. Transposable elements may contribute to adaptive specializations by inserting into genomic regions around functionally related coding genes. The integration of genomic and transcriptomic analyses indicates independent origins and different components in sea snake and terrestrial snake venom; the venom gland of the sea snake harbours the highest PLA2 expression (17.23%) among the selected elapids, and these genes may be organized in tandem in the genome. These analyses provide insights into the genetic mechanisms that underlie the secondary adaptation to marine life and the venom production of this sea snake.


Distinct patterns of simple sequence repeats and GC distribution in intragenic and intergenic regions of primate genomes

Yan

et al. 2016

Aging


As the first systematic examination of simple sequence repeats (SSRs) and guanine-cytosine (GC) distribution in intragenic and intergenic regions of ten primates, our study showed that SSRs and GC displayed nonrandom distribution in both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation. Our results suggest that the majority of SSRs are distributed in non-coding regions, such as introns, transposable elements (TEs), and intergenic regions. In these primates, trinucleotide perfect (P) SSRs were the most abundant repeat type in the 5′UTRs and CDSs, whereas mononucleotide P-SSRs were the most abundant in the introns, 3′UTRs, TEs, and intergenic regions. The GC content varied greatly among different intragenic and intergenic regions: 5′UTRs > CDSs > 3′UTRs > TEs > introns > intergenic regions, and high GC content was frequently found in exon-rich regions. Our results also showed that, within the same intragenic or intergenic region, the distribution of GC content was highly similar across the different primates. Tri- and hexanucleotide P-SSRs had the highest GC content in the 5′UTRs and CDSs, whereas mononucleotide P-SSRs had the lowest GC content in the six genomic regions of these primates. The most frequent motifs of each length varied markedly among the different genomic regions.


The draft genome sequence of forest musk deer (Moschus berezovskii)

Fan

Jin

et al. 2018


Background: The forest musk deer, Moschus berezovskii, is one of seven musk deer (Moschus spp.) and is distributed in Southwest China. Like other musk deer, the forest musk deer has traditionally been, and still is, hunted for its musk (i.e., for the global perfume industry). Considerable hunting pressure and habitat loss have caused significant population declines. Consequently, the Chinese government commenced captive breeding programs for musk harvesting in the 1950s. However, the prevalence of fatal diseases is considerably restricting population increases. Disease severity and extent are exacerbated by inbreeding and declining genetic diversity in captive musk deer populations. Knowledge of the immune system and genome of captive and wild forest musk deer populations is essential for improving their physical and genetic health. We have thus sequenced the whole genome of the forest musk deer, completed the genomic assembly and annotation, and performed preliminary bioinformatic analyses.

Findings: A total of 407 Gb of raw reads from whole-genome sequencing were generated using the Illumina HiSeq 4000 platform. The final genome assembly is around 2.72 Gb, with a contig N50 length of 22.6 kb and a scaffold N50 length of 2.85 Mb. We identified 24,352 genes and found that 42.05% of the genome is composed of repetitive elements. We also detected 1,236 olfactory receptor genes. The genome-wide phylogenetic tree indicated that the forest musk deer was within the order Artiodactyla, and it appeared as the sister clade of four members of Bovidae. In total, 576 genes were under positive selection in the forest musk deer lineage.

Conclusions: We provide the first genome sequence and gene annotation for the forest musk deer. The availability of these resources will be very useful for the conservation and captive breeding of this endangered and economically important species and for reconstructing the evolutionary history of the order Artiodactyla.


The complete mitochondrial genome of Accipiter virgatus and evolutionary history of the pseudo-control regions in Falconiformes

Song

Huang

Yan

et al. 2015

Biochemical Systematics and Ecology


Distribution patterns and variation analysis of simple sequence repeats in different genomic regions of bovid genomes

Jiang

Yan

et al. 2018

Sci Rep


In this first examination of the distribution, guanine-cytosine (GC) pattern, and variation of microsatellites (simple sequence repeats, SSRs) in different genomic regions of six bovid species, SSRs displayed nonrandom distribution across regions. SSR abundances are much higher in the introns, transposable elements (TEs), and intergenic regions than in the 3′-untranslated regions (3′UTRs), 5′UTRs and coding regions. Trinucleotide perfect SSRs (P-SSRs) were the most frequent in the coding regions, whereas mononucleotide P-SSRs were the most frequent in the introns, 3′UTRs, TEs, and intergenic regions. Trifold P-SSRs had higher GC content in the 5′UTRs and coding regions than in the introns, 3′UTRs, TEs, and intergenic regions, whereas mononucleotide P-SSRs had the lowest GC content in all genomic regions. The repeat copy numbers (RCNs) of the same mono- to hexanucleotide P-SSRs showed significantly different distributions in different regions (P < 0.01). Except in the coding regions, mononucleotide P-SSRs had the highest RCNs, followed in order by di- > tri- > tetra- > penta- > hexanucleotide P-SSRs in the same regions. Analysis of the coefficient of variability (CV) of SSRs showed that the CV of the RCN of the same mono- to hexanucleotide SSRs was relatively high in the intronic and intergenic regions, intermediate in the TEs, and relatively low in the 5′UTRs, 3′UTRs, and coding regions. Broad analysis of SSRs across different genomic regions has helped to reveal the biological significance of their distributions.


MEANGS: an efficient seed-free tool for de novo assembling animal mitochondrial genome using whole genome NGS data

Song

Yan

2021


Advances in next-generation sequencing (NGS) technologies have led to an exponential increase in the number of whole genome sequences (WGS) in databases. This wealth of WGS data has greatly facilitated the recovery of full mitochondrial genomes (mitogenomes), which are vital for phylogenetic, evolutionary and ecological studies. Unfortunately, most existing software cannot assemble mitogenome reference sequences conveniently or efficiently. Therefore, we developed a seed-free de novo assembly tool, MEANGS, which applies the trie-search method to extend contigs from self-discovered seeds and assemble a mitogenome from animal WGS data. We then used data of varying quality from 16 species to compare the performance of MEANGS with three other available programs. MEANGS exhibited the best overall performance: it was the only program that completed all tests, and it assembled full or partial mitogenomes for all of the tested samples, whereas the others failed on some. Furthermore, MEANGS selects superior assembly sequences and annotates protein-coding genes. Thus, MEANGS may be one of the most efficient tools so far for generating high-quality mitogenomes, and its further use will benefit mitogenome studies based on whole genome NGS data. MEANGS is available at https://github.com/YanCCscu/meangs.


Complete mitochondrial genome sequence of Nectogale elegans

Huang

Yan

Tan

et al. 2013

Mitochondrial DNA


The elegant water shrew (Nectogale elegans) belongs to the family Soricidae and is distributed in northern South Asia, central and southern China and northern Southeast Asia. In this study, the complete mitochondrial genome of N. elegans was sequenced. It was determined to be 17,460 bases long and to include 13 protein-coding genes (PCGs), 22 tRNA genes, 2 ribosomal RNA genes and one non-coding region, which is similar to other mammalian mitochondrial genomes. Bayesian inference and maximum likelihood methods were used to construct phylogenetic trees based on 12 concatenated heavy-strand PCGs. Phylogenetic analyses further confirmed that Crocidurinae diverged prior to Soricinae, and that Sorex unguiculatus diverged earlier than N. elegans.



Chaochao Yan

Studies on the flavones using liquid chromatography–electrospray ionization tandem mass spectrometry

The genome of Shaw’s sea snake (Hydrophis curtus) reveals secondary adaptation to its marine environment
