RNA-Seq Data for Reliable SNP Detection and Genotype Calling: Interest for Coding Variant Characterization and Cis-Regulation Analysis by Allele-Specific Expression in Livestock Species
Abstract:In addition to their common usages to study gene expression, RNA-seq data accumulated over the last 10 years are a yet-unexploited resource of SNPs in numerous individuals from different populations. SNP detection by RNA-seq is particularly interesting for livestock species since whole genome sequencing is expensive and exome sequencing tools are unavailable. These SNPs detected in expressed regions can be used to characterize variants affecting protein functions, and to study cis-regulated genes by analyzing … Show more
“…Therefore, we reasoned that patient genotyping from mRNA could better reflect the ACE2 variants expressed in those patients. In fact, previous studies have shown that SNPs could be detected with high precision in transcriptome sequencing approaches as compared to DNA-seq procedures (58, 59). This has led to the emergence of transcriptome or RNA sequencing as a potential alternative approach to variant detection within protein coding regions, since the transcriptome of a given tissue represents a quasi-complete set of transcribed genes (mRNAs) and other noncoding RNAs which also bypasses the need for exome enrichment (60, 61).…”
Coronavirus disease 19 (COVID-19) is a persistent global pandemic with a very heterogeneous disease presentation ranging from a mild disease to dismal prognosis. Early detection of sensitivity and severity of COVID-19 is essential for the development of new treatments. In the present study, we measured the levels of circulating growth differentiation factor 15 (GDF15) and angiotensin-converting enzyme 2 (ACE2) in plasma of severity-stratified COVID-19 patients and healthy control patients and characterized the in vitro effects and cohort frequency of ACE2 SNPs. Our results show that while circulating GDF15 and ACE2 stratify COVID-19 patients according to disease severity, ACE2 missense SNPs constitute a risk factor linked to infection susceptibility.
“…Therefore, we reasoned that patient genotyping from mRNA could better reflect the ACE2 variants expressed in those patients. In fact, previous studies have shown that SNPs could be detected with high precision in transcriptome sequencing approaches as compared to DNA-seq procedures (58, 59). This has led to the emergence of transcriptome or RNA sequencing as a potential alternative approach to variant detection within protein coding regions, since the transcriptome of a given tissue represents a quasi-complete set of transcribed genes (mRNAs) and other noncoding RNAs which also bypasses the need for exome enrichment (60, 61).…”
Coronavirus disease 19 (COVID-19) is a persistent global pandemic with a very heterogeneous disease presentation ranging from a mild disease to dismal prognosis. Early detection of sensitivity and severity of COVID-19 is essential for the development of new treatments. In the present study, we measured the levels of circulating growth differentiation factor 15 (GDF15) and angiotensin-converting enzyme 2 (ACE2) in plasma of severity-stratified COVID-19 patients and healthy control patients and characterized the in vitro effects and cohort frequency of ACE2 SNPs. Our results show that while circulating GDF15 and ACE2 stratify COVID-19 patients according to disease severity, ACE2 missense SNPs constitute a risk factor linked to infection susceptibility.
“…Data generated have been deposited in NCBI database: PRJNA766745. The male and female gonad RNAseq data from previous reports include E4.5 and E6 RNA-seq: PRJNA171809 [ 29 ], E10, E12 and E14 RNA-seq: PRJEB26695 [ 30 ].…”
Background
Birds exhibit a unique asymmetry in terms of gonad development. The female left gonad generates a functional ovary, whereas the right gonad regresses. In males, both left and right gonads would develop into testes. How is this left/right asymmetry established only in females but not in males remains unknown. The epigenetic regulation of gonadal developmental genes may contribute to this sex disparity. The modification of histone tails such as H3K27ac is tightly coupled to chromatin activation and gene expression. To explore whether H3K27ac marked chromatin activation is involved in the asymmetric development of avian gonads, we probed genome-wide H3K27ac occupancy in left and right gonads from both sexes and related chromatin activity profile to the expression of gonadal genes. Furthermore, we validated the effect of chromatin activity on asymmetric gonadal development by manipulating the chromatin histone acetylation levels.
Methods
The undifferentiated gonads from both sides of each sex were collected and subjected to RNA-Seq and H3K27ac ChIP-Seq experiments. Integrated analysis of gene expression and active chromatin regions were performed to identify the sex- and situs-specific regulation and expression of gonadal genes. The histone deacetylase inhibitor trichostatin A (TSA) was applied to the undifferentiated female right gonads to assess the effect of chromatin activation on gonadal gene expression and cell proliferation.
Results
Even before sex differentiation, the gonads already show divergent gene expression between different sexes and between left/right sides in females. The sex-specific H3K27ac chromatin distributions coincide with the higher expression of male/female specification genes in each sex. Unexpectedly, the H3K27ac marked chromatin activation show a dramatic difference between left and right gonads in both sexes, although the left/right asymmetric gonadal development was observed only in females but not in males. In females, the side-specific H3K27ac occupancy instructs the differential expression of developmental genes between the pair of gonads and contributes to the development of left but not right gonad. However, in males, the left/right discrepancy of H3K27ac chromatin distribution does not drive the side-biased gene expression or gonad development. The TSA-induced retention of chromatin acetylation causes up-regulation of ovarian developmental genes and increases cell proliferation in the female right gonad.
Conclusions
We revealed that left/right asymmetry in H3K27ac marked chromatin activation exists in both sexes, but this discrepancy gives rise to asymmetric gonadal development only in females. Other mechanisms overriding the chromatin activation would control the symmetric development of male gonads in chicken.
“…The 3,276,615 SNPs analyzed in this study have been detected following the method presented in the companion paper (Jehl et al, 2021) using 767 multi-tissue RNA-seq of 382 birds from 11 chicken populations (see Additional File 1). This SNP set corresponds to the union of the SNPs with reliable genotypes found in each population (list available on http://www.…”
Section: Snp Datasetmentioning
confidence: 99%
“…Using 3.3M SNPs previously detected from 767 multi-tissue RNA-seq of 382 animals from 11 chicken populations and therefore enriched in coding regions [see the companion paper (Jehl et al, 2021), section "Materials and Methods"], we identified 260,919 unique SNPs in 26,702 transcripts corresponding to 15,835 genes out of 19,545 protein-coding genes (Figure 2, right part-in yellow).…”
Section: Read-based Phasing For Identification Of Mnvsmentioning
confidence: 99%
“…Considering this aim, we used 9.5M SNPs recently detected in 382 chickens from 767 multi-tissue RNA-seq, enriched by construction in expressed regions and therefore in protein-coding regions. From this 9.5M SNPs, we focused on the 3.3M SNPs with reliable genotypes [see the companion paper (Jehl et al, 2021)]. MNV identification requires properly phased variants, i.e., to be located either on the same haplotype (called therefore MNV) or on two different haplotypes (a case of individual SNPs) (see Figures 1A,B).…”
Most single-nucleotide polymorphisms (SNPs) are located in non-coding regions, but the fraction usually studied is harbored in protein-coding regions because potential impacts on proteins are relatively easy to predict by popular tools such as the Variant Effect Predictor. These tools annotate variants independently without considering the potential effect of grouped or haplotypic variations, often called “multi-nucleotide variants” (MNVs). Here, we used a large RNA-seq dataset to survey MNVs, comprising 382 chicken samples originating from 11 populations analyzed in the companion paper in which 9.5M SNPs— including 3.3M SNPs with reliable genotypes—were detected. We focused our study on in-codon MNVs and evaluate their potential mis-annotation. Using GATK HaplotypeCaller read-based phasing results, we identified 2,965 MNVs observed in at least five individuals located in 1,792 genes. We found 41.1% of them showing a novel impact when compared to the effect of their constituent SNPs analyzed separately. The biggest impact variation flux concerns the originally annotated stop-gained consequences, for which around 95% were rescued; this flux is followed by the missense consequences for which 37% were reannotated with a different amino acid. We then present in more depth the rescued stop-gained MNVs and give an illustration in the SLC27A4 gene. As previously shown in human datasets, our results in chicken demonstrate the value of haplotype-aware variant annotation, and the interest to consider MNVs in the coding region, particularly when searching for severe functional consequence such as stop-gained variants.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.