Using Supervised Learning Methods for Gene Selection in RNA-Seq Case-Control Studies

Wenric, Stéphane; Shemirani, Ruhollah

doi:10.3389/fgene.2018.00297

Cited by 41 publications

(29 citation statements)

References 36 publications

(29 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For this, we used a random forest approach, utilizing the transcriptome data from the mouse hippocampi we recently produced (see Methods) [57]. Machine-learning methods are progressively being applied to rank ensembles of genes defined by their expression values measured with RNA-seq [74]. Using importance measures generated by the random forest algorithm, we identified groups of 40 apoptosis-related genes and 10 proliferation-related genes that together differentiate between the adult Angelman syndrome model mice and the wild-type (WT) littermates.…”

Section: Mouse Brain Rna-seq Data Also Reveals Alterations In Apoptotmentioning

confidence: 99%

Novel Insights into the Role of UBE3A in Regulating Apoptosis and Proliferation

Simchi

Panov

Morsy

et al. 2020

JCM

View full text Add to dashboard Cite

The UBE3A gene codes for a protein with two known functions, a ubiquitin E3-ligase which catalyzes ubiquitin binding to substrate proteins and a steroid hormone receptor coactivator. UBE3A is most famous for its critical role in neuronal functioning. Lack of UBE3A protein expression leads to Angelman syndrome (AS), while its overexpression is associated with autism. In spite of extensive research, our understanding of UBE3A roles is still limited. We investigated the cellular and molecular effects of Ube3a deletion in mouse embryonic fibroblasts (MEFs) and Angelman syndrome (AS) mouse model hippocampi. Cell cultures of MEFs exhibited enhanced proliferation together with reduced apoptosis when Ube3a was deleted. These findings were supported by transcriptome and proteome analyses. Furthermore, transcriptome analyses revealed alterations in mitochondria-related genes. Moreover, an analysis of adult AS model mice hippocampi also found alterations in the expression of apoptosis- and proliferation-associated genes. Our findings emphasize the role UBE3A plays in regulating proliferation and apoptosis and sheds light into the possible effects UBE3A has on mitochondrial involvement in governing this balance.

show abstract

Section: Mouse Brain Rna-seq Data Also Reveals Alterations In Apoptotmentioning

confidence: 99%

Novel Insights into the Role of UBE3A in Regulating Apoptosis and Proliferation

Simchi

Panov

Morsy

et al. 2020

JCM

View full text Add to dashboard Cite

show abstract

“…We tested this hypothesis by applying machine learning algorithms on two groups of heifers that were bred in 2015 (year one) and 2016 (year two). Parallel random forest emerged as the algorithm with over 90% efficiency of classification nearly all trials executed, which confirms the potential of accurate classification of samples using RNA-seq data under the case-control framework 60,61 . The results show that while not one single gene emerges as a potential biomarker, the accumulated information of transcript abundance from multiple genes can be powerful for the identification of fertility potential in cattle.…”

Section: Discussionmentioning

confidence: 53%

Rewiring of gene expression in circulating white blood cells is associated with pregnancy outcome in heifers (Bos taurus)

Moorey¹,

Walker

Elmore

et al. 2020

Preprint

View full text Add to dashboard Cite

16Infertility is a disease that affects humans and cattle in similar ways. The resemblance includes 17 complex genetic architecture, multiple etiology, low heritability of fertility related traits in females, 18 and the frequency in the female population. Here, we used cattle as a biomedical model to test 19 the hypothesis that gene expression profiles of protein-coding genes expressed in peripheral 20 white blood cells (PWBCs), and circulating micro RNAs in plasma, are associated with female 21 fertility, measured by pregnancy outcome. We drew blood samples from 17 female calves on the 22 day of artificial insemination and analyzed transcript abundance for 10496 genes in PWBCs and 23 290 circulating micro RNAs. The females were later classified as pregnant to artificial 24 insemination, pregnant to natural breeding or not pregnant. We identified 1860 genes producing 25 significant differential coexpression (eFDR<0.002) based on pregnancy outcome. Additionally, 26 237 micro RNAs and 2274 genes in PWBCs presented differential coexpression based on 27 pregnancy outcome. Furthermore, using a machine learning prediction algorithm we detected a 28 subset of genes whose abundance could be used for blind categorization of pregnancy outcome. 29Our results provide strong evidence that bloodborne transcript abundance is highly associated 30 with fertility in females. 42dissecting female fertility traits 20,21 . 43Beyond the importance as a biomedical model, cattle production systems provide 44 approximately 28% 22 of the protein supply globally. Improving cattle production efficiency is 45 essential for farmers to attain sustainable production and support the growing demand for animal 46 protein 22 . Infertility is a major factor that hinders efficiency in cattle production, and it starts with 47 limited success of pregnancy in young female calves. First breeding success greatly influences 48 the lifetime efficiency of beef replacement heifers. Heifers that calve early in their first calving 49 season experience increased productivity and longevity than their later calving herd 50 mates 23,24,25,26 . Furthermore, the genetic correlation between yearling pregnancy rate and lifetime 51 pregnancy rate is high (0.92-0.97) 27,28 . Therefore, the ability to identify heifers that experience 52 optimal fertility during the first breeding is essential to the sustainability of beef cattle production 53 systems. 54The examination of the genetic components of fertility in beef heifers have yielded several 55 genes potentially associated with fertility traits 5,6,7,8,9,10,11,12 , but the effect of these markers are 56 minimal, and there is no clear redundancy in genetic markers identified across breeds. Beyond 57 the genomic profiling, the analysis of multiple layers of an individual's molecular blueprint is likely 58 key for understanding the underlying biology of complex traits 29 . In line with this rationale, 59 expression-trait association studies have emerged as a means to better understand complex 60 traits 30,31 . Specificall...

show abstract

“…We tested this hypothesis by applying machine learning algorithms on two groups of heifers that were bred in 2015 (year one) and 2016 (year two). Parallel random forest emerged as the algorithm with over 90% efficiency of classification nearly all trials executed, which confirms the potential of accurate classification of samples using RNA-seq data under the case-control framework 69,70 . The results show that while not one single gene emerges as a potential biomarker, the accumulated information of transcript abundance from multiple genes can be powerful for the identification of fertility potential in cattle.…”

Section: Discussionmentioning

confidence: 54%

Rewiring of gene expression in circulating white blood cells is associated with pregnancy outcome in heifers (Bos taurus)

Moorey

Walker

Elmore

et al. 2020

Sci Rep

View full text Add to dashboard Cite

Infertility is a challenging phenomenon in cattle that reduces the sustainability of beef production worldwide. Here, we tested the hypothesis that gene expression profiles of protein-coding genes expressed in peripheral white blood cells (PWBCs), and circulating micro RNAs in plasma, are associated with female fertility, measured by pregnancy outcome. We drew blood samples from 17 heifers on the day of artificial insemination and analyzed transcript abundance for 10,496 genes in PWBCs and 290 circulating micro RNAs. The females were later classified as pregnant to artificial insemination, pregnant to natural breeding or not pregnant. We identified 1860 genes producing significant differential coexpression (eFDR < 0.002) based on pregnancy outcome. Additionally, 237 micro RNAs and 2274 genes in PWBCs presented differential coexpression based on pregnancy outcome. Furthermore, using a machine learning prediction algorithm we detected a subset of genes whose abundance could be used for blind categorization of pregnancy outcome. Our results provide strong evidence that transcript abundance in circulating white blood cells is associated with fertility in heifers.

show abstract

Using Supervised Learning Methods for Gene Selection in RNA-Seq Case-Control Studies

Cited by 41 publications

References 36 publications

Novel Insights into the Role of UBE3A in Regulating Apoptosis and Proliferation

Novel Insights into the Role of UBE3A in Regulating Apoptosis and Proliferation

Rewiring of gene expression in circulating white blood cells is associated with pregnancy outcome in heifers (Bos taurus)

Rewiring of gene expression in circulating white blood cells is associated with pregnancy outcome in heifers (Bos taurus)

Contact Info

Product

Resources

About