Background: Endometriosis is a common gynecological disorder that usually causes infertility, pelvic pain, and ovarian masses. This study aimed to mine the characteristic genes of endometriosis, and explore the regulatory mechanism and potential therapeutic drugs based on whole transcriptome sequencing data and resources from public databases, providing a theoretical basis for the diagnosis and treatment of endometriosis.Methods: The transcriptome data of the five eutopic (EU) and ectopic (EC) endometrium samples were obtained from Beijing Obstetrics and Gynecology Hospital, Beijing, China, and dinified as the own data set. The expression and clinical data of EC and EU samples in GSE25628 and GSE7305 datasets were obtained from the GEO database (https://www.ncbi.nlm.nih.gov/gds). Differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) were used to identify the endometriosis-related differentially expressed genes. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were conducted by the “clusterProfiler” R package. Then, characteristic genes for endometriosis were identified by the least absolute shrinkage and selection operator (LASSO) and support vector machine recursive feature elimination (SVM-RFE) algorithm. The expression of characteristic genes was verified by quantitative reverse transcription polymerase chain reaction (qRT-PCR) and western-blot. The receiver operating characteristic (ROC) curve was used to evaluate the discriminatory ability of characteristic genes. We assessed the abundance of infiltrating immune cells in each sample using MCP-counter and ImmuCellAI algorithms. The competitive endogenous RNA (ceRNA) regulatory network of characteristic genes was created by Cytoscape and potential targeting drugs were obtained in the CTD database.Results: 44 endometriosis-related differentially expressed genes were obtained from GSE25628 and the own dataset. Subsequently, LASSO and SVM-RFE algorithms identified four characteristic genes, namely ACLY, PTGFR, ADH1B, and MYOM1. The results of RT-PCR and western-blot were consistent with those of sequencing. The result of ROC curves indicated that the characteristic genes had powerful abilities in distinguishing EC samples from EU samples. Infiltrating immune cells analysis suggested that there was a certain difference in immune microenvironment between EC and EU samples. The characteristic genes were significantly correlated with specific differential immune cells between EC and EU samples. Then, a ceRNA regulatory network of characteristic genes was constructed and showed a total of 7, 11, 11, and 1 miRNA associated with ACLY, ADH1B, PTGFR, and MYOM1, respectively. Finally, we constructed a gene-compound network and mined 30 drugs targeting ACLY, 33 drugs targeting ADH1B, 13 drugs targeting MYOM1, and 12 drugs targeting PTGFR.Conclusion: Comprehensive bioinformatic analysis was used to identify characteristic genes, and explore ceRNA regulatory network and potential therapeutic agents for endometriosis. Altogether, these findings provide new insights into the diagnosis and treatment of endometriosis.
The precision and expense of biological research have grown in inverse proportion during decades of ongoing bioinformatics technology development, with steadily increasing accuracy and gradually decreasing spending. In the meantime, the demand for data analysis has skyrocketed for questions that have remained unanswered since the relatively mature biological sector. During the 1990s and 2000s, considerable advances in sequencing technology, combined with lower costs, resulted in an exponential increase in data analysis demand. The emergence of 'Big Data' has posed new data mining and administration issues, necessitating the incorporation of greater computer science knowledge. Traditional experimental approaches have started to demonstrate some flaws, such as inefficiency and inaccuracy. Combining earlier experimental methodologies with modern bioinformatics data analysis approaches could be a great place to meet today's more demanding data analysis demands. However, numerous opportunities for boosting efficacy using bioinformatic data analytic applications are still being summarised. This review will summarize the current application of the latest bioinformatics technology in biological research disciplines such as biomedicine, medicine, and public health. By evaluating significant biology data with current-updating software, it could better comprehend how gene sequencing, medicine, and diagnostic screening method exploration, quick coping, and update in public health study might be better achieved.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.