Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board- approved “open consent” process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain—we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research.
To date, no effective method exists that predicts the response to preoperative chemoradiation (CRT) in locally advanced rectal cancer (LARC). Nevertheless, identification of patients who have a higher likelihood of responding to preoperative CRT could be crucial in decreasing treatment morbidity and avoiding expensive and time-consuming treatments. The aim of this study was to identify signatures or molecular markers related to response to pre-operative CRT in LARC. We analyzed the gene expression profiles of 26 pre-treatment biopsies of LARC (10 responders and 16 non-responders) without metastasis using Human WG CodeLink microarray platform. Two hundred and fifty seven genes were differentially over-expressed in the responder patient subgroup. Ingenuity Pathway Analysis revealed a significant ratio of differentially expressed genes related to cancer, cellular growth and proliferation pathways, and c-Myc network. We demonstrated that high Gng4, c-Myc, Pola1, and Rrm1 mRNA expression levels was a significant prognostic factor for response to treatment in LARC patients (p<0.05). Using this gene set, we were able to establish a new model for predicting the response to CRT in rectal cancer with a sensitivity of 60% and 100% specificity. Our results reflect the value of gene expression profiling to gain insight about the molecular pathways involved in the response to treatment of LARC patients. These findings could be clinically relevant and support the use of mRNA levels when aiming to identify patients who respond to CRT therapy.
There has been a high local recurrence rate in rectal cancer. Besides improvements in surgical techniques, both neoadjuvant short-course radiotherapy and long-course chemoradiation improve oncological results. Approximately 40–60% of rectal cancer patients treated with neoadjuvant chemoradiation achieve some degree of pathologic response. However, there is no effective method of predicting which patients will respond to neoadjuvant treatment. Recent studies have evaluated the potential of genetic biomarkers to predict outcome in locally advanced rectal adenocarcinoma treated with neoadjuvant chemoradiation. The articles produced by the PubMed search were reviewed for those specifically addressing a genetic profile's ability to predict response to neoadjuvant treatment in rectal cancer. Although tissue gene microarray profiling has led to promising data in cancer, to date, none of the identified signatures or molecular markers in locally advanced rectal cancer has been successfully validated as a diagnostic or prognostic tool applicable to routine clinical practice.
Administration of chemoradiation before tumor resection has revolutionized the management of locally advanced rectal cancer, but many patients have proven resistant to this preoperative therapy. Our group recently reported a negative correlation between c-Myc gene expression and this resistance. In the present study, integrated analysis of miRNA and mRNA expression profiles was conducted in 45 pre-treatment rectal tumors in order to analyze the expressions of miRNAs and c-Myc and their relationship with clinicopathological factors and patient survival. Twelve miRNAs were found to be differentially expressed by responders and non-responders to the chemoradiation. Functional classification revealed an association between the differentially expressed miRNAs and c-Myc. Quantitative real-time PCR results showed that miRNA-148 and miRNA-375 levels were both significantly lower in responders than in non-responders. Notably, a significant negative correlation was found between miRNA-375 expression and c-Myc expression. According to these findings, miRNA-375 and its targeted c-Myc may be useful as a predictive biomarker of the response to neoadjuvant treatment in patients with locally advanced rectal cancer.
BackgroundPrioritization methods have become an useful tool for mining large amounts of data to suggest promising hypotheses in early research stages. Particularly, network-based prioritization tools use a network representation for the interactions between different biological entities to identify novel indirect relationships. However, current network-based prioritization tools are strongly tailored to specific domains of interest (e.g. gene-disease prioritization) and they do not allow to consider networks with more than two types of entities (e.g. genes and diseases). Therefore, the direct application of these methods to accomplish new prioritization tasks is limited.ResultsThis work presents ProphNet, a generic network-based prioritization tool that allows to integrate an arbitrary number of interrelated biological entities to accomplish any prioritization task. We tested the performance of ProphNet in comparison with leading network-based prioritization methods, namely rcNet and DomainRBF, for gene-disease and domain-disease prioritization, respectively. The results obtained by ProphNet show a significant improvement in terms of sensitivity and specificity for both tasks. We also applied ProphNet to disease-gene prioritization on Alzheimer, Diabetes Mellitus Type 2 and Breast Cancer to validate the results and identify putative candidate genes involved in these diseases.ConclusionsProphNet works on top of any heterogeneous network by integrating information of different types of biological entities to rank entities of a specific type according to their degree of relationship with a query set of entities of another type. Our method works by propagating information across data networks and measuring the correlation between the propagated values for a query and a target sets of entities. ProphNet is available at: http://genome2.ugr.es/prophnet. A Matlab implementation of the algorithm is also available at the website.
Agglomerating results from studies of individual biological components has shown the potential to produce biomedical discovery and the promise of therapeutic development. Such knowledge integration could be tremendously facilitated by automated text mining for relation extraction in biomedical literature. Relation extraction systems cannot be developed without substantial datasets annotated with ground truth for benchmarking and training. The creation of such datasets is hampered by the absence of a resource for launching a distributed annotation effort, as well as by the lack of a standardized annotation schema. We have developed an annotation schema and an annotation tool which can be widely adopted so that the resulting annotated corpora from a multitude of disease studies could be assembled into a unified benchmark dataset. The contribution of this paper is threefold. First, we provide an overview of available benchmark corpora and derive a simple annotation schema for specific binary relation extraction problems such as protein-protein and gene-disease relation extraction. Second, we present BioNotate: an open source annotation resource for the distributed creation of a large corpus. Third, we present and make available the results of a pilot annotation effort of the autism disease network.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.