BackgroundSufficient knowledge of molecular and genetic interactions, which comprise the entire basis of the functioning of living systems, is one of the necessary requirements for successfully answering almost any research question in the field of biology and medicine. To date, more than 24 million scientific papers can be found in PubMed, with many of them containing descriptions of a wide range of biological processes. The analysis of such tremendous amounts of data requires the use of automated text-mining approaches. Although a handful of tools have recently been developed to meet this need, none of them provide error-free extraction of highly detailed information.ResultsThe ANDSystem package was developed for the reconstruction and analysis of molecular genetic networks based on an automated text-mining technique. It provides a detailed description of the various types of interactions between genes, proteins, microRNA's, metabolites, cellular components, pathways and diseases, taking into account the specificity of cell lines and organisms. Although the accuracy of ANDSystem is comparable to other well known text-mining tools, such as Pathway Studio and STRING, it outperforms them in having the ability to identify an increased number of interaction types.ConclusionThe use of ANDSystem, in combination with Pathway Studio and STRING, can improve the quality of the automated reconstruction of molecular and genetic networks. ANDSystem should provide a useful tool for researchers working in a number of different fields, including biology, biotechnology, pharmacology and medicine.
BackgroundConsideration of tissue-specific gene expression in reconstruction and analysis of molecular genetic networks is necessary for a proper description of the processes occurring in a specified tissue. Currently, there are a number of computer systems that allow the user to reconstruct molecular-genetic networks using the data automatically extracted from the texts of scientific publications. Examples of such systems are STRING, Pathway Commons, MetaCore and Ingenuity. The MetaCore and Ingenuity systems permit taking into account tissue-specific gene expression during the reconstruction of gene networks. Previously, we developed the ANDSystem tool, which also provides an automated extraction of knowledge from scientific texts and allows the reconstruction of gene networks. The main difference between our system and other tools is in the different types of interactions between objects, which makes the ANDSystem complementary to existing well-known systems. However, previous versions of the ANDSystem did not contain any information on tissue-specific expression.ResultsA new version of the ANDSystem has been developed. It offers the reconstruction of associative gene networks while taking into account the tissue-specific gene expression. The ANDSystem knowledge base features information on tissue-specific expression for 272 tissues. The system allows the reconstruction of combined gene networks, as well as performing the filtering of genes from such networks using the information on their tissue-specific expression. As an example of the application of such filtering, the gene network of the extrinsic apoptotic signaling pathway was analyzed. It was shown that considering different tissues can lead to changes in gene network structure, including changes in such indicators as betweenness centrality of vertices, clustering coefficient, network centralization, network density, etc.ConclusionsThe consideration of tissue specificity can play an important role in the analysis of gene networks, in particular solving the problem of finding the most significant central genes. Thus, the new version of ANDSystem can be employed for a wide range of tasks related to biomedical studies of individual tissues. It is available at http://www-bionet.sscc.ru/and/cell/.Electronic supplementary materialThe online version of this article (10.1186/s12859-018-2567-6) contains supplementary material, which is available to authorized users.
Background: Hypertension and bronchial asthma are a major issue for people's health. As of 2014, approximately one billion adults, or~22% of the world population, have had hypertension. As of 2011, 235-330 million people globally have been affected by asthma and approximately 250,000-345,000 people have died each year from the disease. The development of the effective treatment therapies against these diseases is complicated by their comorbidity features. This is often a major problem in diagnosis and their treatment. Hence, in this study the bioinformatical methodology for the analysis of the comorbidity of these two diseases have been developed. As such, the search for candidate genes related to the comorbid conditions of asthma and hypertension can help in elucidating the molecular mechanisms underlying the comorbid condition of these two diseases, and can also be useful for genotyping and identifying new drug targets. Results: Using ANDSystem, the reconstruction and analysis of gene networks associated with asthma and hypertension was carried out. The gene network of asthma included 755 genes/proteins and 62,603 interactions, while the gene network of hypertension -713 genes/proteins and 45,479 interactions. Two hundred and five genes/proteins and 9638 interactions were shared between asthma and hypertension. An approach for ranking genes implicated in the comorbid condition of two diseases was proposed. The approach is based on nine criteria for ranking genes by their importance, including standard methods of gene prioritization (Endeavor, ToppGene) as well as original criteria that take into account the characteristics of an associative gene network and the presence of known polymorphisms in the analysed genes. According to the proposed approach, the genes IL10, TLR4, and CAT had the highest priority in the development of comorbidity of these two diseases. Additionally, it was revealed that the list of top genes is enriched with apoptotic genes and genes involved in biological processes related to the functioning of central nervous system.
BackgroundAnalysis of molecular markers in addition to cytological analysis of fine-needle aspiration (FNA) samples is a promising way to improve the preoperative diagnosis of thyroid nodules. Nonetheless, in clinical practice, applications of existing diagnostic solutions based on the detection of somatic mutations or analysis of gene expression are limited by their high cost and difficulties with clinical interpretation.The aim of our work was to develop an algorithm for the differential diagnosis of thyroid nodules on the basis of a small set of molecular markers analyzed by real-time PCR.MethodsA total of 494 preoperative FNA samples of thyroid goiters and tumors from 232 patients with known histological reports were analyzed: goiter, 105 samples (50 patients); follicular adenoma, 101 (48); follicular carcinoma, 43 (28); Hürthle cell carcinoma, 25 (11); papillary carcinoma, 121 (56); follicular variant of papillary carcinoma, 80 (32); and medullary carcinoma, 19 (12). Total nucleic acids extracted from dried FNA smears were analyzed for five somatic point mutations and two translocations typical of thyroid tumors as well as for relative concentrations of HMGA2 mRNA and 13 microRNAs and the ratio of mitochondrial to nuclear DNA by real-time PCR. A decision tree–based algorithm was built to discriminate benign and malignant tumors and to type the thyroid cancer. Leave-p-out cross-validation with five partitions was performed to estimate prediction quality. A comparison of two independent samples by quantitative traits was carried out via the Mann–Whitney U test.ResultsA minimum set of markers was selected (levels of HMGA2 mRNA and miR-375, − 221, and -146b in combination with the mitochondrial-to-nuclear DNA ratio) and yielded highly accurate discrimination (sensitivity = 0.97; positive predictive value = 0.98) between goiters with benign tumors and malignant tumors and accurate typing of papillary, medullary, and Hürthle cell carcinomas. The results support an alternative classification of follicular tumors, which differs from the histological one.ConclusionsThe study shows the feasibility of the preoperative differential diagnosis of thyroid nodules using a panel of several molecular markers by a simple PCR-based method. Combining markers of different types increases the accuracy of classification.
Co-existence of bronchial asthma (BA) and tuberculosis (TB) is extremely uncommon (dystropic). We assume that this is caused by the interplay between genes involved into specific pathophysiological pathways that arrest simultaneous manifestation of BA and TB. Identification of common and specific genes may be important to determine the molecular genetic mechanisms leading to rare co-occurrence of these diseases and may contribute to the identification of susceptibility genes for each of these dystropic diseases. To address the issue, we propose a new methodological strategy that is based on reconstruction of associative networks that represent molecular relationships between proteins/genes associated with BA and TB, thus facilitating a better understanding of the biological context of antagonistic relationships between the diseases. The results of our study revealed a number of proteins/genes important for the development of both BA and TB.
A study of the molecular genetics mechanisms of host-pathogen interactions is of paramount importance in developing drugs against viral diseases. Currently, the literature contains a huge amount of information that describes interactions between HCV and human proteins. In addition, there are many factual databases that contain experimentally verified data on HCV-host interactions. The sources of such data are the original data along with the data manually extracted from the literature. However, the manual analysis of scientific publications is time consuming and, because of this, databases created with such an approach often do not have complete information. One of the most promising methods to provide actualisation and completeness of information is text mining. Here, with the use of a previously developed method by the authors using ANDSystem, an automated extraction of information on the interactions between HCV and human proteins was conducted. As a data source for the text mining approach, PubMed abstracts and full text articles were used. Additionally, external factual databases were analyzed. On the basis of this analysis, a special version of ANDSystem, extended with the HCV interactome, was created. The HCV interactome contains information about the interactions between 969 human and 11 HCV proteins. Among the 969 proteins, 153 'new' proteins were found not previously referred to in any external databases of protein-protein interactions for HCV-host interactions. Thus, the extended ANDSystem possesses a more comprehensive detailing of HCV-host interactions versus other existing databases. It was interesting that HCV proteins more preferably interact with human proteins that were already involved in a large number of protein-protein interactions as well as those associated with many diseases. Among human proteins of the HCV interactome, there were a large number of proteins regulated by microRNAs. It turned out that the results obtained for protein-protein interactions and microRNA-regulation did not depend on how well the proteins were studied, while protein-disease interactions appeared to be dependent on the level of study. In particular, the mean number of diseases linked to well-studied proteins (proteins were considered well-studied if they were mentioned in 50 or more PubMed publications) from the HCV interactome was 20.8, significantly exceeding the mean number of associations with diseases (10.1) for the total set of well-studied human proteins present in ANDSystem. For proteins not highly poorly-studied investigated, proteins from the HCV interactome (each protein was referred to in less than 50 publications) distribution of the number of diseases associated with them had no statistically significant differences from the distribution of the number of diseases associated with poorly-studied proteins based on the total set of human proteins stored in ANDSystem. With this, the average number of associations with diseases for the HCV interactome and the total set of human proteins were 0.3 and 0.2, ...
AimsAnalysis of molecular markers in addition to cytological analysis of fine-needle aspiration (FNA) samples is a promising way to improve the preoperative diagnosis of thyroid nodules. Previously, we have developed an algorithm for the differential diagnosis of thyroid nodules by means of a small set of molecular markers. Here, we aimed to validate this approach using FNA cytology samples of Bethesda categories III and IV, in which preoperative detection of malignancy by cytological analysis is impossible.MethodsA total of 122 FNA smears from patients with indeterminate cytology (Bethesda III: 13 patients, Bethesda IV: 109 patients) were analysed by real-time PCR regarding the preselected set of molecular markers (the BRAF V600E mutation, normalised concentrations of HMGA2 mRNA, 3 microRNAs, and the mitochondrial/nuclear DNA ratio). The decision tree–based classifier was used to discriminate between benign and malignant tumours.ResultsThe molecular testing detected malignancy in FNA smears of indeterminate cytology with 89.2% sensitivity, 84.6% positive predictive value, 92.9% specificity and 95.2% negative predictive value; these characteristics are comparable with those of more complicated commercial tests. Residual risk of malignancy for the thyroid nodules that were shown to be benign by this molecular method did not exceed the reported risk of malignancy for Bethesda II histological diagnosis. Analytical-accuracy assessment revealed required nucleic-acid input of ≥5 ng.ConclusionsThe study shows feasibility of preoperative differential diagnosis of thyroid nodules of indeterminate cytology using a small panel of molecular markers of different types by a simple PCR-based method using stained FNA smears.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.