MotivationProtein dynamic is essential for cellular functions. Due to the complex nature of non-covalent interactions and their long-range effects, the analysis of protein conformations using network theory can be enlightening. Protein Structure Networks (PSNs) rely on different philosophies, and the currently available tools suffer from limitations in terms of input formats, supported network models, and version control. Another issue is the precise definition of cutoffs for the network calculations and the assessment of the stability of the parameters, which ultimately affect the outcome of the analyses.ResultsWe provide two open-source software packages, i.e., PyInteraph2 and PyInKnife2, to implement and analyze PSNs in a harmonized, reproducible, and documented manner. PyInteraph2 interfaces with multiple formats for protein ensembles and calculates a diverse range of network models with the possibility to integrate them into a macro-network and perform further downstream graph analyses. PyInKnife2 is a standalone package that supports the network models implemented in PyInteraph2. It employs a jackknife resampling approach to estimate the convergence of network properties and streamline the selection of distance cutoffs. Several functionalities are based on MDAnalysis and NetworkX, including parallelization, and are available for Python 3.7. PyInteraph2 underwent a massive restructuring in terms of setup, installation, and test support compared to the original PyInteraph software.ConclusionsWe foresee that the modular structure of the code and the version control system of GitHub will promote the transition to a community-driven effort, boost reproducibility, and establish harmonized protocols in the PSN field. As developers, we will guarantee the introduction of new functionalities, assistance, training of new contributors, and maintenance of the package.AvailabilityThe packages are available at https://github.com/ELELAB/pyinteraph2 and https://github.com/ELELAB/PyInKnife2 with guides provided within the packages.
RNA editing is a form of post-transcriptional modification that results in changes to the messenger RNA sequence. At the onset of the study we focused on detecting the changes in RNA editing patterns in cell lines exposed to hypoxic conditions followed by the detection of changes in RNA editing patterns in the fetuses of preeclamptic mothers using publicly available RNA sequence data from the NCBI SRA database. The results showed an increase in RNA editing activity in hypoxic cell lines and a decrease in RNA editing activity in the fetuses with preeclamptic mothers. A total of 85 genes common in the cell lines and 33 in the fetus disease models and not present in controls were identified as harboring editing sites in exonic, downstream, upstream or splicing regions. Subsequently we focused on unique editing sites in genes and categorized in order of relevance to Preeclampsia as A, B and C (A being most closely related to the disease and C the least). The genes implicated ones involved in respiration chains, blood cell growth, cytokine and complement activation. Among the most significant of the genes were CTSB, GSR, CASP10, and MAPK13. Total number of common editing sites were found in different conditions and these were 667 for cell lines and 23 for fetuses. Validation of these variations in a larger samples size determines refined editing sites which could be used as potential diagnostic markers for intervention.
Due to the complex nature of noncovalent interactions and their long-range effects, analyzing protein conformations using network theory can be enlightening. Protein Structure Networks (PSNs) provide a convenient formalism to study protein structures in relation to essential properties such as key residues for structural stability, allosteric communication, and the effects of modifications of the protein. PSNs can be defined according to very different principles, and the available tools have limitations in input formats, supported models, and version control. Other outstanding problems are related to the definition of network cutoffs and the assessment of the stability of the network properties. The protein science community could benefit from a common framework to carry out these analyses and make them easier to reproduce, reuse, and evaluate. We here provide two open-source software packages, PyInteraph2 and PyInKnife2, to implement and analyze PSNs in a reproducible and documented manner. PyInteraph2 interfaces with multiple formats for protein ensembles and incorporates different network models with the possibility of integrating them into a macronetwork and performing various downstream analyses, including hubs, connected components, and several other centrality measures, and visualizes the networks or further analyzes them thanks to compatibility with Cytoscape.PyInKnife2 that supports the network models implemented in PyInteraph2. It employs a jackknife resampling approach to estimate the convergence of network properties and streamline the selection of distance cutoffs. We foresee that the modular structure of the code and the supported version control system will promote the transition to a community-driven effort, boost reproducibility, and establish common protocols in the PSN field. As developers, we will guarantee the introduction of new functionalities and maintenance, assistance, and training of new contributors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.