This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
While
the COVID-19 pandemic is causing important loss of life, knowledge
of the effects of the causative SARS-CoV-2 virus on human cells is
currently limited. Investigating protein–protein interactions
(PPIs) between viral and host proteins can provide a better understanding
of the mechanisms exploited by the virus and enable the identification
of potential drug targets. We therefore performed an in-depth computational
analysis of the interactome of SARS-CoV-2 and human proteins in infected
HEK 293 cells published by Gordon et al. (
Nature
2020
,
583
, 459–468) to reveal processes
that are potentially affected by the virus and putative protein binding
sites. Specifically, we performed a set of network-based functional
and sequence motif enrichment analyses on SARS-CoV-2-interacting human
proteins and on PPI networks generated by supplementing viral-host
PPIs with known interactions. Using a novel implementation of our
GoNet algorithm, we identified 329 Gene Ontology terms for which the
SARS-CoV-2-interacting human proteins are significantly clustered
in PPI networks. Furthermore, we present a novel protein sequence
motif discovery approach, LESMoN-Pro, that identified 9 amino acid
motifs for which the associated proteins are clustered in PPI networks.
Together, these results provide insights into the processes and sequence
motifs that are putatively implicated in SARS-CoV-2 infection and
could lead to potential therapeutic targets.
Background
Quantitative proteomics studies are often used to detect proteins that are differentially expressed across different experimental conditions. Functional enrichment analyses are then typically used to detect annotations, such as biological processes that are significantly enriched among such differentially expressed proteins to provide insights into the molecular impacts of the studied conditions. While common, this analytical pipeline often heavily relies on arbitrary thresholds of significance. However, a functional annotation may be dysregulated in a given experimental condition, while none, or very few of its proteins may be individually considered to be significantly differentially expressed. Such an annotation would therefore be missed by standard approaches.
Results
Herein, we propose a novel graph theory-based method, PIGNON, for the detection of differentially expressed functional annotations in different conditions. PIGNON does not assess the statistical significance of the differential expression of individual proteins, but rather maps protein differential expression levels onto a protein–protein interaction network and measures the clustering of proteins from a given functional annotation within the network. This process allows the detection of functional annotations for which the proteins are differentially expressed and grouped in the network. A Monte-Carlo sampling approach is used to assess the clustering significance of proteins in an expression-weighted network. When applied to a quantitative proteomics analysis of different molecular subtypes of breast cancer, PIGNON detects Gene Ontology terms that are both significantly clustered in a protein–protein interaction network and differentially expressed across different breast cancer subtypes. PIGNON identified functional annotations that are dysregulated and clustered within the network between the HER2+, triple negative and hormone receptor positive subtypes. We show that PIGNON’s results are complementary to those of state-of-the-art functional enrichment analyses and that it highlights functional annotations missed by standard approaches. Furthermore, PIGNON detects functional annotations that have been previously associated with specific breast cancer subtypes.
Conclusion
PIGNON provides an alternative to functional enrichment analyses and a more comprehensive characterization of quantitative datasets. Hence, it contributes to yielding a better understanding of dysregulated functions and processes in biological samples under different experimental conditions.
Background: Quantitative proteomics studies are often used to detect proteins that are differentially expressed across different experimental conditions. Functional enrichment analyses are then typically used to detect annotations, such as biological processes that are significantly enriched among such differentially expressed proteins to provide insights into the molecular impacts of the studied conditions. While common, this analytical pipeline heavily relies on arbitrary thresholds of significance. Indeed, a functional annotation may be dysregulated in a given experimental condition, while none or very few of its proteins may be individually considered to be significantly differentially expressed. Such an annotation would therefore be missed by standard approaches. Results: Herein, we propose a novel graph theory-based method, PIGNON, for the detection of differentially expressed functional annotations in different conditions. PIGNON does not assess the statistical significance of individual genes, but rather maps protein differential expression levels onto a protein-protein interaction network and measures the clustering of proteins from a given functional annotation within the network. This process allows the detection of functional annotations for which the proteins are differentially expressed and grouped in the network. A Monte-Carlo sampling approach is used to assess the clustering of proteins in an expression-weighted network. When applied to a quantitative proteomics analysis of different molecular subtypes of breast cancer, PIGNON detects Gene Ontology terms that are both significantly clustered in a protein-protein interaction network and differentially expressed across two breast cancer subtypes. PIGNON identified 168 breast cancer pathways dysregulated and clustered within the network between the HER2+ and triple negative subtypes, 203 breast cancer pathways shared by HER2+ and hormone receptor positive subtypes, 19 breast cancer pathways shared by hormone receptor positive and triple negative breast that are not detected by standard approaches. PIGNON identifies functional annotations that have been previously associated with specific breast cancer subtypes as well as novel annotations that may be implicated in the diseases. Conclusion: PIGNON provides an alternative to functional enrichment analyses and a more comprehensive characterization of quantitative datasets. Hence, it contributes to yielding a better understanding of dysregulated functions and processes in biological samples under different conditions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.