PaleAle 5.0: prediction of protein relative solvent accessibility by deep learning

Kaleel, Manaz; Torrisi, Mirko; Mooney, Catherine; Pollastri, Gianluca

doi:10.1007/s00726-019-02767-6

Cited by 24 publications

(22 citation statements)

References 42 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Finally, all residues with RSA ≥ 20% were labeled as exposed (E) or buried (B) otherwise. This threshold (or similar ones, in the range of 15-25% RSA) is routinely adopted for computing the protein surfaces and deriving classification datasets in many studies (Thompson and Goldstein, 1996;Mucchielli-Giorgi et al, 1999;Pollastri et al, 2002;Kaleel et al, 2019), since it roughly divides the set of residues in a protein in two equally-sized subsets. In HVAR3D, using a 20% RSA threshold, we obtain 55% and 45% of residues classified as buried and exposed, respectively, corresponding to a realistic characterization of the protein interior (accounting for completely and partially buried residues) and surface (Miller et al, 1987).…”

Section: Hvar3d-20: a Dataset Of Variations Covered By 3d Structurementioning

confidence: 99%

“…With the advent of machine and deep learning-based approaches (Baldi, 2018), many methods became available for predicting RSA and ASA. They differ mainly in the machine learning approach, the volume of the database of protein structures and the predicted output (ASA, RSA, or binary classification) (Rost and Sander, 1994;Pollastri et al, 2002;Drozdetskiy et al, 2015;Ma and Wang, 2015;Fan et al, 2016;Wu et al, 2017;Kaleel et al, 2019;Klausen et al, 2019).…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences

Savojardo

Manfredi

Martelli

et al. 2021

Front. Mol. Biosci.

View full text Add to dashboard Cite

Solvent accessibility (SASA) is a key feature of proteins for determining their folding and stability. SASA is computed from protein structures with different algorithms, and from protein sequences with machine-learning based approaches trained on solved structures. Here we ask the question as to which extent solvent exposure of residues can be associated to the pathogenicity of the variation. By this, SASA of the wild-type residue acquires a role in the context of functional annotation of protein single-residue variations (SRVs). By mapping variations on a curated database of human protein structures, we found that residues targeted by disease related SRVs are less accessible to solvent than residues involved in polymorphisms. The disease association is not evenly distributed among the different residue types: SRVs targeting glycine, tryptophan, tyrosine, and cysteine are more frequently disease associated than others. For all residues, the proportion of disease related SRVs largely increases when the wild-type residue is buried and decreases when it is exposed. The extent of the increase depends on the residue type. With the aid of an in house developed predictor, based on a deep learning procedure and performing at the state-of-the-art, we are able to confirm the above tendency by analyzing a large data set of residues subjected to variations and occurring in some 12,494 human protein sequences still lacking three-dimensional structure (derived from HUMSAVAR). Our data support the notion that surface accessible area is a distinguished property of residues that undergo variation and that pathogenicity is more frequently associated to the buried property than to the exposed one.

show abstract

Section: Hvar3d-20: a Dataset Of Variations Covered By 3d Structurementioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences

Savojardo

Manfredi

Martelli

et al. 2021

Front. Mol. Biosci.

View full text Add to dashboard Cite

show abstract

“…NetSurfP-2.0 (http://www.cbs.dtu.dk/services/NetSurfP/) (Klausen et al, 2019) was used to predict the surface accessibility of every Ser and Thr residue of human ACE2 3D structure. PaleAle 5.0 (http://distilldeep.ucd.ie/paleale/) (Kaleel et al, 2019) was used to predict the available area of Ser and Thr residues for ligand interaction. Yin-Yang sites were also predicted on 3D structure of ACE2 protein through docking experiments with the addition of the phosphate group, and UDP-GlcNAc individually using SwissDock server (http://www.swissdock.ch/) (Grosdidier et al, 2011).…”

Section: Prediction Of Yin-yang Sitesmentioning

confidence: 99%

O-β-GlcNAcylation, Chloroquine and 2-Hydroxybenzohydrazine May Hamper SARS-CoV-2 entry to Human via Inhibition of ACE2 Phosphorylation at Ser787 but Also Induce Disruption of Virus-ACE2 Binding

Ahmad¹,

Shabbiri²,

Islam³

2020

Preprint

View full text Add to dashboard Cite

The novel coronavirus COVID-19 disease is extremely contagious and has been spread worldwide. First COVID-19 case was identified in December, 2019 and within three months, more than one million affected cases and over 65,000 deaths have been reported. SARS-coronavirus 2 Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 22 April 2020 to the SARS CoV (Severe Acute Respiratory Syndrome corona virus) family. The SARS-CoV-2 enters the human body by binding its viral surface spike protein with the host angiotensinconverting enzyme 2 (ACE2) receptors and cause infection. To prevent the virus entry and its transmission in the human body, we focused on the two domains of ACE2: i) the N-terminal extracellular binding domain (18-740 residues) reported for coronavirus spike interaction, and ii) the C-terminal cytoplasmic region (762-805 residues) to prevent the virus transmission. Therefore, we proposed: i) inhibition of receptor binding domain (RBD) of SARS-CoV-2 and human ACE2 protein may prevent the virus entry to the host and ii) inhibition of phosphorylation at Ser-787 of ACE2 protein may prevent the transmission of the virus in the COVID-19 patients. In the past, the critical role of Ser 787 in human ACE2 protein has been experimentally verified in SARS-CoV transmission, that upon binding to the receptor, SARS-CoV induces CKII-mediated phosphorylation of ACE2 at Ser-787 that in-turn facilitate virus entry to host cells, followed by replication and activation of ACE2, initiates downstream signaling leading to lung fibrosis. Therefore, in this study, we have suggested post-translational modification (PTM) O-β-GlcNAcylation, and two compounds Chloroquine and 2-hydroxybenzohydrazine might share the common pathways to prevent the COVID-19 infection in human. The addition of O-β-GlcNAcylation at same or neighboring Ser/ Thr residues results in phosphorylation inhibition and a change in protein structural and functional confirmations. Thereby, using neural networking methods, we have identified Ser/ Thr residues in ACE2 that are potential sites for phosphorylation and / or O-β-GlcNAcylation. Molecular docking showed that UDP-GlcNAc has more binding affinity with Ser-787 than the phosphoryl group. Moreover, chloroquine and 2hydroxybenzohydrazine also showed great potential to bind at Ser-787 that may result in inhibition of Ser-787 phosphorylation and downstream signaling. Furthermore, O-β-GlcNAcylation, chloroquine and 2-hydroxybenzohydrazine showed their high affinity at ACE2-SARS-CoV-2receptor binding domain that may prevent the entry of SARS-CoV-2 into human body. In conclusion, inhibition of human ACE2 phosphorylation at Ser-787 and ACE2-SARS-CoV-2 binding domain could be promising targets against SARS-CoV-2 infection.

show abstract

“…Thus, we will also implement recurrent neural networks to predict the rSASA values for each residue. 67,68 This model can then be concatenated with the model developed here. Additionally, machine learning methods can be used to predict the particular f c for each amino acid sequence to estimate the difference, Δf c , between the prediction and the actual value.…”

Section: Discussionmentioning

confidence: 99%

“…For example, we have shown that the identification of core residues is one of the most important aspects for determining a predicted structure's accuracy. Thus, we will also implement recurrent neural networks to predict the rSASA values for each residue 67,68 . This model can then be concatenated with the model developed here.…”

Section: Discussionmentioning

confidence: 99%

Using physical features of protein core packing to distinguish real proteins from decoys

et al. 2020

View full text Add to dashboard Cite

The ability to consistently distinguish real protein structures from computationally generated model decoys is not yet a solved problem. One route to distinguish real protein structures from decoys is to delineate the important physical features that specify a real protein. For example, it has long been appreciated that the hydrophobic cores of proteins contribute significantly to their stability. We used two sources to obtain datasets of decoys to compare with real protein structures: submissions to the biennial Critical Assessment of protein Structure Prediction competition, in which researchers attempt to predict the structure of a protein only knowing its amino acid sequence, and also decoys generated by 3DRobot, which have user‐specified global root‐mean‐squared deviations from experimentally determined structures. Our analysis revealed that both sets of decoys possess cores that do not recapitulate the key features that define real protein cores. In particular, the model structures appear more densely packed (because of energetically unfavorable atomic overlaps), contain too few residues in the core, and have improper distributions of hydrophobic residues throughout the structure. Based on these observations, we developed a feed‐forward neural network, which incorporates key physical features of protein cores, to predict how well a computational model recapitulates the real protein structure without knowledge of the structure of the target sequence. By identifying the important features of protein structure, our method is able to rank decoy structures with similar accuracy to that obtained by state‐of‐the‐art methods that incorporate many additional features. The small number of physical features makes our model interpretable, emphasizing the importance of protein packing and hydrophobicity in protein structure prediction.

show abstract

PaleAle 5.0: prediction of protein relative solvent accessibility by deep learning

Cited by 24 publications

References 42 publications

Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences

Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences

<strong>O-β-GlcNAcylation,</strong><strong> Chloroquine and </strong><strong>2-Hydroxybenzohydrazine May Hamper SARS-CoV-2 entry to Human via Inhibition of ACE2 Phosphorylation at Ser787 but Also Induce Disruption of Virus-ACE2 Binding</strong>

Using physical features of protein core packing to distinguish real proteins from decoys

Contact Info

Product

Resources

About

PaleAle 5.0: prediction of protein relative solvent accessibility by deep learning

Cited by 24 publications

References 42 publications

Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences

Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences

<strong>O-&beta;-GlcNAcylation,</strong><strong> Chloroquine and </strong><strong>2-Hydroxybenzohydrazine May Hamper SARS-CoV-2 entry to Human via Inhibition of ACE2 Phosphorylation at Ser787 but Also Induce Disruption of Virus-ACE2 Binding</strong>

Using physical features of protein core packing to distinguish real proteins from decoys

Contact Info

Product

Resources

About

<strong>O-β-GlcNAcylation,</strong><strong> Chloroquine and </strong><strong>2-Hydroxybenzohydrazine May Hamper SARS-CoV-2 entry to Human via Inhibition of ACE2 Phosphorylation at Ser787 but Also Induce Disruption of Virus-ACE2 Binding</strong>