Arno G. Stefani scite author profile

Arno G. Stefani

5 Publications

13 Citation Statements Received

64 Citation Statements Given

Affiliations

University of Erlangen-Nuremberg

Publications

Application of information theory to feature selection in protein docking

et al. 2011

In the era of structural genomics, the prediction of protein interactions using docking algorithms is an important goal. The success of this method critically relies on the identification of good docking solutions among a vast excess of false solutions. We have adapted the concept of mutual information (MI) from information theory to achieve a fast and quantitative screening of different structural features with respect to their ability to discriminate between physiological and nonphysiological protein interfaces. The strategy includes the discretization of each structural feature into distinct value ranges to optimize its mutual information. We have selected 11 structural features and two datasets to demonstrate that the MI is dimensionless and can be directly compared for diverse structural features and between datasets of different sizes. Conversion of the MI values into a simple scoring function revealed that those features with a higher MI are actually more powerful for the identification of good docking solutions. Thus, an MI-based approach allows the rapid screening of structural features with respect to their information content and should therefore be helpful for the design of improved scoring functions in future. In addition, the concept presented here may also be adapted to related areas that require feature selection for biomolecules or organic ligands.
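
The screening idea in this abstract can be illustrated with a minimal sketch. It assumes a plug-in (frequency-count) estimate of the mutual information and a single cut point per feature. The function names, the toy data, and the one-threshold search are illustrative assumptions, not the authors' implementation, which the abstract does not specify.

```python
import bisect
import math
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits from paired discrete samples."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2(p(x,y) / (p(x) * p(y))), written in terms of counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def discretize(values, edges):
    """Map each value to the index of its bin, given sorted bin edges."""
    return [bisect.bisect_right(edges, v) for v in values]


def best_single_threshold(values, labels):
    """Scan midpoints between distinct values; return (threshold, MI) with the highest MI."""
    candidates = sorted(set(values))
    best = (None, 0.0)
    for lo, hi in zip(candidates, candidates[1:]):
        t = (lo + hi) / 2
        mi = mutual_information(discretize(values, [t]), labels)
        if mi > best[1]:
            best = (t, mi)
    return best


# Hypothetical feature values for six docking decoys; label 1 marks a
# physiological interface, label 0 a nonphysiological one.
feature = [0.1, 0.2, 0.3, 0.8, 0.9, 1.0]
labels = [0, 0, 0, 1, 1, 1]
threshold, mi = best_single_threshold(feature, labels)
print(threshold, mi)
```

Because the estimate is expressed in bits, the value obtained for one feature can be compared directly with the value obtained for another, which is the property the abstract relies on when ranking features.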

A tight lower bound on the mutual information of a binary and an arbitrary finite random variable as a function of the variational distance

Stefani, Huber, Jardin et al. 2014

This paper presents a numerical method that finds a lower bound on the mutual information between a binary and an arbitrary finite random variable, taken over all joint distributions whose variational distance to a known joint distribution does not exceed a known value. This lower bound can be applied to mutual information estimation with confidence intervals.

An information-theoretic classification of amino acids for the assessment of interfaces in protein–protein docking

et al. 2013

Docking represents a versatile and powerful method to predict the geometry of protein-protein complexes. However, despite significant methodical advances, the identification of good docking solutions among a large number of false solutions still remains a difficult task. We have previously demonstrated that the formalism of mutual information (MI) from information theory can be adapted to protein docking, and we have now extended this approach to enhance its robustness and applicability. A large dataset consisting of 22,934 docking decoys derived from 203 different protein-protein complexes was used for an MI-based optimization of reduced amino acid alphabets representing the protein-protein interfaces. This optimization relied on a clustering analysis that allows one to estimate the mutual information of whole amino acid alphabets by considering all structural features simultaneously, rather than by treating them individually. This clustering approach is fast and can be applied in a similar fashion to the generation of reduced alphabets for other biological problems like fold recognition, sequence data mining, or secondary structure prediction. The reduced alphabets derived from the present work were converted into a scoring function for the evaluation of docking solutions, which is available for public use via the web service score-MI: http://score-MI.biochem.uni-erlangen.de.

Confidence intervals for the mutual information

Stefani, Huber, Jardin et al. 2014

IJMISSP

Confidence intervals for the mutual information of two random variables with finite alphabets are established by combining two bounds: a bound on the absolute difference in mutual information between two joint probability distributions at a fixed variational distance, and a bound on the probability of a maximal deviation in variational distance between a true joint probability distribution and an empirical joint probability distribution. Unlike previous results, these intervals require no assumptions about the distribution or the sample size.
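
The two-step construction described in this abstract can be sketched with standard textbook ingredients. This is not the paper's bound, which the abstract does not state. The sketch assumes the variational distance is the L1 distance, uses the concentration inequality of Weissman et al. for the empirical distribution, and uses the entropy continuity bound |H(p) - H(q)| <= -t log(t / |X|) (valid for L1 distance t <= 1/2) applied to H(X), H(Y) and H(X,Y). The result is looser than a dedicated bound and falls back to the trivial interval when the sample is too small for the continuity bound to apply. All function names are hypothetical.

```python
import math
from collections import Counter


def plugin_mi_bits(pairs):
    """Plug-in estimate of I(X;Y) in bits from a list of (x, y) samples."""
    n = len(pairs)
    joint = Counter(pairs)
    px = Counter(x for x, _ in pairs)
    py = Counter(y for _, y in pairs)
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in joint.items())


def l1_radius(n, k, delta):
    """Radius eps such that P(||P_hat - P||_1 >= eps) <= delta for alphabet size k.

    Assumed form: P(...) <= (2**k - 2) * exp(-n * eps**2 / 2).
    """
    return math.sqrt(2.0 * (math.log(2 ** k - 2) - math.log(delta)) / n)


def entropy_gap_bits(eps, size):
    """Bound on |H(p) - H(q)| in bits when ||p - q||_1 <= eps <= 1/2."""
    return -eps * math.log2(eps / size)


def mi_confidence_interval(pairs, size_x, size_y, delta=0.05):
    """Return (lower, upper) holding with probability >= 1 - delta.

    size_x and size_y are the known alphabet sizes (each at least 2),
    not just the number of symbols that happen to be observed.
    """
    upper_cap = math.log2(min(size_x, size_y))
    eps = l1_radius(len(pairs), size_x * size_y, delta)
    if eps > 0.5:
        # Continuity bound not applicable; only the trivial interval is safe.
        return 0.0, upper_cap
    # Marginals are at least as close in L1 as the joint distributions,
    # so the same eps bounds all three entropy terms of I = H(X) + H(Y) - H(X,Y).
    slack = (entropy_gap_bits(eps, size_x)
             + entropy_gap_bits(eps, size_y)
             + entropy_gap_bits(eps, size_x * size_y))
    mi = plugin_mi_bits(pairs)
    return max(0.0, mi - slack), min(upper_cap, mi + slack)


# Toy data: two perfectly dependent binary variables.
samples = [(0, 0)] * 50000 + [(1, 1)] * 50000
print(mi_confidence_interval(samples, 2, 2))
```

The interval shrinks as the sample grows because the L1 radius decreases on the order of 1/sqrt(n).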

Application of Methods from Information Theory in Protein-Interaction Analysis

Stefani, Sandmann, Burkovski et al. 2017
