Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest.
Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest.
Identifying protein-protein interactions is crucial for a systems-level understanding of the cell. Recently, algorithms based on inverse statistical physics, e.g. Direct Coupling Analysis (DCA), have allowed to use evolutionarily related sequences to address two conceptually related inference tasks: finding pairs of interacting proteins, and identifying pairs of residues which form contacts between interacting proteins. Here we address two underlying questions: How are the performances of both inference tasks related? How does performance depend on dataset size and the quality? To this end, we formalize both tasks using Ising models defined over stochastic block models, with individual blocks representing single proteins, and inter-block couplings protein-protein interactions; controlled synthetic sequence data are generated by Monte-Carlo simulations. We show that DCA is able to address both inference tasks accurately when sufficiently large training sets of known interaction partners are available, and that an iterative pairing algorithm (IPA) allows to make predictions even without a training set. Noise in the training data deteriorates performance. In both tasks we find a quadratic scaling relating dataset quality and size that is consistent with noise adding in square-root fashion and signal adding linearly when increasing the dataset. This implies that it is generally good to incorporate more data even if its quality is imperfect, thereby shedding light on the empirically observed performance of DCA applied to natural protein sequences.
homogenization scheme for piezoelectric composites with arbitrarily-oriented spheroidal inhomogeneities.Abstract In this work, the effective electro-elastic properties of piezoelectric composites are computed using the Maxwell homogenization method (MHM). The composites are made by several families of spheroidal inhomogeneities embedded in a homogeneous infinite medium (matrix). Each family of spheroidal inhomogeneities is made of the same material, and all the inhomogeneities have identical size and shape and are randomly oriented. The inhomogeneities and matrix materials exhibit piezoelectric transversely isotropic symmetry. It is shown that the shape of the "effective inclusion" substantially affects the effective piezoelectric properties. A new and simple form to calculate the aspect ratio of effective inclusion is presented. The effect on the overall piezoelectric properties due to the orientation of the inhomogeneities and different families of piezoelectric inhomogeneities is discussed. The MHM approach is applied in two examples, material with inhomogeneities having scatter orientation and composites with two different families of spheroidal inhomogeneities.
An analysis of the static behavior using the self-consistent effective medium method for a composite with randomly aligned spheroidal inclusions embedded in a matrix is studied. The constituents of which may have piezoelectric properties. Based on the self-consistent method proposed by F.J. Sabina and V. Levin an analysis of the static overall properties is developed for piezoelectric composites. The electroelastic Green's functions by Willis's approach is developed and the spheroidal shape inclusion is studied with the help of it. In that sense, the influence of various geometrical forms of the inclusions is analyzed. Comparisons with other self-consistent methods, finite element calculations, two scale asymptotic homogenization method and with experimental data show good results. [3249][3250][3251][3252][3253][3254][3255][3256][3257][3258][3259][3260][3261][3262][3263][3264] h i r .x, t/ D 1 j r j Z r . /.x, t, x 0 /dx 0 .(2.10)
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.