2016
DOI: 10.1080/01621459.2015.1099535
|View full text |Cite
|
Sign up to set email alerts
|

Collective Estimation of Multiple Bivariate Density Functions With Application to Angular-Sampling-Based Protein Loop Modeling

Abstract: This paper develops a method for simultaneous estimation of This paper develops a method for simultaneous estimation of density functions for a collection of populations of protein backbone angle pairs using a data-driven, shared basis that is constructed by bivariate spline functions defined on a triangulation of the bivariate domain. The circular nature of angular data is taken into account by imposing appropriate smoothness constraints across boundaries of the triangles. Maximum penalized likelihood is used… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
6
1

Relationship

3
4

Authors

Journals

citations
Cited by 10 publications
(16 citation statements)
references
References 74 publications
0
16
0
Order By: Relevance
“…First, more robust methods for the fourth-digit prediction are needed. The increasing number of enzymes that have experimentally validated functions, as well as the advance in method development for learning from imbalanced-data and small samples ( Maadooliat et al , 2016 ), provide potential solutions to the problem. Second, instead of predicting the EC numbers for enzymes, it is practically useful to predict enzymatic reactions of the enzymes.…”
Section: Discussionmentioning
confidence: 99%
“…First, more robust methods for the fourth-digit prediction are needed. The increasing number of enzymes that have experimentally validated functions, as well as the advance in method development for learning from imbalanced-data and small samples ( Maadooliat et al , 2016 ), provide potential solutions to the problem. Second, instead of predicting the EC numbers for enzymes, it is practically useful to predict enzymatic reactions of the enzymes.…”
Section: Discussionmentioning
confidence: 99%
“…() used Dirichlet process mixtures for density estimation of the distribution of dihedral angles: a problem also considered by Maadooliat et al . () by using non‐parametric density estimation, focusing on modelling loop regions of a protein; see also Najibi et al . ().…”
Section: Mathematical Formulation: Unlabelled Shape Analysismentioning
confidence: 99%
“…For example, Boomsma et al (2008) used hidden Markov models as generative models for protein structure. Lennox et al (2009) used Dirichlet process mixtures for density estimation of the distribution of dihedral angles: a problem also considered by Maadooliat et al (2016) by using non-parametric density estimation, focusing on modelling loop regions of a protein; see also Najibi et al (2017). Motivated by direct modelling of the evolution of a protein, Golden et al (2017) described the shape of a protein as a sequence of dihedral angles on the torus; their model captures dependences between sequence and structure evolution through a diffusion process on the torus.…”
Section: Mathematical Formulation: Unlabelled Shape Analysismentioning
confidence: 99%
“…Ting et al [34] and Joo et al [35] also used this technique with further modification to produce near-native loop structures. In another approach, Maadooliat et al [36] proposed a penalized spline collective density estimator (PSCDE) to represent the log-densities based on some shared basis functions. This method showed some significant improvements for loop modeling of the hard cases in a benchmark dataset where existing methods do not work well [36] .…”
Section: Introductionmentioning
confidence: 99%
“…The PSCDE method is constructed based on Bernstein-Bézier spline basis functions defined over triangles to estimate the log-densities in a complex domain [36] . In simple words, in PSCDE,we artificially extended the constraints of the adjacent triangles to the triangles in boundaries in order to estimate the densities in a two-dimensional circular domain.…”
Section: Introductionmentioning
confidence: 99%