2019
DOI: 10.1088/1751-8121/ab5877
|View full text |Cite
|
Sign up to set email alerts
|

Parameter estimation for biochemical reaction networks using Wasserstein distances

Abstract: We present a method for estimating parameters in stochastic models of biochemical reaction networks by fitting steady-state distributions using Wasserstein distances. We simulate a reaction network at different parameter settings and train a Gaussian process to learn the Wasserstein distance between observations and the simulator output for all parameters. We then use Bayesian optimization to find parameters minimizing this distance based on the trained Gaussian process. The effectiveness of our method is demo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
18
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
2

Relationship

3
5

Authors

Journals

citations
Cited by 28 publications
(19 citation statements)
references
References 36 publications
1
18
0
Order By: Relevance
“…by measuring nascent RNA numbers using live-cell imaging techniques (such as the MS2 system) at several time points for many cells [35, 36]. Our ANN based inference method rests upon the matching of distributions and hence similar to non-ANN based methods developed in Refs [37, 38], it avoids the pitfalls of moment-based inference [39,40]. We note that the ability to approximate solutions of delay master equations from simulated data while simultaneously optimizing for the parameters has not been demonstrated before; deep learning frameworks have previously achieved similar feats for deterministic models [13, 16] and more recently for stochastic models described by multi-dimensional Fokker-Planck equations [41, 42].…”
Section: Discussionmentioning
confidence: 99%
“…by measuring nascent RNA numbers using live-cell imaging techniques (such as the MS2 system) at several time points for many cells [35, 36]. Our ANN based inference method rests upon the matching of distributions and hence similar to non-ANN based methods developed in Refs [37, 38], it avoids the pitfalls of moment-based inference [39,40]. We note that the ability to approximate solutions of delay master equations from simulated data while simultaneously optimizing for the parameters has not been demonstrated before; deep learning frameworks have previously achieved similar feats for deterministic models [13, 16] and more recently for stochastic models described by multi-dimensional Fokker-Planck equations [41, 42].…”
Section: Discussionmentioning
confidence: 99%
“…The discrepancy between empirical distributions and predicted distributions by our jump diffusion process is measured by Wasserstein distance for its smoothness (Arjovsky et al, 2017). Wasserstein distance is also used in parameter estimation of biochemical reaction networks (Öcal et al, 2019). Wasserstein-1 distance, also called Earth-Mover distance, is defined by where Π( P 1 , P 2 ) denotes the set of every joint distribution γ whose marginals are P 1 , P 2 respectively.…”
Section: Methodsmentioning
confidence: 99%
“…(2) and (4) simplify to while the covariances and between the species are zero; here, the subscript c denotes the constitutive limit. This drastic simplification reflects the fact that, in the constitutive limit, the distributions of mature RNA and local RNAP are Poissonian: as the regulatory network is effectively given by then, the result follows directly from the exact solution provided in Jahnke and Huisinga ( 2007 ).…”
Section: Detailed Stochastic Model Of Transcription: Set-up and Analymentioning
confidence: 99%
“…Thus far, we have derived expressions for the first two moments of the distributions of total RNAP and mature RNA. Naturally, it would also be useful to derive closed-form expressions for the distributions themselves; such a derivation is, however, analytically intractable in general (Jahnke and Huisinga 2007 ) due to the presence of the catalytic reaction , which models initiation of the transcription process. Still, there are two special cases where analytical distributions are known: (i) when the elongation time is considered to be fixed, which corresponds to our model with at constant (Heng et al.…”
Section: Approximate Distributions Of Total Rnap and Mature Rnamentioning
confidence: 99%