2022
DOI: 10.1038/s41467-021-27374-6

Mini-batch optimization enables training of ODE models on large-scale datasets

Abstract: Quantitative dynamic models are widely used to study cellular signal processing. A critical step in modelling is the estimation of unknown model parameters from experimental data. As model sizes and datasets are steadily growing, established parameter optimization approaches for mechanistic models become computationally extremely challenging. Mini-batch optimization methods, as employed in deep learning, have better scaling properties. In this work, we adapt, apply, and benchmark mini-batch optimization for or…
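The core idea in the abstract, restated as a runnable sketch: instead of evaluating the objective over all experimental conditions at every optimizer step, each step simulates and fits only a randomly sampled mini-batch of conditions. The two-state model, the synthetic data, and the crude finite-difference gradient below are illustrative assumptions, not the paper's implementation (which relies on ODE sensitivity analysis for gradients).

```python
# Minimal sketch of mini-batch parameter estimation for an ODE model.
# Hypothetical two-state model and synthetic data; NOT the authors' implementation.
import numpy as np
from scipy.integrate import solve_ivp

rng = np.random.default_rng(0)

def rhs(t, x, k1, k2, u):
    # x[0]: upstream species, x[1]: downstream species; u: stimulus level
    return [u - k1 * x[0], k1 * x[0] - k2 * x[1]]

t_obs = np.linspace(0.0, 10.0, 20)
stimuli = rng.uniform(0.5, 2.0, size=40)              # 40 experimental conditions
k_true = np.array([0.8, 0.3])

def simulate(theta, u):
    sol = solve_ivp(rhs, (0.0, 10.0), [0.0, 0.0], t_eval=t_obs,
                    args=(theta[0], theta[1], u), rtol=1e-6)
    return sol.y[1]                                    # observe the downstream species

data = np.array([simulate(k_true, u) for u in stimuli])
data += 0.05 * rng.standard_normal(data.shape)         # measurement noise

def batch_loss(theta, idx):
    res = [simulate(theta, stimuli[i]) - data[i] for i in idx]
    return 0.5 * np.mean(np.square(res))

def batch_grad(theta, idx, eps=1e-5):
    # forward-difference gradient; a sensitivity-based gradient would be used in practice
    g = np.zeros_like(theta)
    f0 = batch_loss(theta, idx)
    for j in range(theta.size):
        tp = theta.copy()
        tp[j] += eps
        g[j] = (batch_loss(tp, idx) - f0) / eps
    return g

theta = np.array([0.3, 0.6])                           # initial parameter guess
lr, batch_size = 0.5, 8
for epoch in range(15):
    perm = rng.permutation(len(stimuli))
    for start in range(0, len(stimuli), batch_size):
        idx = perm[start:start + batch_size]           # one mini-batch of conditions
        theta -= lr * batch_grad(theta, idx)
print("estimated parameters:", theta)
```

The point of the mini-batch step is cost: each update requires simulating only `batch_size` conditions rather than all of them, so the per-iteration cost stays roughly constant as the dataset grows.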

Cited by 12 publications (8 citation statements)
References 67 publications (68 reference statements)

“…Increasing the network size will quadratically increase the number of unknown parameters, which will significantly increase the computational requirements for obtaining robust solutions. Yet, recent work has shown that large estimation problems in ODE models may be broken into several smaller problems 75 , which may be applicable here, and is likely to yield large computational speed up by allowing parallelization of much smaller tasks. However, theory on how to merge potentially discrepant results between independently estimated overlapping subnetworks would need to be derived.…”
Section: Discussion (mentioning)
confidence: 99%
“…To that end, four strategies are widely utilized during training. (i) Mini-batch [25, 26]: mini-batch only utilizes a batch of data instead of the full data during each update to reduce memory usage and improve the training efficiency. (ii) Stochastic gradient descent (SGD) [27, 28]: The SGD strategy adds random factors in gradient calculation, which is generally fast and benefits the model’s generalization.…”
Section: Overview of Deep Learning Methods (mentioning)
confidence: 99%
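As a point of reference for strategies (i) and (ii) in the statement above, here is a minimal mini-batch SGD loop on a toy least-squares problem; the model, data, and hyperparameters are made up for illustration.

```python
# Generic mini-batch SGD loop on a toy least-squares problem: each update uses
# only a random subset (mini-batch) of the data, so the gradient is a
# stochastic estimate of the full-data gradient.
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((1000, 5))                 # 1000 samples, 5 features
w_true = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ w_true + 0.1 * rng.standard_normal(1000)

w, lr, batch_size = np.zeros(5), 0.05, 32
for epoch in range(10):
    perm = rng.permutation(len(X))                 # reshuffle once per epoch
    for start in range(0, len(X), batch_size):
        idx = perm[start:start + batch_size]       # the mini-batch
        grad = X[idx].T @ (X[idx] @ w - y[idx]) / len(idx)
        w -= lr * grad                             # stochastic gradient step
print("recovered weights:", np.round(w, 2))
```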
“…A common practice in data science is to use the elbow method [34], which was first employed by Thorndike [35]. The elbow method amounts to choosing K as the elbow of the curve of the minimum of the objective function in Eq (15) as K is varied. The interpretation of picking the elbow of the curve, in clustering, corresponds to choosing K such that adding further clusters does not provide a significantly better fit to the data.…”
Section: Clustering Analysis Reveals Functional Subgroups (mentioning)
confidence: 99%
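A small sketch of the elbow heuristic described above, using scikit-learn's k-means inertia as a stand-in for the cited work's objective (its Eq (15)); the synthetic data and the crude automatic elbow pick are assumptions for illustration.

```python
# Elbow heuristic for choosing the number of clusters K: minimize the
# clustering objective for each K and pick the point where adding further
# clusters stops giving a markedly better fit.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(2)
# synthetic data with 3 well-separated groups
data = np.vstack([rng.normal(loc=c, scale=0.3, size=(50, 2)) for c in (0, 3, 6)])

ks = list(range(1, 9))
objective = [KMeans(n_clusters=k, n_init=10, random_state=0).fit(data).inertia_
             for k in ks]

# crude elbow pick: the K after which the drop in the objective collapses
drops = -np.diff(objective)
elbow = ks[int(np.argmax(drops[:-1] / np.maximum(drops[1:], 1e-12))) + 1]
print("objective per K:", np.round(objective, 1))
print("chosen K (elbow):", elbow)
```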
“…Inspired by the method of stochastic gradient descent (SGD) [12][13][14] in machine learning, we propose a minibatch approach to tackle this issue: for each comparison between simulated and observed data, we use a stochastically sampled subset (minibatch) of the data. A similar minibatch method has been employed very recently by Stapor et al [15] to successfully calibrate ordinary differential equation (ODE) models with a significant improvement in computational performance, and by Seita et al [16] within the context of MCMC, likewise with a significant computational speed-up. We demonstrate that choosing a large enough minibatch ensures that the relevant signatures in the observed data can be accurately estimated, while avoiding unnecessary comparisons that slow down inference.…”
Section: Introduction (mentioning)
confidence: 99%
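A toy sketch of the minibatch idea in the statement above: each simulated-vs-observed comparison inside a rejection-ABC loop draws a fresh random subset of the observed data. The Gaussian model, summary statistic, and tolerance are illustrative assumptions, not taken from the cited works.

```python
# Minibatch comparisons inside rejection ABC: every distance evaluation uses
# only a random subset of the observed data instead of the full dataset.
import numpy as np

rng = np.random.default_rng(3)
observed = rng.normal(loc=1.5, scale=1.0, size=10_000)   # large observed dataset

def distance(theta, batch_size=200):
    # compare summary statistics on a freshly sampled minibatch only
    obs_batch = rng.choice(observed, size=batch_size, replace=False)
    sim_batch = rng.normal(loc=theta, scale=1.0, size=batch_size)
    return abs(obs_batch.mean() - sim_batch.mean())

accepted = []
for _ in range(5000):                                     # rejection-ABC loop
    theta = rng.uniform(-5.0, 5.0)                        # draw from the prior
    if distance(theta) < 0.2:                             # acceptance tolerance
        accepted.append(theta)
print("posterior mean estimate:", np.mean(accepted))
```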