Théophane Weber scite author profile

Scene representation-the process of converting visual sensory data into concise descriptions-is a requirement for intelligent behavior. Recent work has shown that neural networks excel at this task when provided with large, labeled datasets. However, removing the reliance on human labeling remains an important open problem. To this end, we introduce the Generative Query Network (GQN), a framework within which machines learn to represent scenes using only their own sensors. The GQN takes as input images of a scene taken from different viewpoints, constructs an internal representation, and uses this representation to predict the appearance of that scene from previously unobserved viewpoints. The GQN demonstrates representation learning without human labels or domain knowledge, paving the way toward machines that autonomously learn to understand the world around them.

show abstract

Learning and Querying Fast Generative Models for Reinforcement Learning

Buesing¹,

Weber²,

Racanière³

et al. 2018

Preprint

View full text Add to dashboard Cite

Imagination-Augmented Agents for Deep Reinforcement Learning

Weber¹,

Racanière²,

Reichert³

et al. 2017

Preprint

View full text Add to dashboard Cite

We introduce Imagination-Augmented Agents (I2As), a novel architecture for deep reinforcement learning combining model-free and model-based aspects. In contrast to most existing model-based reinforcement learning and planning methods, which prescribe how a model should be used to arrive at a policy, I2As learn to interpret predictions from a learned environment model to construct implicit plans in arbitrary ways, by using the predictions as additional context in deep policy networks. I2As show improved data efficiency, performance, and robustness to model misspecification compared to several baselines.

show abstract

Quantifying Statistical Interdependence by Message Passing on Graphs—Part I: One-Dimensional Point Processes

et al. 2009

View full text Add to dashboard Cite

Stochastic event synchrony is a technique to quantify the similarity of pairs of signals. First, "events" are extracted from the two given time series. Next, one tries to align events from one time series with events from the other. The better the alignment, the more similar the two time series are considered to be. In Part I, onedimensional events are considered, this paper (Paper II) concerns multi-dimensional events. Although the basic idea is similar, the extension to multi-dimensional point processes involves a significantly harder combinatorial problem, and therefore, it is non-trivial.Also in the multi-dimensional, the problem of jointly computing the pairwise alignment and SES parameters is cast as a statistical inference problem. This problem is solved by coordinate descent, more specifically, by alternating the following two steps: (i) one estimates the SES parameters from a given pairwise alignment; (ii) with the resulting estimates, one refines the pairwise alignment. The SES parameters are computed by maximum a posteriori (MAP) estimation (Step 1), in analogy to the one-dimensional case. The pairwise alignment (Step 2) can no longer be obtained through dynamic programming, since the state space becomes too large. Instead it is determined by applying the max-product algorithm on a cyclic graphical model.In order to test the robustness and reliability of the SES method, it is first applied to surrogate data. Next it is applied to detect anomalies in EEG synchrony of Mild Cognitive Impairment (MCI) patients. Numerical results suggest that SES is significantly more sensitive to perturbations in EEG synchrony than a large variety of classical synchrony measures.

show abstract

Quantifying Statistical Interdependence by Message Passing on Graphs—Part II: Multidimensional Point Processes

et al. 2009

View full text Add to dashboard Cite

show abstract

Automated Variational Inference in Probabilistic Programming

Wingate¹,

Weber²

2013

Preprint

View full text Add to dashboard Cite

We present a new algorithm for approximate inference in probabilistic programs, based on a stochastic gradient for variational programs. This method is efficient without restrictions on the probabilistic program; it is particularly practical for distributions which are not analytically tractable, including highly structured distributions that arise in probabilistic programs. We show how to automatically derive mean-field probabilistic programs and optimize them, and demonstrate that our perspective improves inference efficiency over other algorithms.

show abstract

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

Buesing¹,

Weber²,

Zwólš³

et al. 2018

Preprint

View full text Add to dashboard Cite

Learning policies on data synthesized by models can in principle quench the thirst of reinforcement learning algorithms for large amounts of real experience, which is often costly to acquire. However, simulating plausible experience de novo is a hard problem for many complex environments, often resulting in biases for modelbased policy evaluation and search. Instead of de novo synthesis of data, here we assume logged, real experience and model alternative outcomes of this experience under counterfactual actions, i.e. actions that were not actually taken. Based on this, we propose the Counterfactually-Guided Policy Search (CF-GPS) algorithm for learning policies in POMDPs from off-policy experience. It leverages structural causal models for counterfactual evaluation of arbitrary policies on individual off-policy episodes. CF-GPS can improve on vanilla model-based RL algorithms by making use of available logged data to de-bias model predictions. In contrast to off-policy algorithms based on Importance Sampling which re-weight data, CF-GPS leverages a model to explicitly consider alternative outcomes, allowing the algorithm to make better use of experience data. We find empirically that these advantages translate into improved policy evaluation and search results on a non-trivial grid-world task. Finally, we show that CF-GPS generalizes the previously proposed Guided Policy Search and that reparameterization-based algorithms such Stochastic Value Gradient can be interpreted as counterfactual methods.

show abstract

On Similarity Measures for Spike Trains

Dauwels

Vialatte

Weber

et al. 2009

View full text Add to dashboard Cite

Abstract. A variety of (dis)similarity measures for one-dimensional point processes (e.g., spike trains) are investigated, including the Victor-Purpura distance metric, the van Rossum distance metric, the Schreiber et al. similarity measure, the Hunter-Milton similarity measure, the event synchronization proposed by Quiroga, and the stochastic event synchrony measures (SES) recently proposed by Dauwels et al. By analyzing surrogate data, it is demonstrated that most measures are not able to distinguish timing precision and event reliability, i.e., they depend on both aspects of synchrony. There are two exceptions: with appropriate choice of parameters, event synchronization quantifies event reliability, independently of timing precision; the two SES parameters quantify both timing precision and event reliability separately. Before one can apply the (dis)similarity measures (with the exception of SES), one needs to determine potential lags between the point processes. On the other hand, SES deals with lags in a natural and direct way, and therefore, the SES similarity measures are robust to lags. As an illustration, neuronal spike data generated by the Morris-Lecar neuron model is considered.

show abstract

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.