Maria I. Gorinova scite author profile

Stan is a probabilistic programming language that has been increasingly used for real-world scalable projects. However, to make practical inference possible, the language sacrifices some of its usability by adopting a block syntax, which lacks compositionality and flexible user-defined functions. Moreover, the semantics of the language has been mainly given in terms of intuition about implementation, and has not been formalised.This paper provides a formal treatment of the Stan language, and introduces the probabilistic programming language SlicStan -a compositional, self-optimising version of Stan. Our main contributions are (1) the formalisation of a core subset of Stan through an operational density-based semantics; (2) the design and semantics of the Stan-like language SlicStan, which facilities better code reuse and abstraction through its compositional syntax, more flexible functions, and information-flow type system; and (3) a formal, semanticpreserving procedure for translating SlicStan to Stan.

show abstract

A Live, Multiple-Representation Probabilistic Programming Environment for Novices

Gorinova

Sarkar

Blackwell³

et al. 2016

View full text Add to dashboard Cite

On the Unreasonable Effectiveness of Feature propagation in Learning on Graphs with Missing Node Features

Rossi¹,

Kenlay²,

Gorinova³

et al. 2021

Preprint

View full text Add to dashboard Cite

While Graph Neural Networks (GNNs) have recently become the de facto standard for modeling relational data, they impose a strong assumption on the availability of the node or edge features of the graph. In many real-world applications, however, features are only partially available; for example, in social networks, age and gender are available only for a small subset of users. We present a general approach for handling missing features in graph machine learning applications that is based on minimization of the Dirichlet energy and leads to a diffusion-type differential equation on the graph. The discretization of this equation produces a simple, fast and scalable algorithm which we call Feature Propagation. We experimentally show that the proposed approach outperforms previous methods on seven common node-classification benchmarks and can withstand surprisingly high rates of missing features: on average we observe only around 4% relative accuracy drop when 99% of the features are missing. Moreover, it takes only 10 seconds to run on a graph with ∼2.5M nodes and ∼123M edges on a single GPU.

show abstract

Predicting Gaming Related Properties from Twitter Profiles

Kalaitzis

Gorinova

Lewenberg

et al. 2016

View full text Add to dashboard Cite

Transforming spreadsheets with data noodles

Gorinova

Sarkar

Blackwell

et al. 2016

View full text Add to dashboard Cite

Data wrangling is the term used by data scientists for the work of re-organising data into a new structure, before work starts on reporting or analysis. We present a prototype that applies programming by example methods to data wrangling in spreadsheets. The Data Noodles system guides the user through constructing a simple example that illustrates how they would like their spreadsheet to look. A transformation program is then synthesised and executed to produce the final reshaped spreadsheet.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Maria I. Gorinova

Probabilistic programming with densities in SlicStan: efficient, flexible, and deterministic

A Live, Multiple-Representation Probabilistic Programming Environment for Novices

On the Unreasonable Effectiveness of Feature propagation in Learning on Graphs with Missing Node Features

Predicting Gaming Related Properties from Twitter Profiles

Transforming spreadsheets with data noodles

Contact Info

Product

Resources

About