Sadegh Soudjani scite author profile

Abstract. In this work we introduce new approximate similarity relations that are shown to be key for policy (or control) synthesis over general Markov decision processes. The models of interest are discrete-time Markov decision processes, endowed with uncountably-infinite state spaces and metric output (or observation) spaces. The new relations, underpinned by the use of metrics, allow in particular for a useful trade-off between deviations over probability distributions on states, and distances between model outputs. We show that the new probabilistic similarity relations, inspired by a notion of simulation developed for finite-state models, can be effectively employed over general Markov decision processes for verification purposes, and specifically for control refinement from abstract models.

show abstract

FAUST $^{\mathsf 2}$ : Formal Abstractions of Uncountable-STate STochastic Processes

Soudjani¹,

Gevaerts²,

Abate³

2015

View full text Add to dashboard Cite

Robust Dynamic Programming for Temporal Logic Control of Stochastic Systems

Haesaert

Soudjani

2021

IEEE Trans. Automat. Contr.

View full text Add to dashboard Cite

Adaptive and Sequential Gridding Procedures for the Abstraction and Verification of Stochastic Processes

Soudjani¹,

Abate²

2013

SIAM J. Appl. Dyn. Syst.

110

View full text Add to dashboard Cite

This work is concerned with the generation of finite abstractions of general state-space processes to be employed in the formal verification of probabilistic properties by means of automatic techniques such as probabilistic model checkers. The work employs an abstraction procedure based on the partitioning of the state-space, which generates a Markov chain as an approximation of the original process. A novel adaptive and sequential gridding algorithm is presented and is expected to conform to the underlying dynamics of the model and thus to mitigate the curse of dimensionality unavoidably related to the partitioning procedure. The results are also extended to the general modeling framework known as stochastic hybrid systems. While the technique is applicable to a wide arena of probabilistic properties, with focus on the study of a particular specification (probabilistic safety, or invariance, over a finite horizon), the proposed adaptive algorithm is first benchmarked against a uniform gridding approach taken from the literature and finally tested on an applicative case study in Biology.

show abstract

From Dissipativity Theory to Compositional Construction of Finite Markov Decision Processes

Lavaei

Soudjani

Zamani

2018

View full text Add to dashboard Cite

This paper is concerned with a compositional approach for constructing finite Markov decision processes of interconnected discrete-time stochastic control systems. The proposed approach leverages the interconnection topology and a notion of so-called stochastic storage functions describing joint dissipativitytype properties of subsystems and their abstractions. In the first part of the paper, we derive dissipativitytype compositional conditions for quantifying the error between the interconnection of stochastic control subsystems and that of their abstractions. In the second part of the paper, we propose an approach to construct finite Markov decision processes together with their corresponding stochastic storage functions for classes of discrete-time control systems satisfying some incremental passivablity property. Under this property, one can construct finite Markov decision processes by a suitable discretization of the input and state sets. Moreover, we show that for linear stochastic control systems, the aforementioned property can be readily checked by some matrix inequality. We apply our proposed results to the temperature regulation in a circular building by constructing compositionally a finite Markov decision process of a network containing 200 rooms in which the compositionality condition does not require any constraint on the number or gains of the subsystems. We employ the constructed finite Markov decision process as a substitute to synthesize policies regulating the temperature in each room for a bounded time horizon.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Sadegh Soudjani

Verification of General Markov Decision Processes by Approximate Similarity Relations and Policy Refinement

FAUST $^{\mathsf 2}$ : Formal Abstractions of Uncountable-STate STochastic Processes

Robust Dynamic Programming for Temporal Logic Control of Stochastic Systems

Adaptive and Sequential Gridding Procedures for the Abstraction and Verification of Stochastic Processes

From Dissipativity Theory to Compositional Construction of Finite Markov Decision Processes

Contact Info

Product

Resources

About