Abstract. We present a general framework for applying machine-learning algorithms to the verification of Markov decision processes (MDPs). The primary goal of these techniques is to improve performance by avoiding an exhaustive exploration of the state space. Our framework focuses on probabilistic reachability, a core property for verification, and is illustrated through two distinct instantiations. The first assumes that full knowledge of the MDP is available, and performs a heuristic-driven partial exploration of the model, yielding precise lower and upper bounds on the required probability. The second tackles the case where we may only sample the MDP, and yields probabilistic guarantees, again in terms of both lower and upper bounds, which provide efficient stopping criteria for the approximation. The latter is the first extension of statistical model checking to unbounded properties in MDPs. In contrast with other related approaches, we do not restrict our attention to time-bounded (finite-horizon) or discounted properties, nor assume any particular properties of the MDP. We also show how our techniques extend to LTL objectives. We present experimental results showing the performance of our framework on several examples.
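The partial-exploration idea above rests on maintaining, for every state, a lower and an upper bound on the reachability probability and tightening both until the gap closes. As a minimal illustration of this bounds-based idea (not the paper's algorithm; the toy MDP, the state names, and the simple Gauss-Seidel iteration are my own, and the sketch assumes the MDP has no non-trivial end components so the upper bound converges):

```python
# Toy MDP: state -> action -> list of (successor, probability).
# Structure and names are illustrative only.
mdp = {
    "s0": {"a": [("s1", 0.5), ("s2", 0.5)], "b": [("s2", 1.0)]},
    "s1": {"a": [("goal", 1.0)]},
    "s2": {"a": [("sink", 1.0)]},
    "goal": {"a": [("goal", 1.0)]},
    "sink": {"a": [("sink", 1.0)]},
}
target, sink = {"goal"}, {"sink"}

def reachability_bounds(mdp, target, sink, eps=1e-9):
    """Lower/upper bounds on max probability of reaching `target`.

    `sink` must contain the states that cannot reach the target;
    with no other end components both bounds converge to the optimum."""
    lo = {s: 1.0 if s in target else 0.0 for s in mdp}
    up = {s: 0.0 if s in sink else 1.0 for s in mdp}
    changed = True
    while changed:
        changed = False
        for vals in (lo, up):            # update both bound vectors in place
            for s in mdp:
                if s in target or s in sink:
                    continue
                # Bellman update: best action value under current bounds
                best = max(sum(p * vals[t] for t, p in succs)
                           for succs in mdp[s].values())
                if abs(best - vals[s]) > eps:
                    vals[s] = best
                    changed = True
    return lo, up

lo, up = reachability_bounds(mdp, target, sink)
# For this toy model both bounds meet at 0.5 for state "s0".
```

A heuristic-driven variant would update only the states visited by simulated runs, which is where the performance gain over exhaustive exploration comes from.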
The inference of demographic history from genome data is hindered by a lack of efficient computational approaches. In particular, it has proved difficult to exploit the information contained in the distribution of genealogies across the genome. We have previously shown that the generating function (GF) of genealogies can be used to analytically compute likelihoods of demographic models from configurations of mutations in short sequence blocks (Lohse et al. 2011). Although the GF has a simple, recursive form, the size of such likelihood calculations explodes quickly with the number of individuals and applications of this framework have so far been mainly limited to small samples (pairs and triplets) for which the GF can be written by hand. Here we investigate several strategies for exploiting the inherent symmetries of the coalescent. In particular, we show that the GF of genealogies can be decomposed into a set of equivalence classes that allows likelihood calculations from nontrivial samples. Using this strategy, we automated blockwise likelihood calculations for a general set of demographic scenarios in Mathematica. These histories may involve population size changes, continuous migration, discrete divergence, and admixture between multiple populations. To give a concrete example, we calculate the likelihood for a model of isolation with migration (IM), assuming two diploid samples without phase and outgroup information. We demonstrate the new inference scheme with an analysis of two individual butterfly genomes from the sister species Heliconius melpomene rosina and H. cydno.
We consider partially observable Markov decision processes (POMDPs) with ω-regular conditions specified as parity objectives. The class of ω-regular languages extends regular languages to infinite strings and provides a robust specification language to express all properties used in verification, and parity objectives are canonical forms to express ω-regular conditions. The qualitative analysis problem, given a POMDP and a parity objective, asks whether there is a strategy to ensure that the objective is satisfied with probability 1 (resp. positive probability). While the qualitative analysis problems are known to be undecidable even for very special cases of parity objectives, we establish decidability (with optimal complexity) of the qualitative analysis problems for POMDPs with all parity objectives under finite-memory strategies. We establish optimal (exponential) memory bounds and EXPTIME-completeness of the qualitative analysis problems under finite-memory strategies for POMDPs with parity objectives.

The class of ω-regular objectives. An objective specifies the desired set of behaviors (or paths) for the controller. In verification and control of stochastic systems, an objective is typically an ω-regular set of paths. The class of ω-regular languages extends classical regular languages to infinite strings, and provides a robust specification language to express all commonly used specifications, such as safety, reachability, liveness, fairness, etc. [37]. In a parity objective, every state of the MDP is mapped to a non-negative integer priority (or color), and the goal is to ensure that the minimum priority visited infinitely often is even. Parity objectives are a canonical way to define such ω-regular specifications (e.g., every specification in verification expressed as a linear-time temporal logic (LTL) formula can be translated to a parity objective).
Thus POMDPs with parity objectives provide the theoretical framework to study problems such as the verification and control of stochastic systems.

Qualitative and quantitative analysis. The analysis of POMDPs with parity objectives can be classified into qualitative and quantitative analysis. Given a POMDP with a parity objective and a start state, the qualitative analysis asks whether the objective can be ensured with probability 1 (almost-sure winning) or with positive probability (positive winning); whereas the quantitative analysis asks whether the objective can be satisfied with probability at least λ for a given threshold λ ∈ (0, 1). The known results are as follows: (i) the limit-sure winning problem is EXPTIME-complete for safety objectives, and undecidable for reachability, Büchi, coBüchi, parity, and Muller objectives; (ii) the almost-sure winning problem is EXPTIME-complete for safety, reachability, and Büchi objectives, and undecidable for coBüchi, parity, and Muller objectives; (iii) the positive winning problem is PTIME-complete for reachability objectives, EXPTIME-complete for safety and coBüchi objectives, and undecidable for Büchi, parity, and Muller objectives.
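Since a parity objective depends only on the priorities that occur infinitely often, it can be checked directly on an ultimately periodic play, i.e., a finite prefix followed by an infinitely repeated cycle. A small illustrative Python check (the priority assignment and state names are a made-up example):

```python
def parity_satisfied(prefix, cycle, priority):
    """Check a parity objective on an ultimately periodic play.

    The play visits `prefix` once and then repeats `cycle` forever,
    so exactly the states of `cycle` are visited infinitely often;
    the minimum priority among them must be even.  States in the
    prefix occur only finitely often and are irrelevant."""
    inf_priorities = {priority[s] for s in cycle}
    return min(inf_priorities) % 2 == 0

# Hypothetical priority (color) assignment.
priority = {"s0": 1, "s1": 2, "s2": 3}

parity_satisfied(["s0"], ["s1", "s2"], priority)  # min {2, 3} = 2, even
parity_satisfied(["s0"], ["s2"], priority)        # min {3} = 3, odd
```

The difficulty in POMDPs is, of course, not checking a single play but finding a (finite-memory) strategy whose almost all plays satisfy the condition.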
We consider partially observable Markov decision processes (POMDPs), which are a standard framework in robotics for modeling the uncertainties present in the real world, with temporal logic specifications. All specifications in linear-time temporal logic (LTL) can be expressed as parity objectives. We study the qualitative analysis problem for POMDPs with parity objectives, which asks whether there is a controller (policy) ensuring that the objective holds with probability 1 (almost-surely). While the qualitative analysis of POMDPs with parity objectives is undecidable, recent results show that, when restricted to finite-memory policies, the problem is EXPTIME-complete. While the problem is intractable in theory, we present a practical approach to solve it. We design several heuristics to deal with the exponential complexity, and apply our implementation to a number of well-known POMDP examples from robotics applications. Our results provide the first practical approach to the qualitative analysis of robot motion planning with LTL properties in the presence of uncertainty.
We consider partially observable Markov decision processes (POMDPs) with a set of target states, where every transition is associated with an integer cost. The optimization objective we study asks to minimize the expected total cost until the target set is reached, while ensuring that the target set is reached almost-surely (with probability 1). We show that for integer costs, approximating the optimal cost is undecidable. For positive costs, our results are as follows: (i) we establish matching lower and upper bounds for the optimal cost, and the bound is double exponential; (ii) we show that the problem of approximating the optimal cost is decidable and present approximation algorithms building on the existing algorithms for POMDPs with finite-horizon objectives. While the worst-case running time of our algorithm is double exponential, we also present efficient stopping criteria for the algorithm and show experimentally that it performs well on many examples of interest.

We present the definitions of POMDPs, strategies, objectives, and other basic notions required for our results. Throughout this work, we follow standard notations from [28, 19].

Notations. Given a finite set X, we denote by P(X) the set of subsets of X, i.e., P(X) is the power set of X. A probability distribution f on X is a function f : X → [0, 1] such that ∑_{x∈X} f(x) = 1, and we denote by D(X) the set of all probability distributions on X. For f ∈ D(X) we denote by Supp(f) = {x ∈ X | f(x) > 0} the support of f.
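The distribution and support notions above translate directly into code. A minimal Python sketch (function names are my own, not from the text):

```python
from math import isclose

def is_distribution(f):
    """Check that f : X -> [0, 1] (as a dict) satisfies sum_x f(x) = 1."""
    return (all(0.0 <= p <= 1.0 for p in f.values())
            and isclose(sum(f.values()), 1.0))

def support(f):
    """Supp(f) = {x in X | f(x) > 0}."""
    return {x for x, p in f.items() if p > 0}

f = {"s0": 0.25, "s1": 0.75, "s2": 0.0}
is_distribution(f)   # True: values in [0, 1] and they sum to 1
support(f)           # {"s0", "s1"}: s2 has probability 0
```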
POMDPs. A Partially Observable Markov Decision Process (POMDP) is a tuple G = (S, A, δ, Z, O, λ₀) where: (i) S is a finite set of states; (ii) A is a finite alphabet of actions; (iii) δ : S × A → D(S) is a probabilistic transition function that, given a state s and an action a ∈ A, gives the probability distribution over the successor states, i.e., δ(s, a)(s′) denotes the transition probability from s to s′ given action a; (iv) Z is a finite set of observations; (v) O : S → Z is an observation function that maps every state to an observation; and (vi) λ₀ is a probability distribution for the initial state, and for all s, s′ ∈ Supp(λ₀) we require that O(s) = O(s′). If the initial distribution is Dirac, we often write λ₀ as s₀, where s₀ is the unique starting (or initial) state. Given s, s′ ∈ S and a ∈ A, we also write δ(s′ | s, a) for δ(s, a)(s′). A state s is absorbing if for all actions a we have δ(s, a)(s) = 1 (i.e., s is never left). For an observation z, we denote by O⁻¹(z) = {s ∈ S | O(s) = z} the set of states with observation z. For a set U ⊆ S of states and a set Z′ ⊆ Z of observations, we denote O(U) = {z ∈ Z | ∃s ∈ U. O(s) = z} and O⁻¹(Z′) = ∪_{z∈Z′} O⁻¹(z). A POMDP is a perfect-observation (or perfect-information) MDP if each state has a unique observation.

Plays and cones. A play (or a path) in a POMDP is an infinite sequence (s₀, a₀, s₁, a₁, s₂, a₂, …) of states and actions such that for all i ≥ 0 we have δ(sᵢ, aᵢ)(sᵢ₊₁) > 0 and s₀ ∈ Supp(λ₀). We write Ω for the set of all plays.
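The tuple G = (S, A, δ, Z, O, λ₀) and the derived notions (O⁻¹(z), absorbing states, the requirement that all states in Supp(λ₀) share one observation) can be encoded compactly. A hypothetical Python encoding, with field names and the toy instance entirely of my own choosing:

```python
from dataclasses import dataclass

@dataclass
class POMDP:
    """Illustrative encoding of the tuple G = (S, A, delta, Z, O, lambda0)."""
    states: set        # S
    actions: set       # A
    delta: dict        # (s, a) -> {s': probability}
    obs_set: set       # Z
    obs: dict          # O : s -> z
    init: dict         # lambda0 : s -> probability

    def validate(self):
        # delta(s, a) must be a probability distribution for every (s, a)
        for (s, a), dist in self.delta.items():
            assert abs(sum(dist.values()) - 1.0) < 1e-9, (s, a)
        # all states in Supp(lambda0) must share a single observation
        assert len({self.obs[s] for s, p in self.init.items() if p > 0}) == 1

    def obs_inverse(self, z):
        """O^{-1}(z): states with observation z."""
        return {s for s in self.states if self.obs[s] == z}

    def is_absorbing(self, s):
        """s is absorbing iff delta(s, a)(s) = 1 for every action a."""
        return all(self.delta[(s, a)].get(s, 0.0) == 1.0 for a in self.actions)

# A toy two-state instance with a Dirac initial distribution.
G = POMDP(
    states={"s0", "s1"},
    actions={"a"},
    delta={("s0", "a"): {"s1": 1.0}, ("s1", "a"): {"s1": 1.0}},
    obs_set={"z0", "z1"},
    obs={"s0": "z0", "s1": "z1"},
    init={"s0": 1.0},
)
G.validate()
G.obs_inverse("z1")   # {"s1"}
G.is_absorbing("s1")  # True: the only action loops with probability 1
```

Since every state here has its own observation, this particular instance is in fact a perfect-observation MDP in the sense defined above.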
We consider the problem of computing the set of initial states of a dynamical system such that there exists a control strategy to ensure that the trajectories satisfy a temporal logic specification with probability 1 (almost-surely). We focus on discrete-time, stochastic linear dynamics and specifications given as formulas of the Generalized Reactivity(1) fragment of Linear Temporal Logic over linear predicates in the states of the system. We propose a solution based on iterative abstraction-refinement, and turn-based 2-player probabilistic games. While the theoretical guarantee of our algorithm after any finite number of iterations is only a partial solution, we show that if our algorithm terminates, then the result is the set of satisfying initial states. Moreover, for any (partial) solution our algorithm synthesizes witness control strategies to ensure almost-sure satisfaction of the temporal logic specification. We demonstrate our approach on an illustrative case study.