Pranav Ashok scite author profile

Markov decision processes (MDPs) are standard models for probabilistic systems with non-deterministic behaviours. Long-run average rewards provide a mathematically elegant formalism for expressing long term performance. Value iteration (VI) is one of the simplest and most efficient algorithmic approaches to MDPs with other properties, such as reachability objectives. Unfortunately, a naive extension of VI does not work for MDPs with long-run average rewards, as there is no known stopping criterion. In this work our contributions are threefold.(1) We refute a conjecture related to stopping criteria for MDPs with longrun average rewards. (2) We present two practical algorithms for MDPs with long-run average rewards based on VI. First, we show that a combination of applying VI locally for each maximal end-component (MEC) and VI for reachability objectives can provide approximation guarantees. Second, extending the above approach with a simulation-guided on-demand variant of VI, we present an anytime algorithm that is able to deal with very large models. (3) Finally, we present experimental results showing that our methods significantly outperform the standard approaches on several benchmarks.

show abstract

PAC Statistical Model Checking for Markov Decision Processes and Stochastic Games

Ashok

Křetínský

Weininger

2019

View full text Add to dashboard Cite

Statistical model checking (SMC) is a technique for analysis of probabilistic systems that may be (partially) unknown. We present an SMC algorithm for (unbounded) reachability yielding probably approximately correct (PAC) guarantees on the results. We consider both the setting (i) with no knowledge of the transition function (with the only quantity required a bound on the minimum transition probability) and (ii) with knowledge of the topology of the underlying graph. On the one hand, it is the first algorithm for stochastic games. On the other hand, it is the first practical algorithm even for Markov decision processes. Compared to previous approaches where PAC guarantees require running times longer than the age of universe even for systems with a handful of states, our algorithm often yields reasonably precise results within minutes, not requiring the knowledge of mixing time.

show abstract

DeepAbstract: Neural Network Abstraction for Accelerating Verification

Ashok

Hashemi

Křetínský

et al. 2020

View full text Add to dashboard Cite

Continuous-Time Markov Decisions Based on Partial Exploration

Ashok

Butkova

Hermanns

et al. 2018

View full text Add to dashboard Cite

We provide a framework for speeding up algorithms for timebounded reachability analysis of continuous-time Markov decision processes. The principle is to find a small, but almost equivalent subsystem of the original system and only analyse the subsystem. Candidates for the subsystem are identified through simulations and iteratively enlarged until runs are represented in the subsystem with high enough probability. The framework is thus dual to that of abstraction refinement. We instantiate the framework in several ways with several traditional algorithms and experimentally confirm orders-of-magnitude speed ups in many cases.

show abstract

Approximating Values of Generalized-Reachability Stochastic Games

Ashok

Chatterjee

Křetínský

et al. 2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Pranav Ashok

Value Iteration for Long-Run Average Reward in Markov Decision Processes

PAC Statistical Model Checking for Markov Decision Processes and Stochastic Games

DeepAbstract: Neural Network Abstraction for Accelerating Verification

Continuous-Time Markov Decisions Based on Partial Exploration

Approximating Values of Generalized-Reachability Stochastic Games

Contact Info

Product

Resources

About