“…linear programming in Mitra, Reiman, and Wang (1998)), the RL algorithm used in this paper is a stochastic iterative algorithm that does not require a priori knowledge of the state transition model (i.e., the state transition probabilities) of the underlying Markov chain. It can therefore be applied to real network problems whose state spaces are too large for model-based algorithms, and it adapts automatically to real traffic conditions. This work extends the authors' earlier work (Brown, Tong, & Singh, 1999) by providing a more general framework for studying the CAC and routing problem under QoS constraints.…”