Neural Estimation and Optimization of Directed Information over Continuous Spaces

Tsur, Dor; Aharoni, Ziv; Goldfeld, Ziv; Permuter, Haim H.

doi:10.48550/arxiv.2203.14743

Cited by 1 publication (9 citation statements)

References 61 publications (104 reference statements)


“…The DINE [22] is an RNN-based estimator of I(X → Y) from a sample D_n := (X^n, Y^n) ∼ P_{X^n Y^n}. Its derivation begins with a representation of DI rate as the asymptotic difference of the following KL divergence terms:…”

Section: Directed Information Neural Estimation (mentioning)

confidence: 99%

“…, or in their parametrized form g_θ, where θ ∈ Θ. With this notation, the DINE objective is given by [22]…”

Section: Directed Information Neural Estimation (mentioning)

confidence: 99%

“…The DINE architecture is portrayed in Figure 1. For formal consistency guarantees for DINE, as well as implementation details, the reader is referred to [22].…”

Section: Directed Information Neural Estimation (mentioning)

confidence: 99%

“…For memoryless channels, joint estimation-optimization methods over continuous input spaces were proposed in [20,21]. The case of channels with memory was recently treated in [22] using the DI neural estimator (DINE) developed therein. The DINE parametrizes the Donsker-Varadhan representation of DI by recurrent neural networks (RNNs), approximates expectations by sample means, and optimizes the resulting objective over the parameter space.…”

Section: Introduction (mentioning)

confidence: 99%
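
The statement above describes approximating the Donsker-Varadhan (DV) representation with sample means. As a rough illustration only, the sketch below evaluates the DV lower bound E_P[T] − log E_Q[exp(T)] for a KL divergence between two scalar Gaussians, using fixed hand-written critics. This is not DINE itself: there is no RNN, no training loop, and no directed information here. The function name dv_lower_bound, the choice of Gaussians, and both critics are assumptions made for the example.

```python
import numpy as np

def dv_lower_bound(critic, p_samples, q_samples):
    """Sample-mean estimate of the DV bound E_P[T] - log E_Q[exp(T)].

    `critic` is any function T applied elementwise to the samples.
    (Hypothetical helper, not taken from the cited papers.)
    """
    t_p = critic(p_samples)
    t_q = critic(q_samples)
    # Stable log-mean-exp: subtract the maximum before exponentiating.
    m = np.max(t_q)
    log_mean_exp = m + np.log(np.mean(np.exp(t_q - m)))
    return np.mean(t_p) - log_mean_exp

rng = np.random.default_rng(0)
p = rng.normal(1.0, 1.0, 200_000)  # samples from P = N(1, 1)
q = rng.normal(0.0, 1.0, 200_000)  # samples from Q = N(0, 1)

# For these Gaussians, log dP/dQ (x) = x - 0.5, which attains the DV supremum
# and gives KL(P || Q) = 0.5 in closed form.
optimal_critic = lambda x: x - 0.5
# Any other critic gives a looser bound; this one yields 0.375 in closed form.
suboptimal_critic = lambda x: 0.5 * x

est_opt = dv_lower_bound(optimal_critic, p, q)
est_sub = dv_lower_bound(suboptimal_critic, p, q)
print(f"optimal critic: {est_opt:.3f}, suboptimal critic: {est_sub:.3f}")
```

In a neural estimator the critic would be a parametrized network, and the same sample-mean objective would be maximized over its parameters instead of being evaluated at a fixed function.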

“…The DINE parametrizes the Donsker-Varadhan representation of DI by recurrent neural networks (RNNs), approximates expectations by sample means, and optimizes the resulting objective over the parameter space. To compute the feedback capacity, [22] further proposed an RNN-based generative model for continuous input distributions and jointly optimized it with DINE by propagating gradients through both models. These methods hinge on the end-to-end differentiability of the joint model, which fails to hold for discrete input alphabets.…”

Section: Introduction (mentioning)

confidence: 99%


Optimizing Estimated Directed Information over Discrete Alphabets

Tsur, Aharoni, Goldfeld, et al. 2022

2022 IEEE International Symposium on Information Theory (ISIT)


Directed information (DI) is a fundamental measure for the study and analysis of sequential stochastic models. In particular, when optimized over input distributions it characterizes the capacity of general communication channels. However, analytic computation of DI is typically intractable and existing optimization techniques over discrete input alphabets require knowledge of the channel model, which renders them inapplicable when only samples are available. To overcome these limitations, we propose a novel estimation-optimization framework for DI over discrete input spaces. We formulate DI optimization as a Markov decision process and leverage reinforcement learning techniques to optimize a deep generative model of the input process probability mass function (PMF). Combining this optimizer with the recently developed DI neural estimator, we obtain an end-to-end estimation-optimization algorithm which is applied to estimating the (feedforward and feedback) capacity of various discrete channels with memory. Furthermore, we demonstrate how to use the optimized PMF model to (i) obtain theoretical bounds on the feedback capacity of unifilar finite-state channels; and (ii) perform probabilistic shaping of constellations in the peak power-constrained additive white Gaussian noise channel.
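
For orientation, the quantity the abstract refers to can be written out using the standard (Massey) definition of directed information and its rate. The notation below follows the common convention X^i = (X_1, …, X_i) and is an assumption, since the abstract itself does not spell it out.

```latex
% Directed information from X^n to Y^n (standard definition; notation assumed)
I(X^n \to Y^n) := \sum_{i=1}^{n} I\left(X^i ; Y_i \mid Y^{i-1}\right)

% Directed information rate, when the limit exists
I(X \to Y) := \lim_{n \to \infty} \frac{1}{n} \, I(X^n \to Y^n)
```

Unlike mutual information, each summand conditions only on past outputs and uses only inputs up to time i, which is what makes the measure directional.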

