Better Training of GFlowNets with Local Credit and Incomplete Trajectories

Pan, Ling; Malkin, Nikolay; Zhang, Dinghuai; Bengio, Yoshua

doi:10.48550/arxiv.2302.01687

Cited by 1 publication

(16 citation statements)

References 16 publications

“…Partial inference is a promising paradigm to resolve this issue by incorporating local credits (Pan et al, 2023a). Specifically, the partial inference aims to evaluate individual transitions or sub-trajectories, i.e., local credits, and provide informative training signals for identifying the specific contributions of actions.…”

Section: Partial Inference for GFlowNets (mentioning)

confidence: 99%

“…To sample from the Boltzmann distribution, GFlowNet trains the policy to assign action selection probability based on energy of terminal state (Bengio et al, 2021a;b; Malkin et al, 2022a), e.g., a high probability to the action responsible for the low terminal energy. However, such training has fundamental limitations in credit assignment, as it is hard to identify the action responsible for terminal energy (Pan et al, 2023a). This limitation stems from solely relying on the terminal energy associated with multiple actions, lacking the information to identify the contribution of individual actions, akin to challenges in RL with sparse reward (Arjona-Medina et al, 2019; Ren et al, 2022).…”

Section: Introduction (mentioning)

confidence: 99%
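
As a minimal illustration of the sampling target named in the statement above, the sketch below computes Boltzmann probabilities, p(x) proportional to exp(-E(x)), over a few terminal objects. The object names, the energy values, and the `temperature` parameter are assumptions made for illustration only; none of them come from the cited papers.

```python
import math

def boltzmann(energies, temperature=1.0):
    """Return p(x) proportional to exp(-E(x) / temperature) for each object x.

    `temperature` is a hypothetical knob added for illustration; with the
    default of 1.0 this is the plain Boltzmann distribution over energies.
    """
    # Shift by the minimum energy for numerical stability; the shift cancels
    # when the weights are normalized.
    e_min = min(energies.values())
    weights = {x: math.exp(-(e - e_min) / temperature) for x, e in energies.items()}
    z = sum(weights.values())
    return {x: w / z for x, w in weights.items()}

# Hypothetical terminal objects with made-up energies.
energies = {"a": 0.0, "b": 1.0, "c": 3.0}
probs = boltzmann(energies)

for name, p in probs.items():
    print(f"{name}: energy={energies[name]:.1f}  probability={p:.4f}")
```

Lower energy yields higher probability, but the distribution says nothing about which action in a multi-step construction of "a" was responsible for its low energy. That gap is the credit assignment problem the statement describes.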

“…An attractive paradigm to tackle this issue is partial inference (Pan et al, 2023a) that trains flow functions with local credits, e.g., evaluation of the intermediate states or transitions. Such local credit identifies individual action contributions to the terminal energy before reaching the terminal state.…”

Section: Introduction (mentioning)

confidence: 99%

“…Such local credit identifies individual action contributions to the terminal energy before reaching the terminal state. To this end, Pan et al (2023a) proposed a forward-looking GFlowNet (FL-GFN), which assigns the local credit based on the energy of incomplete objects associated with intermediate states.…”

Section: Introduction (mentioning)

confidence: 99%
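
One way to read "local credit based on the energy of incomplete objects" is as the change in energy across each transition. The sketch below works under that assumption, and under the further assumption that every intermediate state has a well-defined energy. The function name and the energy values are hypothetical, and this is not the FL-GFN implementation.

```python
def local_credits(state_energies):
    """Per-transition credits E(s_{t+1}) - E(s_t) along one trajectory.

    `state_energies` lists the energy of every state visited, from the
    initial state to the terminal state (values here are made up).
    """
    return [nxt - cur for cur, nxt in zip(state_energies, state_energies[1:])]

# Hypothetical energies of s_0, s_1, s_2, s_3 (s_3 is terminal).
traj_energies = [0.0, 0.5, -1.0, -2.5]
credits = local_credits(traj_energies)

# The credits telescope: their sum is the terminal energy minus the initial one.
total = sum(credits)
print(credits, total)
```

Because the differences telescope, each transition receives its own signal while the trajectory as a whole still accounts for the full terminal energy. The scheme depends entirely on intermediate energies being available and informative, which is the assumption the citing paper questions.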

“…We extensively validate LED-GFN on various tasks: set generation (Pan et al, 2023a), bag generation (Shen et al, 2023), molecular discovery (Bengio et al, 2021b), RNA sequence generation (Jain et al, 2022), and the maximum independent set problem (Zhang et al, 2023). We observe that LED-GFN (1) outperforms FL-GFN when the assumption of intermediate energy does not hold, (2) excels in practical domains compared to GFlowNets and RL-based baselines, and (3) achieves similar performance to FL-GFN even when intermediate energy provides the "ideal" local credit.…”

Section: Introduction (mentioning)

confidence: 99%

Emergence of Functional Circuits in the Early Visual Pathway

Jang

Paik

2022

KAIST Research Series

This paper studies generative flow networks (GFlowNets) to sample objects from the Boltzmann energy distribution via a sequence of actions. In particular, we focus on improving GFlowNet with partial inference: training flow functions with the evaluation of the intermediate states or transitions. To this end, the recently developed forward-looking GFlowNet reparameterizes the flow functions based on evaluating the energy of intermediate states. However, such an evaluation of intermediate energies may (i) be too expensive or impossible to evaluate and (ii) even provide misleading training signals under large energy fluctuations along the sequence of actions. To resolve this issue, we propose learning energy decompositions for GFlowNets (LED-GFN). Our main idea is to (i) decompose the energy of an object into learnable potential functions defined on state transitions and (ii) reparameterize the flow functions using the potential functions. In particular, to produce informative local credits, we propose to regularize the potential to change smoothly over the sequence of actions. It is also noteworthy that training GFlowNet with our learned potential can preserve the optimal policy. We empirically verify the superiority of LED-GFN in five problems including the generation of unstructured and maximum independent sets, molecular graphs, and RNA sequences.
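
The decomposition idea in the abstract can be illustrated with a toy sketch: per-transition potentials are fitted so that their sum matches the terminal energy, with a penalty that discourages large differences between them. Everything here is an assumption for illustration: the potentials are free parameters rather than a learned function of transitions, the variance-style penalty is a stand-in for the paper's regularizer, and the names `fit_potentials`, `smooth_weight`, `lr`, and `steps` are hypothetical.

```python
def fit_potentials(terminal_energy, init, smooth_weight=0.1, lr=0.05, steps=2000):
    """Fit potentials phi_t for one trajectory by plain gradient descent.

    Minimizes (sum_t phi_t - terminal_energy)^2
              + smooth_weight * sum_t (phi_t - mean(phi))^2.
    The second term is an illustrative smoothness penalty, not the
    regularizer used in the paper.
    """
    phi = list(init)
    for _ in range(steps):
        residual = sum(phi) - terminal_energy
        mean = sum(phi) / len(phi)
        # Gradient of both terms with respect to each phi_t.
        grads = [2.0 * residual + 2.0 * smooth_weight * (p - mean) for p in phi]
        phi = [p - lr * g for p, g in zip(phi, grads)]
    return phi

# Hypothetical trajectory with 4 transitions and terminal energy -2.0,
# starting from a deliberately uneven initial guess.
potentials = fit_potentials(-2.0, init=[1.0, 0.0, 0.0, 0.0])
print([round(p, 4) for p in potentials], round(sum(potentials), 4))
```

In this toy setting the potentials converge toward an even split of the terminal energy, since the penalty is the only thing distinguishing one decomposition from another. The step size is chosen for this short trajectory; a longer one would need a smaller `lr` for the same plain gradient descent to remain stable.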
