2022
DOI: 10.48550/arxiv.2205.14345
Preprint

Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories

Abstract: Combinatorial optimisation problems framed as mixed integer linear programmes (MILPs) are ubiquitous across a range of real-world applications. The canonical branch-and-bound (B&B) algorithm seeks to exactly solve MILPs by constructing a search tree of increasingly constrained sub-problems. In practice, its solve time depends on heuristics, such as the choice of the next variable to constrain ('branching'). Recently, machine learning (ML) has emerged as a promising paradigm for branching. Ho…
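For intuition, here is a minimal sketch of B&B with a hand-crafted "most fractional" branching rule, the kind of heuristic that learned-branching work proposes to replace with a trained policy. The problem data and the SciPy-based LP relaxation solver are illustrative assumptions, not taken from the paper.

# A minimal sketch of branch-and-bound for a pure integer programme, using
# SciPy to solve the LP relaxations. The hand-crafted "most fractional"
# branching rule below stands in for a learned policy; the problem data at
# the bottom are illustrative, not taken from the paper.
import math
from scipy.optimize import linprog

def branch_and_bound(c, A_ub, b_ub, bounds):
    """Maximise c @ x subject to A_ub @ x <= b_ub, with x integer."""
    best_val, best_x = -math.inf, None
    stack = [bounds]  # each node = per-variable (lo, hi) bounds
    while stack:
        node_bounds = stack.pop()
        # Solve this sub-problem's LP relaxation (negate c: linprog minimises).
        res = linprog([-ci for ci in c], A_ub=A_ub, b_ub=b_ub,
                      bounds=node_bounds, method="highs")
        if not res.success or -res.fun <= best_val:
            continue  # infeasible, or pruned by the incumbent's bound
        # Branching heuristic: constrain the most fractional variable next.
        fracs = [abs(xi - round(xi)) for xi in res.x]
        j = max(range(len(fracs)), key=fracs.__getitem__)
        if fracs[j] < 1e-6:  # relaxation is integral: new incumbent
            best_val, best_x = -res.fun, [round(xi) for xi in res.x]
            continue
        lo, hi = node_bounds[j]
        down, up = list(node_bounds), list(node_bounds)
        down[j] = (lo, math.floor(res.x[j]))  # child with x_j <= floor(x_j*)
        up[j] = (math.ceil(res.x[j]), hi)     # child with x_j >= ceil(x_j*)
        stack.extend([down, up])
    return best_val, best_x

# Toy knapsack-style MILP: max 5*x0 + 4*x1 s.t. 6*x0 + 4*x1 <= 9, x in {0,1,2}^2.
print(branch_and_bound([5, 4], [[6, 4]], [9], [(0, 2), (0, 2)]))

ML-for-branching approaches such as Gasse et al. (2019), discussed in the citation statements below, learn to replace the variable-selection step above rather than the search procedure itself.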

Cited by 2 publications (2 citation statements)
References 22 publications (28 reference statements)
“…Consequently, practitioners rely on either approximate algorithms, which give restricted performance guarantees and poor scalability (Williamson and Shmoys, 2011), or heuristics, which have limited solution efficacy (Halim and Ismail, 2019). Since the first application of neural networks to CO by Hopfield and Tank (1985), the last decade has seen a resurgence in ML-for-CO (Bello et al., 2017; Dai et al., 2017; Barrett et al., 2019; Gasse et al., 2019; Barrett et al., 2022; Parsonson et al., 2022b). The advantages of ML-for-CO over approximation algorithms and heuristics include handling complex problems at scale; learning either without external input (achieving super-human performance) or by imitating strong but computationally expensive solvers; and, after training, leveraging the fast inference time of a DNN forward pass to rapidly generate solutions.…”
Section: Related Work (mentioning)
confidence: 99%
See 1 more Smart Citation
“…Consequently, practitioners rely on either approximate algorithms, which give restricted performance guarantees and poor scalability (Williamson and Shmoys, 2011), or heuristics, which have limited solution efficacy (Halim and Ismail, 2019). Since the first application of neural networks to CO by Hopfield and Tank (1985), the last decade has seen a resurgence in ML-for-CO (Bello* et al, 2017;Dai et al, 2017;Barrett et al, 2019;Gasse et al, 2019;Barrett et al, 2022;Parsonson et al, 2022b). The advantages of ML-for-CO over approximation algorithms and heuristics include handling complex problems at scale, learning either without external input and achieving super-human performance or imitating strong but computationally expensive solvers, and (after training) leveraging the fast inference time of a DNN forward pass to rapidly generate solutions.…”
Section: Related Workmentioning
confidence: 99%
“…To select the algorithm hyperparameters, we conducted a Bayesian search across the search space summarised in Table 5, with simulations conducted in a light 32-worker RAMP environment with a maximum simulation run time of 2 × 10⁵ seconds to speed up the search. We adopted similar search ranges to those used by Kurach et al. (2019); Hoffman et al. (2020); Parsonson et al. (2022b). For each set of hyperparameters, we ran the algorithm for 100 learner steps (a.k.a.…”
Section: Reinforcement Learning Algorithm (mentioning)
confidence: 99%
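As a concrete illustration of the hyperparameter search described in this passage, the sketch below runs a Gaussian-process Bayesian optimisation loop with scikit-optimize. The search dimensions, ranges, and the run_training stub are hypothetical placeholders; the actual search space used by the citing paper is the one summarised in its Table 5.

# A minimal, assumption-laden sketch of a Bayesian hyperparameter search in
# the spirit of the quoted passage, using scikit-optimize's GP minimiser.
from skopt import gp_minimize
from skopt.space import Integer, Real

# Hypothetical search space; the citing paper's real one is in its Table 5.
search_space = [
    Real(1e-5, 1e-3, prior="log-uniform", name="learning_rate"),
    Real(0.9, 0.999, name="discount_factor"),
    Integer(32, 512, name="batch_size"),
]

def run_training(lr, gamma, batch_size, num_workers=32, learner_steps=100):
    # Stub standing in for an RL training run in a light 32-worker
    # environment for 100 learner steps; it returns a synthetic score so
    # the sketch executes end-to-end. Replace with a real trainer.
    return (-1e6 * (lr - 3e-4) ** 2 - (gamma - 0.99) ** 2
            - abs(batch_size - 256) / 256)

def objective(params):
    lr, gamma, batch_size = params
    # gp_minimize minimises, so negate the score we want to maximise.
    return -run_training(lr, gamma, batch_size)

result = gp_minimize(objective, search_space, n_calls=50, random_state=0)
print("best hyperparameters:", result.x, "best score:", -result.fun)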