Optica Open
Browse

Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?

Download (2.07 MB)
Version 2 2025-02-19, 05:03
Version 1 2025-02-18, 06:54
preprint
posted on 2025-02-19, 05:03 authored by Michael Doherty, Robin Matzner, Alejandra Beghelli, Polina Bayvel, Rasoul Sadeghi Yamchi
The application of reinforcement learning (RL) to dynamic resource allocation in optical networks has been the focus of intense research activity in recent years, with almost 100 peer-reviewed papers. We present a review of progress in the field, and identify significant gaps in benchmarking practices and reproducibility. To determine the strongest benchmark algorithms, we systematically evaluate several heuristics across diverse network topologies. We find that path count and sort criteria for path selection significantly affect the benchmark performance. We meticulously recreate the problems from five landmark papers and apply the improved benchmarks. Our comparisons demonstrate that simple heuristics consistently match or outperform the published RL solutions, often with an order of magnitude lower blocking probability. Furthermore, we present empirical lower bounds on network blocking using a novel defragmentation-based method, revealing that potential improvements over the benchmark heuristics are limited to 19-36% increased traffic load for the same blocking performance in our examples. We make our simulation framework and results publicly available to promote reproducible research and standardized evaluation https://doi.org/10.5281/zenodo.12594495.<p></p>

History

Funder Name

Engineering and Physical Sciences Research Council ( EP/S022139/1,EP/R035342/1)

Preprint ID

120915

Highlighter Commentary

This preprint reproduces five key reinforcement learning (RL) studies using improved heuristic benchmarks and finds that none of the RL methods consistently surpasses these enhanced heuristics. It concludes that while RL holds theoretical promise, practical gains in current settings are marginal, suggesting the need for better benchmarking and research focus on more complex, multi-objective problems. -- Ihtesham Khan, Nokia Bell Labs, USA

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC