Learning the travelling salesperson problem requires rethinking generalization

Joshi, Chaitanya K; Cappart, Quentin; Rousseau, Louis-Martin; Laurent, Thomas

Published version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/337764

Repository DOI

https://doi.org/10.17863/CAM.85173

Type

Article

Authors

Joshi, Chaitanya K

https://orcid.org/0000-0003-4722-1815

Cappart, Quentin

Rousseau, Louis-Martin

Laurent, Thomas

Abstract

End-to-end training of neural network solvers for graph combinatorial optimization problems such as the Travelling Salesperson Problem (TSP) have seen a surge of interest recently, but remain intractable and inefficient beyond graphs with few hundreds of nodes. While state-of-the-art learning-driven approaches for TSP perform closely to classical solvers when trained on trivially small sizes, they are unable to generalize the learnt policy to larger instances at practical scales. This work presents an end-to-end neural combinatorial optimization pipeline that unifies several recent papers in order to identify the inductive biases, model architectures and learning algorithms that promote generalization to instances larger than those seen in training. Our controlled experiments provide the first principled investigation into such zero-shot generalization, revealing that extrapolating beyond training data requires rethinking the neural combinatorial optimization pipeline, from network layers and learning paradigms to evaluation protocols. Additionally, we analyze recent advances in deep learning for routing problems through the lens of our pipeline and provide new directions to stimulate future research.

Keywords

46 Information and Computing Sciences, 4611 Machine Learning

Journal Title

Constraints

Journal ISSN

1383-7133
1572-9354

Volume Title

27

Publisher

Springer Science and Business Media LLC

Publisher DOI

https://doi.org/10.1007/s10601-022-09327-y

Rights

Attribution 4.0 International

Collections

Jisc Publications Router