
Training and evaluation data for routing and wavelength assignment using multi objective reinforcement learning



Type

Dataset

Authors

Nallaperuma, Samadhi 
Gan, Zelin 
Nevin, Josh 
Shevchenko, Mykyta 
Savory, Seb 

Description

This dataset contains training and evaluation data generated by multi-objective RL models for the bi-objective and three-objective cases, together with the trained models. The file training_monitor.csv records the training data: each row corresponds to one episode and gives the rewards and the numbers of processed and accepted services. The file evaluation_monitor.csv holds the corresponding evaluation data, again with one row per episode giving the rewards and the numbers of processed and accepted services. The trained models are saved as best_model.zip and can be used to reproduce the evaluation results.
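As a rough illustration of how the per-episode monitor files might be read programmatically, the sketch below loads a monitor-style CSV and averages one column. It assumes the files follow the Stable Baselines3 Monitor layout (an optional leading `#` metadata line, then a header row with `r` for episode reward, `l` for episode length and `t` for elapsed time); the column names used for processed and accepted services are not stated in this record, so check the header of the actual files. The helper names and the sample data are hypothetical.

```python
import csv
import io
from statistics import mean


def load_monitor_rows(stream):
    """Read a monitor-style CSV into a list of dicts, one per episode.

    Lines starting with '#' are treated as metadata and skipped
    (assumption: the files use the Stable Baselines3 Monitor layout).
    """
    data_lines = (line for line in stream if not line.startswith("#"))
    return list(csv.DictReader(data_lines))


def column_mean(rows, column):
    """Average a numeric column over all episodes."""
    return mean(float(row[column]) for row in rows)


# Synthetic stand-in for training_monitor.csv; the values are made up
# and are NOT taken from the dataset.
sample = io.StringIO(
    '#{"t_start": 0.0, "env_id": "hypothetical-env"}\n'
    "r,l,t\n"
    "1.0,10,0.5\n"
    "3.0,10,1.0\n"
)
rows = load_monitor_rows(sample)
mean_reward = column_mean(rows, "r")
```

With the real files, the same helper could be applied to an opened file, for example `with open("training_monitor.csv", newline="") as f: rows = load_monitor_rows(f)`.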

Version

Software / Usage instructions

The training and evaluation data are in .csv format and can be opened with any CSV viewer (for example Excel or a text editor). To reproduce the results, run the trained RL models following the instructions for the Stable Baselines3 evaluate_policy function: https://stable-baselines3.readthedocs.io/en/master/_modules/stable_baselines3/common/evaluation.html
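A minimal sketch of how a saved model might be evaluated with Stable Baselines3 follows. This record does not state which algorithm class the models were trained with or which environment they expect, so both are passed in as parameters here; `evaluate_saved_model`, `algo_cls` and `env` are hypothetical names, and a compatible routing and wavelength assignment environment has to be supplied by the user.

```python
def evaluate_saved_model(algo_cls, env, model_path="best_model.zip", n_eval_episodes=10):
    """Load a saved Stable Baselines3 model and evaluate it on `env`.

    algo_cls: the Stable Baselines3 algorithm class the model was trained
              with (not stated in this record, so an assumption).
    env:      a Gym-style environment compatible with the saved model
              (also an assumption; it is not part of this dataset).
    """
    # Imported lazily so this sketch can be read without the library installed.
    from stable_baselines3.common.evaluation import evaluate_policy

    model = algo_cls.load(model_path, env=env)
    # With default arguments, evaluate_policy returns the mean and standard
    # deviation of the episode reward over n_eval_episodes episodes.
    mean_reward, std_reward = evaluate_policy(
        model, env, n_eval_episodes=n_eval_episodes, deterministic=True
    )
    return mean_reward, std_reward
```

If the models were trained with, say, PPO, a call might look like `evaluate_saved_model(PPO, env)`; the choice of PPO is only an example and is not confirmed by this record.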

Keywords

Evaluation data

Publisher

Sponsorship
Engineering and Physical Sciences Research Council (EP/R035342/1)
EPSRC TRANSNET project (EP/R035342/1)
Relationships
Supplements: