Fair model-based reinforcement learning comparisons with explicit and consistent update frequency

Thomas, Albert; Benechehab, Abdelhakim; Paolo, Giuseppe; Kégl, Balázs
ICLR 2024, 12th International Conference on Learning Representations, 3rd Blogpost Track, 7-11 May 2024, Vienna, Austria

Implicit update frequencies can introduce ambiguity in the interpretation of model-based reinforcement learning benchmarks, obscuring the real objective of the evaluation. While the update frequency can sometimes be optimized to improve performance, real-world applications often impose constraints, allowing updates only between deployments on the actual system. This blog post emphasizes the need for evaluations using consistent update frequencies across different algorithms to provide researchers and practitioners with clearer comparisons under realistic constraints.

 

Type:
Poster / Demo
City:
Vienna
Date:
2024-05-07
Department:
Data Science
Eurecom Ref:
8085
Copyright:
© EURECOM. Personal use of this material is permitted. The definitive version of this paper was published in ICLR 2024, 12th International Conference on Learning Representations, 3rd Blogpost Track, 7-11 May 2024, Vienna, Austria and is available at :

PERMALINK : https://www.eurecom.fr/publication/8085