ArXiv

Robust $Q$-learning for mean-field control under Wasserstein uncertainty in common noise

Authors
Mathieu Laurière, Ariel Neufeld, Kyunghyun Park
Categories
math.OC, cs.AI, cs.LG, math.PR, stat.ML
arXiv
https://arxiv.org/abs/2606.20356v1
PDF
https://arxiv.org/pdf/2606.20356v1

Abstract

In this article, we present a robust $Q$-learning algorithm for discrete-time mean-field control problems under Wasserstein uncertainty in the common noise law. The algorithm combines a quantization-and-projection scheme with a Wasserstein dual reformulation on the common-noise space. We establish its convergence together with finite-time iteration bounds for both synchronous and asynchronous learning schemes. Numerical experiments on systemic risk and epidemic models compare the asynchronous implementation with an idealized Bellman iteration, illustrate the robustness-performance tradeoff under common-noise misspecification, and report the observed convergence behavior of the asynchronous $Q$-learning algorithm.