H Gu,
X Guo, X Wei,
R Xu - arXiv preprint arXiv:1911.07314, 2019 - researchgate.net
This paper establishes the time consistent property, ie, the dynamic programming principle
(DPP), for learning mean field controls (MFCs). The key idea is to define the correct form of …