Behavioral diversity, expert imitation, fairness, safety goals and others give rise to preferences in sequential decision making domains that do not decompose additively …
Deep learning has proven to be effective in a wide variety of loss minimization problems. However, many applications of interest, like minimizing projected Bellman error and min …
X Chen, N He, Y Hu, Z Ye - Operations Research, 2024 - pubsonline.informs.org
We study a class of stochastic nonconvex optimization in the form of min x∈ XF (x)≔ E ξ [f (ϕ (x, ξ))], that is, F is a composition of a convex function f and a random function ϕ. Leveraging …
Stochastic gradient methods are increasingly employed in statistical inference tasks, such as parameter and interval estimation. Yet, much of the current theoretical framework mainly …