Deep inventory management

Loss of plasticity in continual deep reinforcement learning

Z Abbas, R Zhao, J Modayil, A White… - … on Lifelong Learning …, 2023 - proceedings.mlr.press

In this paper, we characterize the behavior of canonical value-based deep reinforcement
learning (RL) approaches under varying degrees of non-stationarity. In particular, we …

Cited by 81 · Related articles · All 4 versions

[HTML] sciencedirect.com

[HTML] An analysis of multi-agent reinforcement learning for decentralized inventory control systems

M Mousa, D van de Berg, N Kotecha… - Computers & Chemical …, 2024 - Elsevier

Most solutions to the inventory management problem assume a centralization of information
that is incompatible with organizational constraints in supply chain networks. The problem …

Cited by 14 · Related articles · All 2 versions

[PDF] neurips.cc

Policy optimization for continuous reinforcement learning

H Zhao, W Tang, D Yao - Advances in Neural Information …, 2024 - proceedings.neurips.cc

We study reinforcement learning (RL) in the setting of continuous time and space, for an
infinite horizon with a discounted objective and the underlying dynamics driven by a …

Cited by 18 · Related articles · All 6 versions

[PDF] mlr.press

Hindsight learning for MDPs with exogenous inputs

SR Sinclair, FV Frujeri, CA Cheng… - International …, 2023 - proceedings.mlr.press

Many resource management problems require sequential decision-making under
uncertainty, where the only uncertainty affecting the decision outcomes are exogenous …

Cited by 21 · Related articles · All 7 versions

[PDF] ssrn.com

Algorithmic and human collusion

T Werner - Available at SSRN 3960738, 2024 - papers.ssrn.com

I study self-learning pricing algorithms and show that they are collusive in market
simulations. To derive a counterfactual that resembles traditional tacit collusion, I conduct …

Cited by 28 · Related articles · All 14 versions

Deep reinforcement learning for continuous wood drying production line control

FA Tremblay, A Durand, M Morin, P Marier… - Computers in …, 2024 - Elsevier

Continuous high-frequency wood drying, when integrated with a traditional wood finishing
line, allows correcting moisture content one piece of lumber at a time in order to improve its …

Cited by 4 · Related articles · All 3 versions

[PDF] mlr.press

Model-based reinforcement learning with scalable composite policy gradient estimators

P Parmas, T Seno, Y Aoki - International Conference on …, 2023 - proceedings.mlr.press

In model-based reinforcement learning (MBRL), policy gradients can be estimated either by
derivative-free RL methods, such as likelihood ratio gradients (LR), or by backpropagating …

Cited by 7 · Related articles · All 4 versions

[PDF] arxiv.org

Deep neural newsvendor

J Han, M Hu, G Shen - arXiv preprint arXiv:2309.13830, 2023 - arxiv.org

We consider a data-driven newsvendor problem, where one has access to past demand
data and the associated feature information. We solve the problem by estimating the target …

Cited by 7 · Related articles · All 3 versions

[PDF] arxiv.org

Neural inventory control in networks via hindsight differentiable policy optimization

M Alvo, D Russo, Y Kanoria - arXiv preprint arXiv:2306.11246, 2023 - arxiv.org

Inventory management offers unique opportunities for reliably evaluating and applying deep
reinforcement learning (DRL). Rather than evaluate DRL algorithms by comparing against …

Cited by 7 · Related articles · All 2 versions

[PDF] arxiv.org

Pivoting Retail Supply Chain with Deep Generative Techniques: Taxonomy, Survey and Insights

Y Wang, LK Sambasivan, M Fu, P Mehrotra - arXiv preprint arXiv …, 2024 - arxiv.org

Generative AI applications, such as ChatGPT or DALL-E, have shown the world their
impressive capabilities in generating human-like text or image. Diving deeper, the science …

Cited by 2 · Related articles · All 2 versions

Loss of plasticity in continual deep reinforcement learning
