Human motion trajectory prediction: A survey

A Rudenko, L Palmieri, M Herman… - … Journal of Robotics …, 2020 - journals.sagepub.com
With growing numbers of intelligent autonomous systems in human environments, the ability
of such systems to perceive, understand, and anticipate human behavior becomes …

Reinforcement learning for demand response: A review of algorithms and modeling techniques

JR Vázquez-Canteli, Z Nagy - Applied energy, 2019 - Elsevier
Buildings account for about 40% of the global energy consumption. Renewable energy
resources are one possibility to mitigate the dependence of residential buildings on the …

Minimum cost flows, MDPs, and ℓ1-regression in nearly linear time for dense instances

J Van Den Brand, YT Lee, YP Liu, T Saranurak… - Proceedings of the 53rd …, 2021 - dl.acm.org
In this paper we provide new randomized algorithms with improved runtimes for solving
linear programs with two-sided constraints. In the special case of the minimum cost flow …

An approach for service function chain routing and virtual function network instance migration in network function virtualization architectures

V Eramo, E Miucci, M Ammar… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org
Network function virtualization foresees the virtualization of service functions and their
execution on virtual machines. Any service is represented by a service function chain (SFC) …

Near-optimal time and sample complexities for solving Markov decision processes with a generative model

A Sidford, M Wang, X Wu, L Yang… - Advances in Neural …, 2018 - proceedings.neurips.cc
In this paper we consider the problem of computing an $\epsilon $-optimal policy of a
discounted Markov Decision Process (DMDP) provided we can only access its transition …

Joint status sampling and updating for minimizing age of information in the Internet of Things

B Zhou, W Saad - IEEE Transactions on Communications, 2019 - ieeexplore.ieee.org
The effective operation of time-critical Internet of things (IoT) applications requires real-time
reporting of fresh status information of underlying physical processes. In this paper, a real …

On the convergence of projective-simulation–based reinforcement learning in Markov decision processes

WL Boyajian, J Clausen, LM Trenkwalder… - Quantum machine …, 2020 - Springer
In recent years, the interest in leveraging quantum effects for enhancing machine learning
tasks has significantly increased. Many algorithms speeding up supervised and …

Offloading time optimization via Markov decision process in mobile-edge computing

G Yang, L Hou, X He, D He, S Chan… - IEEE internet of things …, 2020 - ieeexplore.ieee.org
Computation offloading from a mobile device to the edge server is an emerging paradigm to
reduce completion latency of intensive computations in mobile-edge computing (MEC). In …

[图书][B] Reinforcement learning: An introduction

RS Sutton, AG Barto - 2018 - books.google.com
The significantly expanded and updated new edition of a widely used text on reinforcement
learning, one of the most active research areas in artificial intelligence. Reinforcement …

Reinforcement learning: A survey

LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org
This paper surveys the field of reinforcement learning from a computer-science perspective.
It is written to be accessible to researchers familiar with machine learning. Both the historical …