[HTML][HTML] Reinforcement learning for disassembly system optimization problems: A survey

X Guo, Z Bi, J Wang, S Qin, S Liu, L Qi - International Journal of Network …, 2023 - sciltp.com
The disassembly complexity of end-of-life products increases continuously. Traditional
methods are facing difficulties in solving the decision-making and control problems of …

The dormant neuron phenomenon in deep reinforcement learning

G Sokar, R Agarwal, PS Castro… - … Conference on Machine …, 2023 - proceedings.mlr.press
In this work we identify the dormant neuron phenomenon in deep reinforcement learning,
where an agent's network suffers from an increasing number of inactive neurons, thereby …

Soft actor-critic for discrete action settings

P Christodoulou - arXiv preprint arXiv:1910.07207, 2019 - arxiv.org
Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm for continuous action
settings that is not applicable to discrete action settings. Many important settings involve …

A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

Pretraining representations for data-efficient reinforcement learning

M Schwarzer, N Rajkumar… - Advances in …, 2021 - proceedings.neurips.cc
Data efficiency is a key challenge for deep reinforcement learning. We address this problem
by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of …

Uncertainty-aware model-based reinforcement learning: Methodology and application in autonomous driving

J Wu, Z Huang, C Lv - IEEE Transactions on Intelligent Vehicles, 2022 - ieeexplore.ieee.org
To further improve learning efficiency and performance of reinforcement learning (RL), a
novel uncertainty-aware model-based RL method is proposed and validated in autonomous …

Elastic decision transformer

YH Wu, X Wang, M Hamaya - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract This paper introduces Elastic Decision Transformer (EDT), a significant
advancement over the existing Decision Transformer (DT) and its variants. Although DT …

Deepmdp: Learning continuous latent space models for representation learning

C Gelada, S Kumar, J Buckman… - International …, 2019 - proceedings.mlr.press
Many reinforcement learning (RL) tasks provide the agent with high-dimensional
observations that can be simplified into low-dimensional continuous states. To formalize this …

Planning with goal-conditioned policies

S Nasiriany, V Pong, S Lin… - Advances in neural …, 2019 - proceedings.neurips.cc
Planning methods can solve temporally extended sequential decision making problems by
composing simple behaviors. However, planning requires suitable abstractions for the states …

The efficiency misnomer

M Dehghani, A Arnab, L Beyer, A Vaswani… - arXiv preprint arXiv …, 2021 - arxiv.org
Model efficiency is a critical aspect of developing and deploying machine learning models.
Inference time and latency directly affect the user experience, and some applications have …