- 学术资源搜索

[HTML][HTML] Reinforcement learning for disassembly system optimization problems: A survey

X Guo, Z Bi, J Wang, S Qin, S Liu, L Qi - International Journal of Network …, 2023 - sciltp.com

The disassembly complexity of end-of-life products increases continuously. Traditional
methods are facing difficulties in solving the decision-making and control problems of …

被引用次数：43 相关文章所有 2 个版本

[PDF] mlr.press

The dormant neuron phenomenon in deep reinforcement learning

G Sokar, R Agarwal, PS Castro… - … Conference on Machine …, 2023 - proceedings.mlr.press

In this work we identify the dormant neuron phenomenon in deep reinforcement learning,
where an agent's network suffers from an increasing number of inactive neurons, thereby …

被引用次数：42 相关文章所有 6 个版本

[PDF] arxiv.org

Soft actor-critic for discrete action settings

P Christodoulou - arXiv preprint arXiv:1910.07207, 2019 - arxiv.org

Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm for continuous action
settings that is not applicable to discrete action settings. Many important settings involve …

被引用次数：302 相关文章所有 2 个版本

[PDF] arxiv.org

A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org

The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

被引用次数：262 相关文章所有 14 个版本

[PDF] neurips.cc

Pretraining representations for data-efficient reinforcement learning

M Schwarzer, N Rajkumar… - Advances in …, 2021 - proceedings.neurips.cc

Data efficiency is a key challenge for deep reinforcement learning. We address this problem
by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of …

被引用次数：131 相关文章所有 6 个版本

[PDF] ntu.edu.sg

Uncertainty-aware model-based reinforcement learning: Methodology and application in autonomous driving

J Wu, Z Huang, C Lv - IEEE Transactions on Intelligent Vehicles, 2022 - ieeexplore.ieee.org

To further improve learning efficiency and performance of reinforcement learning (RL), a
novel uncertainty-aware model-based RL method is proposed and validated in autonomous …

被引用次数：74 相关文章所有 3 个版本

[PDF] neurips.cc

Elastic decision transformer

YH Wu, X Wang, M Hamaya - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract This paper introduces Elastic Decision Transformer (EDT), a significant
advancement over the existing Decision Transformer (DT) and its variants. Although DT …

被引用次数：17 相关文章所有 7 个版本

[PDF] mlr.press

Deepmdp: Learning continuous latent space models for representation learning

C Gelada, S Kumar, J Buckman… - International …, 2019 - proceedings.mlr.press

Many reinforcement learning (RL) tasks provide the agent with high-dimensional
observations that can be simplified into low-dimensional continuous states. To formalize this …

被引用次数：308 相关文章所有 4 个版本

[PDF] neurips.cc

Planning with goal-conditioned policies

S Nasiriany, V Pong, S Lin… - Advances in neural …, 2019 - proceedings.neurips.cc

Planning methods can solve temporally extended sequential decision making problems by
composing simple behaviors. However, planning requires suitable abstractions for the states …

被引用次数：227 相关文章所有 8 个版本

[PDF] arxiv.org

The efficiency misnomer

M Dehghani, A Arnab, L Beyer, A Vaswani… - arXiv preprint arXiv …, 2021 - arxiv.org

Model efficiency is a critical aspect of developing and deploying machine learning models.
Inference time and latency directly affect the user experience, and some applications have …

被引用次数：95 相关文章所有 3 个版本

高级搜索

QQ 群

[HTML][HTML] Reinforcement learning for disassembly system optimization problems: A survey

The dormant neuron phenomenon in deep reinforcement learning

Soft actor-critic for discrete action settings

A review on deep learning techniques for video prediction

Pretraining representations for data-efficient reinforcement learning

Uncertainty-aware model-based reinforcement learning: Methodology and application in autonomous driving

Elastic decision transformer

Deepmdp: Learning continuous latent space models for representation learning

Planning with goal-conditioned policies

The efficiency misnomer

引用