相关文章- 学术资源搜索

Population based training of neural networks

M Jaderberg, V Dalibard, S Osindero… - arXiv preprint arXiv …, 2017 - arxiv.org

Neural networks dominate the modern machine learning landscape, but their training and
success still suffer from sensitivity to empirical choices of hyperparameters such as model …

被引用次数：877 相关文章所有 6 个版本

[PDF] arxiv.org

Hyperparameter tuning for deep reinforcement learning applications

M Kiran, M Ozyildirim - arXiv preprint arXiv:2201.11182, 2022 - arxiv.org

Reinforcement learning (RL) applications, where an agent can simply learn optimal
behaviors by interacting with the environment, are quickly gaining tremendous success in a …

被引用次数：31 相关文章所有 2 个版本

[PDF] mlr.press

Bayesian generational population-based training

X Wan, C Lu, J Parker-Holder, PJ Ball… - International …, 2022 - proceedings.mlr.press

Reinforcement learning (RL) offers the potential for training generally capable agents that
can interact autonomously in the real world. However, one key limitation is the brittleness of …

被引用次数：21 相关文章所有 9 个版本

[PDF] arxiv.org

Pre-translation for neural machine translation

J Niehues, E Cho, TL Ha, A Waibel - arXiv preprint arXiv:1610.05243, 2016 - arxiv.org

Recently, the development of neural machine translation (NMT) has significantly improved
the translation quality of automatic machine translation. While most sentences are more …

被引用次数：86 相关文章所有 3 个版本

[PDF] arxiv.org

Sample-efficient automated deep reinforcement learning

JKH Franke, G Köhler, A Biedenkapp… - arXiv preprint arXiv …, 2020 - arxiv.org

Despite significant progress in challenging problems across various domains, applying state-
of-the-art deep reinforcement learning (RL) algorithms remains challenging due to their …

被引用次数：46 相关文章所有 7 个版本

[PDF] mlr.press

Hyperparameters in reinforcement learning and how to tune them

T Eimer, M Lindauer… - … Conference on Machine …, 2023 - proceedings.mlr.press

In order to improve reproducibility, deep reinforcement learning (RL) has been adopting
better scientific practices such as standardized evaluation metrics and reporting. However …

被引用次数：32 相关文章所有 7 个版本

[PDF] arxiv.org

Unsupervised neural machine translation

M Artetxe, G Labaka, E Agirre, K Cho - arXiv preprint arXiv:1710.11041, 2017 - arxiv.org

In spite of the recent success of neural machine translation (NMT) in standard benchmarks,
the lack of large parallel corpora poses a major practical problem for many language pairs …

被引用次数：1022 相关文章所有 5 个版本

[PDF] arxiv.org

Sockeye: A toolkit for neural machine translation

F Hieber, T Domhan, M Denkowski, D Vilar… - arXiv preprint arXiv …, 2017 - arxiv.org

We describe Sockeye (version 1.12), an open-source sequence-to-sequence toolkit for
Neural Machine Translation (NMT). Sockeye is a production-ready framework for training …

被引用次数：239 相关文章所有 13 个版本

[PDF] arxiv.org

Minimum risk training for neural machine translation

S Shen, Y Cheng, Z He, W He, H Wu, M Sun… - arXiv preprint arXiv …, 2015 - arxiv.org

We propose minimum risk training for end-to-end neural machine translation. Unlike
conventional maximum likelihood estimation, minimum risk training is capable of optimizing …

被引用次数：508 相关文章所有 11 个版本

[PDF] mlr.press

Adversarial neural machine translation

L Wu, Y Xia, F Tian, L Zhao, T Qin… - Asian Conference on …, 2018 - proceedings.mlr.press

In this paper, we study a new learning paradigm for neural machine translation (NMT).
Instead of maximizing the likelihood of the human translation as in previous works, we …

被引用次数：154 相关文章所有 5 个版本

高级搜索

QQ 群

Population based training of neural networks

Hyperparameter tuning for deep reinforcement learning applications

Bayesian generational population-based training

Pre-translation for neural machine translation

Sample-efficient automated deep reinforcement learning

Hyperparameters in reinforcement learning and how to tune them

Unsupervised neural machine translation

Sockeye: A toolkit for neural machine translation

Minimum risk training for neural machine translation

Adversarial neural machine translation

引用