Population based training of neural networks

M Jaderberg, V Dalibard, S Osindero… - arXiv preprint arXiv …, 2017 - arxiv.org
Neural networks dominate the modern machine learning landscape, but their training and
success still suffer from sensitivity to empirical choices of hyperparameters such as model …

Hyperparameter tuning for deep reinforcement learning applications

M Kiran, M Ozyildirim - arXiv preprint arXiv:2201.11182, 2022 - arxiv.org
Reinforcement learning (RL) applications, where an agent can simply learn optimal
behaviors by interacting with the environment, are quickly gaining tremendous success in a …

Bayesian generational population-based training

X Wan, C Lu, J Parker-Holder, PJ Ball… - International …, 2022 - proceedings.mlr.press
Reinforcement learning (RL) offers the potential for training generally capable agents that
can interact autonomously in the real world. However, one key limitation is the brittleness of …

Pre-translation for neural machine translation

J Niehues, E Cho, TL Ha, A Waibel - arXiv preprint arXiv:1610.05243, 2016 - arxiv.org
Recently, the development of neural machine translation (NMT) has significantly improved
the translation quality of automatic machine translation. While most sentences are more …

Sample-efficient automated deep reinforcement learning

JKH Franke, G Köhler, A Biedenkapp… - arXiv preprint arXiv …, 2020 - arxiv.org
Despite significant progress in challenging problems across various domains, applying state-
of-the-art deep reinforcement learning (RL) algorithms remains challenging due to their …

Hyperparameters in reinforcement learning and how to tune them

T Eimer, M Lindauer… - … Conference on Machine …, 2023 - proceedings.mlr.press
In order to improve reproducibility, deep reinforcement learning (RL) has been adopting
better scientific practices such as standardized evaluation metrics and reporting. However …

Unsupervised neural machine translation

M Artetxe, G Labaka, E Agirre, K Cho - arXiv preprint arXiv:1710.11041, 2017 - arxiv.org
In spite of the recent success of neural machine translation (NMT) in standard benchmarks,
the lack of large parallel corpora poses a major practical problem for many language pairs …

Sockeye: A toolkit for neural machine translation

F Hieber, T Domhan, M Denkowski, D Vilar… - arXiv preprint arXiv …, 2017 - arxiv.org
We describe Sockeye (version 1.12), an open-source sequence-to-sequence toolkit for
Neural Machine Translation (NMT). Sockeye is a production-ready framework for training …

Minimum risk training for neural machine translation

S Shen, Y Cheng, Z He, W He, H Wu, M Sun… - arXiv preprint arXiv …, 2015 - arxiv.org
We propose minimum risk training for end-to-end neural machine translation. Unlike
conventional maximum likelihood estimation, minimum risk training is capable of optimizing …

Adversarial neural machine translation

L Wu, Y Xia, F Tian, L Zhao, T Qin… - Asian Conference on …, 2018 - proceedings.mlr.press
In this paper, we study a new learning paradigm for neural machine translation (NMT).
Instead of maximizing the likelihood of the human translation as in previous works, we …