H Liu,
P Abbeel - International Conference on Machine …, 2021 - proceedings.mlr.press
We introduce a new unsupervised pretraining objective for reinforcement learning. During
the unsupervised reward-free pretraining phase, the agent maximizes mutual information …