Social influence as intrinsic motivation for multi-agent deep reinforcement learning N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ... International conference on machine learning, 3040-3049, 2019 | 518* | 2019 |
InfoBot: Transfer and exploration via the information bottleneck A Goyal, R Islam, D Strouse, Z Ahmed, M Botvinick, H Larochelle, ... arXiv preprint arXiv:1901.10902, 2019 | 162 | 2019 |
The deterministic information bottleneck DJ Strouse, DJ Schwab Neural computation 29 (6), 1611-1630, 2017 | 161 | 2017 |
Collaborating with humans without human data DJ Strouse, K McKee, M Botvinick, E Hughes, R Everett Advances in Neural Information Processing Systems 34, 14502-14515, 2021 | 137 | 2021 |
In-context reinforcement learning with algorithm distillation M Laskin, L Wang, J Oh, E Parisotto, S Spencer, R Steigerwald, ... arXiv preprint arXiv:2210.14215, 2022 | 74 | 2022 |
Learning to share and hide intentions using information regularization DJ Strouse, M Kleiman-Weiner, J Tenenbaum, M Botvinick, DJ Schwab Advances in neural information processing systems 31, 2018 | 68 | 2018 |
Semantic exploration from language abstractions and pretrained representations A Tam, N Rabinowitz, A Lampinen, NA Roy, S Chan, DJ Strouse, J Wang, ... Advances in neural information processing systems 35, 25377-25389, 2022 | 57 | 2022 |
The information bottleneck and geometric clustering DJ Strouse, DJ Schwab Neural computation 31 (3), 596-612, 2019 | 38 | 2019 |
Learning more skills through optimistic exploration DJ Strouse, K Baumli, D Warde-Farley, V Mnih, S Hansen arXiv preprint arXiv:2107.14226, 2021 | 37 | 2021 |
A neural architecture for designing truthful and efficient auctions A Tacchetti, DJ Strouse, M Garnelo, T Graepel, Y Bachrach arXiv preprint arXiv:1907.05181, 2019 | 32 | 2019 |
Melting Pot 2.0 JP Agapiou, AS Vezhnevets, EA Duéñez-Guzmán, J Matyas, Y Mao, ... arXiv preprint arXiv:2211.13746, 2022 | 17 | 2022 |
How dendrites affect online recognition memory X Wu, GC Mel, DJ Strouse, BW Mel PLoS computational biology 15 (5), e1006892, 2019 | 13 | 2019 |
Levinson's theorem for graphs AM Childs, DJ Strouse Journal of mathematical physics 52 (8), 2011 | 13 | 2011 |
Confronting reward model overoptimization with constrained RLHF T Moskovitz, AK Singh, DJ Strouse, T Sandholm, R Salakhutdinov, ... arXiv preprint arXiv:2310.04373, 2023 | 12 | 2023 |
Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs AK Singh, DJ Strouse arXiv preprint arXiv:2402.14903, 2024 | 5 | 2024 |
Learning truthful, efficient, and welfare maximizing auction rules A Tacchetti, DJ Strouse, M Garnelo, T Graepel, Y Bachrach arXiv preprint arXiv:1907.05181, 2019 | 4 | 2019 |
Optimization of Mutual Information in Learning: Explorations in Science DJ Strouse Princeton University, 2018 | 1 | 2018 |
Neural network architecture for efficient resource allocation A Tacchetti, DJ Strouse, MG Abellanas, TKH Graepel, Y Bachrach US Patent 11,250,475, 2022 | | 2022 |