Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ... Nature 575 (7782), 350-354, 2019 | 4699 | 2019 |
Hybrid computing using a neural network with dynamic external memory A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ... Nature 538 (7626), 471-476, 2016 | 1944 | 2016 |
Deep Q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 1281 | 2018 |
StarCraft II: A new challenge for reinforcement learning O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ... arXiv preprint arXiv:1708.04782, 2017 | 1088 | 2017 |
Strategic attentive writer for learning macro-actions A Vezhnevets, V Mnih, S Osindero, A Graves, O Vinyals, J Agapiou Advances in Neural Information Processing Systems 29, 2016 | 182 | 2016 |
Learning from demonstrations for real world reinforcement learning T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ... arXiv preprint arXiv:1704.03732, 2017 | 181 | 2017 |
Scalable evaluation of multi-agent reinforcement learning with Melting Pot JZ Leibo, EA Duéñez-Guzmán, A Vezhnevets, JP Agapiou, P Sunehag, ... International Conference on Machine Learning, 6187-6199, 2021 | 86 | 2021 |
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings E Vinitsky, R Köster, JP Agapiou, EA Duéñez-Guzmán, AS Vezhnevets, ... Collective Intelligence 2 (2), 26339137231162025, 2023 | 39 | 2023 |
The synaptic representation of sound source location in auditory cortex P Chadderton, JP Agapiou, D McAlpine, TW Margrie Journal of Neuroscience 29 (45), 14127-14135, 2009 | 32 | 2009 |
Melting Pot 2.0 JP Agapiou, AS Vezhnevets, EA Duéñez-Guzmán, J Matyas, Y Mao, ... arXiv preprint arXiv:2211.13746, 2022 | 28 | 2022 |
Low-frequency envelope sensitivity produces asymmetric binaural tuning curves JP Agapiou, D McAlpine Journal of Neurophysiology 100 (4), 2381-2396, 2008 | 28 | 2008 |
Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia AS Vezhnevets, JP Agapiou, A Aharon, R Ziv, J Matyas, ... arXiv preprint arXiv:2312.03664, 2023 | 15 | 2023 |
Hidden agenda: a social deduction game with diverse learned equilibria K Kopparapu, EA Duéñez-Guzmán, J Matyas, AS Vezhnevets, ... arXiv preprint arXiv:2201.01816, 2022 | 11 | 2022 |
Heterogeneous social value orientation leads to meaningful diversity in sequential social dilemmas U Madhushani, KR McKee, JP Agapiou, JZ Leibo, R Everett, T Anthony, ... arXiv preprint arXiv:2305.00768, 2023 | 7 | 2023 |
Auto-aligning multiagent incentives with global objectives M Kwon, JP Agapiou, EA Duéñez-Guzmán, R Elie, G Piliouras, K Bullard, ... ICML Workshop on Localized Learning (LLW), 2023 | 5 | 2023 |
Learning agents that acquire representations of social groups JZ Leibo, AS Vezhnevets, MK Eckstein, JP Agapiou, EA Duéñez-Guzmán Behavioral and Brain Sciences 45, e111, 2022 | 2 | 2022 |
What is the simplest model that can account for high-fidelity imitation? JZ Leibo, R Köster, AS Vezhnevets, EA Duéñez-Guzmán, JP Agapiou, ... Behavioral and Brain Sciences 45, 2022 | 1 | 2022 |
The Concordia Contest: Advancing the Cooperative Intelligence of Language Agents C Smith, R Trivedi, J Clifton, L Hammond, A Khan, S Vezhnevets, ... NeurIPS 2024 Competition Track | | |