John P Agapiou
Other names: John Agapiou
Staff Research Engineer, Google DeepMind
Verified email at google.com
Title
Cited by
Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
Cited by: 4699; Year: 2019
Hybrid computing using a neural network with dynamic external memory
A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...
Nature 538 (7626), 471-476, 2016
Cited by: 1944; Year: 2016
Deep Q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
Cited by: 1281; Year: 2018
StarCraft II: A new challenge for reinforcement learning
O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ...
arXiv preprint arXiv:1708.04782, 2017
Cited by: 1088; Year: 2017
Strategic attentive writer for learning macro-actions
A Vezhnevets, V Mnih, S Osindero, A Graves, O Vinyals, J Agapiou
Advances in neural information processing systems 29, 2016
Cited by: 182; Year: 2016
Learning from demonstrations for real world reinforcement learning
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ...
arXiv preprint arXiv:1704.03732, 2017
Cited by: 181; Year: 2017
Scalable evaluation of multi-agent reinforcement learning with Melting Pot
JZ Leibo, EA Duéñez-Guzmán, A Vezhnevets, JP Agapiou, P Sunehag, ...
International conference on machine learning, 6187-6199, 2021
Cited by: 86; Year: 2021
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
E Vinitsky, R Köster, JP Agapiou, EA Duéñez-Guzmán, AS Vezhnevets, ...
Collective Intelligence 2 (2), 26339137231162025, 2023
Cited by: 39; Year: 2023
The synaptic representation of sound source location in auditory cortex
P Chadderton, JP Agapiou, D McAlpine, TW Margrie
Journal of Neuroscience 29 (45), 14127-14135, 2009
Cited by: 32; Year: 2009
Melting Pot 2.0
JP Agapiou, AS Vezhnevets, EA Duéñez-Guzmán, J Matyas, Y Mao, ...
arXiv preprint arXiv:2211.13746, 2022
Cited by: 28; Year: 2022
Low-frequency envelope sensitivity produces asymmetric binaural tuning curves
JP Agapiou, D McAlpine
Journal of neurophysiology 100 (4), 2381-2396, 2008
Cited by: 28; Year: 2008
Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia
AS Vezhnevets, JP Agapiou, A Aharon, R Ziv, J Matyas, ...
arXiv preprint arXiv:2312.03664, 2023
Cited by: 15; Year: 2023
Hidden agenda: a social deduction game with diverse learned equilibria
K Kopparapu, EA Duéñez-Guzmán, J Matyas, AS Vezhnevets, ...
arXiv preprint arXiv:2201.01816, 2022
Cited by: 11; Year: 2022
Heterogeneous social value orientation leads to meaningful diversity in sequential social dilemmas
U Madhushani, KR McKee, JP Agapiou, JZ Leibo, R Everett, T Anthony, ...
arXiv preprint arXiv:2305.00768, 2023
Cited by: 7; Year: 2023
Auto-aligning multiagent incentives with global objectives
M Kwon, JP Agapiou, EA Duéñez-Guzmán, R Elie, G Piliouras, K Bullard, ...
ICML Workshop on Localized Learning (LLW), 2023
Cited by: 5; Year: 2023
Learning agents that acquire representations of social groups
JZ Leibo, AS Vezhnevets, MK Eckstein, JP Agapiou, EA Duéñez-Guzmán
Behav. Brain Sci. 45, e111, 2022
Cited by: 2; Year: 2022
What is the simplest model that can account for high-fidelity imitation?
JZ Leibo, R Köster, AS Vezhnevets, EA Duéñez-Guzmán, JP Agapiou, ...
Behavioral & Brain Sciences 45, 2022
Cited by: 1; Year: 2022
The Concordia Contest: Advancing the Cooperative Intelligence of Language Agents
C Smith, R Trivedi, J Clifton, L Hammond, A Khan, S Vezhnevets, ...
NeurIPS 2024 Competition Track, 0