Ethical and social risks of harm from language models L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... arXiv preprint arXiv:2112.04359, 2021 | 668 | 2021 |
Taxonomy of risks posed by language models L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022 | 378 | 2022 |
Ethical and social risks of harm from language models L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... arXiv preprint arXiv:2112.04359 10, 2021 | 43 | 2021 |
The neuroscience of moral judgment: empirical and philosophical developments J May, CI Workman, J Haas, H Han Neuroscience and philosophy, 17-47, 2022 | 27 | 2022 |
Melting Pot 2.0 JP Agapiou, AS Vezhnevets, EA Duéñez-Guzmán, J Matyas, Y Mao, ... arXiv preprint arXiv:2211.13746, 2022 | 21 | 2022 |
Ethical and social risks of harm from Language Models L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... arXiv preprint arXiv:2112.04359, 2021 | 13 | 2021 |
Moral gridworlds: a theoretical proposal for modeling artificial moral cognition J Haas Minds and Machines 30 (2), 219-246, 2020 | 12 | 2020 |
Is synchronic self-control possible? J Haas Review of Philosophy and Psychology 12 (2), 397-424, 2021 | 9 | 2021 |
An empirical solution to the puzzle of weakness of will J Haas Synthese 195 (12), 5175-5195, 2018 | 9 | 2018 |
Doing Without Free Will: Spinoza and Contemporary Moral Problems JT Cook, J Haas, M Homan Lexington Books, 2015 | 7 | 2015 |
Reinforcement learning: A brief guide for philosophers of mind J Haas | 6 | 2022 |
Can hierarchical predictive coding explain binocular rivalry? J Haas Philosophical Psychology 34 (3), 424-444, 2021 | 5 | 2021 |
Artificial moral cognition: Learning from developmental psychology L Weidinger, M Reinecke, J Haas | 4 | 2022 |
The puzzle of evaluating moral cognition in artificial agents MG Reinecke, Y Mao, M Kunesch, EA Duéñez-Guzmán, J Haas, JZ Leibo Cognitive Science 47 (8), e13315, 2023 | 3 | 2023 |
Two Theories of Moral Cognition J Haas Does Neuroscience Have Normative Implications?, 59-79, 2020 | 3 | 2020 |
Valuation mechanisms in moral cognition J Haas Behavioral and Brain Sciences 42, 2019 | 2 | 2019 |
Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity Y Mao, MG Reinecke, M Kunesch, EA Duéñez-Guzmán, R Comanescu, ... arXiv preprint arXiv:2305.18269, 2023 | 1 | 2023 |
Holistic resource-rational analysis J Haas, C Klein Behavioral and Brain Sciences 43, 2020 | 1 | 2020 |
Recovering Spinoza’s Theory of Akrasia J Haas Goldenbaum and Kluz 2015, 27-42, 2015 | 1 | 2015 |
The evaluative mind J Haas Mind Design III: Philosophy, Psychology, and Artificial Intelligence, 2023 | | 2023 |