Integrative biological simulation, neuropsychology, and AI safety

S Ziesche - Philosophies, 2021 - mdpi.com

This article is about a specific, but so far neglected peril of AI, which is that AI systems may
become existential as well as causing suffering risks for nonhuman animals. The AI value …

被引用次数：15 相关文章所有 3 个版本

[PDF] osf.io

Value cores for inner and outer alignment: simulating personality formation via iterated policy selection and preference learning with self-world modeling active …

A Safron, Z Sheikhbahaee, N Hay, J Orchard… - … Workshop on Active …, 2022 - Springer

Humanity faces multiple existential risks in the coming decades due to technological
advances in AI, and the possibility of unintended behaviors emerging from such systems …

被引用次数：6 相关文章所有 9 个版本

[PDF] arxiv.org

NeuroAI for AI Safety

P Mineault, N Zanichelli, JZ Peng, A Arkhipov… - arXiv preprint arXiv …, 2024 - arxiv.org

As AI systems become increasingly powerful, the need for safe AI has become more
pressing. Humans are an attractive model for AI safety: as the only known agents capable of …

[PDF][PDF] Dream of Being: Solving AI Alignment Problems with Active Inference Models of Agency and Socioemotional Value Learning

A Safron, Z Sheikhbahaee, N Hay, J Orchard, J Hoey - 2022 - scholar.archive.org

Humanity faces multiple existential risks in the coming decades due to technological
advances in AI, and the possibility of unintended behaviors emerging from such systems …

被引用次数：1 相关文章

Value Cores for Inner and Outer Alignment: Simulating Personality Formation via Iterated Policy Selection and Preference Learning with Self-World Modeling Active …

J Hoey - Active Inference: Third International Workshop, IWAI …, 2023 - books.google.com

Humanity faces multiple existential risks in the coming decades due to technological
advances in AI, and the possibility of unintended behaviors emerging from such systems …

[PDF] researchgate.net

[PDF][PDF] Value Coalition Cores for Inner and Outer Alignment: Simulating Personality Formation via Iterated Policy Selection and Preference Learning with Self-World …

A Safron11, Z Sheikhbahaee21, N Hay, J Orchard… - researchgate.net

Humanity faces multiple existential risks in the coming decades due to technological
advances in AI, and the possibility of unintended behaviors emerging from such systems …

[PDF] researchgate.net

[PDF][PDF] Dream of Being: An Active Inference Approach to Agency and Value Recognition in Socioemotional Contexts for Handling AI Alignment Problems

A Safron11, Z Sheikhbahaee21, N Hay, J Orchard… - researchgate.net

Humanity faces multiple existential risks in the coming decades due to technological
advances in AI, and the possibility of unintended behaviors emerging from such systems …

高级搜索

QQ 群