An overview of catastrophic ai risks

D Hendrycks, M Mazeika, T Woodside - arXiv preprint arXiv:2306.12001, 2023 - arxiv.org
Rapid advancements in artificial intelligence (AI) have sparked growing concerns among
experts, policymakers, and world leaders regarding the potential for increasingly advanced …

[图书][B] Human-centered AI

B Shneiderman - 2022 - books.google.com
The remarkable progress in algorithms for machine and deep learning have opened the
doors to new opportunities, and some dark possibilities. However, a bright future awaits …

Unsolved problems in ml safety

D Hendrycks, N Carlini, J Schulman… - arXiv preprint arXiv …, 2021 - arxiv.org
Machine learning (ML) systems are rapidly increasing in size, are acquiring new
capabilities, and are increasingly deployed in high-stakes settings. As with other powerful …

Large language model alignment: A survey

T Shen, R Jin, Y Huang, C Liu, W Dong, Z Guo… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent years have witnessed remarkable progress made in large language models (LLMs).
Such advancements, while garnering significant attention, have concurrently elicited various …

Recommender systems for sustainability: overview and research issues

A Felfernig, M Wundara, TNT Tran… - Frontiers in big …, 2023 - frontiersin.org
Sustainability development goals (SDGs) are regarded as a universal call to action with the
overall objectives of planet protection, ending of poverty, and ensuring peace and prosperity …

The challenge of understanding what users want: Inconsistent preferences and engagement optimization

J Kleinberg, S Mullainathan… - Management …, 2023 - pubsonline.informs.org
Online platforms have a wealth of data, run countless experiments, and use industrial-scale
algorithms to optimize user experience. Despite this, many users seem to regret the time …

A proposed framework on integrating health equity and racial justice into the artificial intelligence development lifecycle

I Dankwa-Mullan, EL Scheufele… - Journal of Health Care …, 2021 - muse.jhu.edu
The COVID-19 pandemic has created multiple opportunities to deploy artificial intelligence
(AI)-driven tools and applied interventions to understand, mitigate, and manage the …

Consequences of misaligned AI

S Zhuang, D Hadfield-Menell - Advances in Neural …, 2020 - proceedings.neurips.cc
AI systems often rely on two key components: a specified goal or reward function and an
optimization algorithm to compute the optimal behavior for that goal. This approach is …

Safeguarding the journalistic DNA: Attitudes towards the role of professional values in algorithmic news recommender designs

M Bastian, N Helberger, M Makhortykh - Digital Journalism, 2021 - Taylor & Francis
In contrast to the extensive debate on the influence of algorithmic news recommenders
(ANRs) on individual news diets, the interaction between such systems and journalistic …

Aligning AI optimization to community well-being

J Stray - International Journal of Community Well-Being, 2020 - Springer
This paper investigates incorporating community well-being metrics into the objectives of
optimization algorithms and the teams that build them. It documents two cases where a large …