Current and near-term AI as a potential existential risk factor BS Bucknall, S Dori-Hacohen Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, 119-129, 2022 | 36 | 2022 |
Black-Box Access is Insufficient for Rigorous AI Audits S Casper, C Ezell, C Siegmann, N Kolt, TL Curtis, B Bucknall, A Haupt, ... arXiv preprint arXiv:2401.14446, 2024 | 18 | 2024 |
Open-Sourcing Highly Capable Foundation Models: An Evaluation of Risks, Benefits, and Alternative Methods for Pursuing Open-Source Objectives E Seger, N Dreksler, R Moulange, E Dardaman, J Schuett, K Wei, ... | 13 | 2023 |
Towards Publicly Accountable Frontier LLMs: Building an External Scrutiny Ecosystem under the ASPIRE Framework M Anderljung, ET Smith, J O'Brien, L Soder, B Bucknall, E Bluemke, ... arXiv preprint arXiv:2311.14711, 2023 | 9* | 2023 |
Structured Access for Third-Party Research on Frontier AI Models: Investigating Researchers' Model Access Requirements BS Bucknall, RF Trager | 7 | 2023 |
Position Paper: Technical Research and Talent is Needed for Effective AI Governance A Reuel, L Soder, B Bucknall, TA Undheim arXiv preprint arXiv:2406.06987, 2024 | | 2024 |
Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models A Chan, B Bucknall, H Bradley, D Krueger arXiv preprint arXiv:2312.14751, 2023 | | 2023 |
Promoting Exploration in Reinforcement Learning through Surprise-Based Intrinsic Motivation BS Bucknall | | 2022 |