Language models don't always say what they think: unfaithful explanations in chain-of-thought prompting M Turpin, J Michael, E Perez, S Bowman Advances in Neural Information Processing Systems 36, 2024 | 196 | 2024 |
Foundational challenges in assuring alignment and safety of large language models U Anwar, A Saparov, J Rando, D Paleka, M Turpin, P Hase, ES Lubana, ... arXiv preprint arXiv:2404.09932, 2024 | 32 | 2024 |
A machine learning toolkit for genetic engineering attribution to facilitate biosecurity EC Alley, M Turpin, AB Liu, T Kulp-McDowall, J Swett, R Edison, ... Nature Communications 11 (1), 6293, 2020 | 20 | 2020 |
Attribution of genetic engineering: A practical and accurate machine-learning toolkit for biosecurity EC Alley, M Turpin, AB Liu, T Kulp-McDowall, J Swett, R Edison, ... bioRxiv, 2020.08.22.262576, 2020 | 4 | 2020 |
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought J Chua, E Rees, H Batra, SR Bowman, J Michael, E Perez, M Turpin arXiv preprint arXiv:2403.05518, 2024 | 3 | 2024 |
Machine Learning Prediction of Surgical Intervention for Small Bowel Obstruction M Turpin, J Watson, M Engelhard, R Henao, D Thompson, L Carin, A Kirk medRxiv, 2021.04.13.21255428, 2021 | | 2021 |