The METRIC-framework for assessing data quality for trustworthy AI in medicine: a systematic review

D Schwabe, K Becker, M Seyferth, A Klaß… - NPJ Digital …, 2024 - nature.com
The adoption of machine learning (ML) and, more specifically, deep learning (DL)
applications into all major areas of our lives is underway. The development of trustworthy AI …

The diagnostic and triage accuracy of digital and online symptom checker tools: a systematic review

W Wallace, C Chan, S Chidambaram, L Hanna… - NPJ digital …, 2022 - nature.com
Digital and online symptom checkers are an increasingly adopted class of health
technologies that enable patients to input their symptoms and biodata to produce a set of …

[HTML][HTML] Large language models encode clinical knowledge

K Singhal, S Azizi, T Tu, SS Mahdavi, J Wei, HW Chung… - Nature, 2023 - nature.com
Large language models (LLMs) have demonstrated impressive capabilities, but the bar for
clinical applications is high. Attempts to assess the clinical knowledge of models typically …

Large language models encode clinical knowledge

K Singhal, S Azizi, T Tu, SS Mahdavi, J Wei… - arXiv preprint arXiv …, 2022 - arxiv.org
Large language models (LLMs) have demonstrated impressive capabilities in natural
language understanding and generation, but the quality bar for medical and clinical …

The value of standards for health datasets in artificial intelligence-based applications

A Arora, JE Alderman, J Palmer, S Ganapathi… - Nature Medicine, 2023 - nature.com
Artificial intelligence as a medical device is increasingly being applied to healthcare for
diagnosis, risk stratification and resource allocation. However, a growing body of evidence …

Investigating Practices and Opportunities for Cross-functional Collaboration around AI Fairness in Industry Practice

WH Deng, N Yildirim, M Chang, M Eslami… - Proceedings of the …, 2023 - dl.acm.org
An emerging body of research indicates that ineffective cross-functional collaboration–the
interdisciplinary work done by industry practitioners across roles–represents a major barrier …

A hunt for the snark: Annotator diversity in data practices

S Kapania, AS Taylor, D Wang - … of the 2023 CHI Conference on Human …, 2023 - dl.acm.org
Diversity in datasets is a key component to building responsible AI/ML. Despite this
recognition, we know little about the diversity among the annotators involved in data …

Learning from data with structured missingness

R Mitra, SF McGough, T Chakraborti… - Nature Machine …, 2023 - nature.com
Missing data are an unavoidable complication in many machine learning tasks. When data
are 'missing at random'there exist a range of tools and techniques to deal with the issue …

Augmented datasheets for speech datasets and ethical decision-making

O Papakyriakopoulos, ASG Choi, W Thong… - Proceedings of the …, 2023 - dl.acm.org
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …

Developing robust benchmarks for driving forward AI innovation in healthcare

D Mincu, S Roy - Nature Machine Intelligence, 2022 - nature.com
Abstract Machine learning technologies have seen increased application to the healthcare
domain. The main drivers are openly available healthcare datasets, and a general interest …