Assuring the machine learning lifecycle: Desiderata, methods, and challenges

R Ashmore, R Calinescu, C Paterson - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Machine learning has evolved into an enabling technology for a wide range of highly
successful applications. The potential for this success to continue and accelerate has placed …

A survey of data augmentation approaches for NLP

SY Feng, V Gangal, J Wei, S Chandar… - arXiv preprint arXiv …, 2021 - arxiv.org
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …

Taxonomy of machine learning safety: A survey and primer

S Mohseni, H Wang, C Xiao, Z Yu, Z Wang… - ACM Computing …, 2022 - dl.acm.org
The open-world deployment of Machine Learning (ML) algorithms in safety-critical
applications such as autonomous vehicles needs to address a variety of ML vulnerabilities …

Domain randomization and pyramid consistency: Simulation-to-real generalization without accessing target domain data

X Yue, Y Zhang, S Zhao… - Proceedings of the …, 2019 - openaccess.thecvf.com
We propose to harness the potential of simulation for semantic segmentation of real-world
self-driving scenes in a domain generalization fashion. The segmentation network is trained …

Three-dimensional convolutional neural networks and a cross-docked data set for structure-based drug design

PG Francoeur, T Masuda, J Sunseri, A Jia… - Journal of chemical …, 2020 - ACS Publications
One of the main challenges in drug discovery is predicting protein–ligand binding affinity.
Recently, machine learning approaches have made substantial progress on this task …

VerifAI: A Toolkit for the Formal Design and Analysis of Artificial Intelligence-Based Systems

T Dreossi, DJ Fremont, S Ghosh, E Kim… - … on Computer Aided …, 2019 - Springer
We present VerifAI, a software toolkit for the formal design and analysis of systems that
include artificial intelligence (AI) and machine learning (ML) components. VerifAI particularly …

Formal scenario-based testing of autonomous vehicles: From simulation to the real world

DJ Fremont, E Kim, YV Pant, SA Seshia… - 2020 IEEE 23rd …, 2020 - ieeexplore.ieee.org
We present a new approach to automated scenario-based testing of the safety of
autonomous vehicles, especially those using advanced artificial intelligence-based …

Requirements-driven test generation for autonomous vehicles with machine learning components

CE Tuncali, G Fainekos, D Prokhorov… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Autonomous vehicles are complex systems that are challenging to test and debug. A
requirements-driven approach to the development process can decrease the resources …

Global and local texture randomization for synthetic-to-real semantic segmentation

D Peng, Y Lei, L Liu, P Zhang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Semantic segmentation is a crucial image understanding task, where each pixel of image is
categorized into a corresponding label. Since the pixel-wise labeling for ground-truth is …

Scenic: A language for scenario specification and data generation

DJ Fremont, E Kim, T Dreossi, S Ghosh, X Yue… - Machine Learning, 2023 - Springer
We propose a new probabilistic programming language for the design and analysis of cyber-
physical systems, especially those based on machine learning. We consider several …