Data and its (dis)contents: A survey of dataset development and use in machine learning research

A Paullada, ID Raji, EM Bender, E Denton, A Hanna - Patterns, 2021 - cell.com
In this work, we survey a breadth of literature that has revealed the limitations of
predominant practices for dataset collection and use in the field of machine learning. We …

Big-data science in porous materials: materials genomics and machine learning

KM Jablonka, D Ongari, SM Moosavi, B Smit - Chemical reviews, 2020 - ACS Publications
By combining metal nodes with organic linkers we can potentially synthesize millions of
possible metal–organic frameworks (MOFs). The fact that we have so many materials opens …

AI and the everything in the whole wide world benchmark

ID Raji, EM Bender, A Paullada, E Denton… - arXiv preprint arXiv …, 2021 - arxiv.org
There is a tendency across different subfields in AI to valorize a small collection of influential
benchmarks. These benchmarks operate as stand-ins for a range of anointed common …

Hyperparameter optimization

M Feurer, F Hutter - Automated machine learning: Methods …, 2019 - library.oapen.org
Recent interest in complex and computationally expensive machine learning models with
many hyperparameters, such as automated machine learning (AutoML) frameworks and …

Machine learning for data-driven discovery in solid Earth geoscience

KJ Bergen, PA Johnson, MV de Hoop, GC Beroza - Science, 2019 - science.org
Background: The solid Earth, oceans, and atmosphere together form a complex
interacting geosystem. Processes relevant to understanding Earth's geosystem behavior …

Deep learning for molecular design—a review of the state of the art

DC Elton, Z Boukouvalas, MD Fuge… - … Systems Design & …, 2019 - pubs.rsc.org
In the space of only a few years, deep generative modeling has revolutionized how we think
of artificial creativity, yielding autonomous systems which produce original images, music …

Neural text summarization: A critical evaluation

W Kryściński, NS Keskar, B McCann, C Xiong… - arXiv preprint arXiv …, 2019 - arxiv.org
Text summarization aims at compressing long documents into a shorter form that conveys
the most important parts of the original document. Despite increased interest in the …

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

[Book] AI Now Report 2018

M Whittaker, K Crawford, R Dobbe, G Fried, E Kaziunas… - 2018 - stc.org
The AI Now Institute at New York University is an interdisciplinary research institute
dedicated to understanding the social implications of AI technologies. It is the first university …

Reduced, reused and recycled: The life of a dataset in machine learning research

B Koch, E Denton, A Hanna, JG Foster - arXiv preprint arXiv:2112.01716, 2021 - arxiv.org
Benchmark datasets play a central role in the organization of machine learning research.
They coordinate researchers around shared research problems and serve as a measure of …