Challenges in deploying machine learning: a survey of case studies

A Paleyes, RG Urma, ND Lawrence - ACM computing surveys, 2022 - dl.acm.org
In recent years, machine learning has transitioned from a field of academic research interest
to a field capable of solving real-world business problems. However, the deployment of …

[HTML][HTML] A survey of open source tools for machine learning with big data in the Hadoop ecosystem

S Landset, TM Khoshgoftaar, AN Richter, T Hasanin - Journal of Big Data, 2015 - Springer
With an ever-increasing amount of options, the task of selecting machine learning tools for
big data can be difficult. The available tools have advantages and drawbacks, and many …

“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI

N Sambasivan, S Kapania, H Highfill… - proceedings of the …, 2021 - dl.acm.org
AI models are increasingly applied in high-stakes domains like health and conservation.
Data quality carries an elevated significance in high-stakes AI due to its heightened …

A survey on FinTech

K Gai, M Qiu, X Sun - Journal of Network and Computer Applications, 2018 - Elsevier
As a new term in the financial industry, FinTech has become a popular term that describes
novel technologies adopted by the financial service institutions. This term covers a large …

Collaboration challenges in building ml-enabled systems: Communication, documentation, engineering, and process

N Nahar, S Zhou, G Lewis, C Kästner - Proceedings of the 44th …, 2022 - dl.acm.org
The introduction of machine learning (ML) components in software projects has created the
need for software engineers to collaborate with data scientists and other specialists. While …

Human factors in model interpretability: Industry practices, challenges, and needs

SR Hong, J Hullman, E Bertini - Proceedings of the ACM on Human …, 2020 - dl.acm.org
As the use of machine learning (ML) models in product development and data-driven
decision-making processes became pervasive in many domains, people's focus on building …

A survey of data partitioning and sampling methods to support big data analysis

MS Mahmud, JZ Huang, S Salloum… - Big Data Mining and …, 2020 - ieeexplore.ieee.org
Computer clusters with the shared-nothing architecture are the major computing platforms
for big data processing and analysis. In cluster computing, data partitioning and sampling …

Exploration and explanation in computational notebooks

A Rule, A Tabard, JD Hollan - Proceedings of the 2018 CHI Conference …, 2018 - dl.acm.org
Computational notebooks combine code, visualizations, and text in a single document.
Researchers, data analysts, and even journalists are rapidly adopting this new medium. We …

How does machine learning change software development practices?

Z Wan, X Xia, D Lo, GC Murphy - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Adding an ability for a system to learn inherently adds uncertainty into the system. Given the
rising popularity of incorporating machine learning into systems, we wondered how the …

A multi-level typology of abstract visualization tasks

M Brehmer, T Munzner - IEEE transactions on visualization and …, 2013 - ieeexplore.ieee.org
The considerable previous work characterizing visualization usage has focused on low-level
tasks or interactions and high-level tasks, leaving a gap between them that is not addressed …