Relation extraction using distant supervision: A survey

A Smirnova, P Cudré-Mauroux - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Relation extraction is a subtask of information extraction where semantic relationships are
extracted from natural language text and then classified. In essence, it allows us to acquire …

Accelerating human-in-the-loop machine learning: Challenges and opportunities

D Xin, L Ma, J Liu, S Macke, S Song… - Proceedings of the …, 2018 - dl.acm.org
Development of machine learning (ML) workflows is a tedious process of iterative
experimentation: developers repeatedly make changes to workflows until the desired …

A survey on data collection for machine learning: a big data-ai integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

Representation learning for dynamic graphs: A survey

SM Kazemi, R Goel, K Jain, I Kobyzev, A Sethi… - Journal of Machine …, 2020 - jmlr.org
Graphs arise naturally in many real-world applications including social networks,
recommender systems, ontologies, biology, and computational finance. Traditionally …

De-identification of patient notes with recurrent neural networks

F Dernoncourt, JY Lee, O Uzuner… - Journal of the American …, 2017 - academic.oup.com
Objective: Patient notes in electronic health records (EHRs) may contain critical information
for medical investigations. However, the vast majority of medical investigators can only …

Data lifecycle challenges in production machine learning: a survey

N Polyzotis, S Roy, SE Whang, M Zinkevich - ACM SIGMOD Record, 2018 - dl.acm.org
Machine learning has become an essential tool for gleaning knowledge from data and
tackling a diverse set of computationally hard tasks. However, the accuracy of a machine …

Semantic search on text and knowledge bases

H Bast, B Buchhold, E Haussmann - Foundations and Trends® …, 2016 - nowpublishers.com
This article provides a comprehensive overview of the broad area of semantic search on text
and knowledge bases. In a nutshell, semantic search is “search with meaning”. This …

Sports big data: management, analysis, applications, and challenges

Z Bai, X Bai - Complexity, 2021 - Wiley Online Library
With the rapid growth of information technology and sports, analyzing sports information has
become an increasingly challenging issue. Sports big data come from the Internet and show …

Helix: Holistic optimization for accelerating iterative machine learning

D Xin, S Macke, L Ma, J Liu, S Song… - arXiv preprint arXiv …, 2018 - arxiv.org
Machine learning workflow development is a process of trial-and-error: developers iterate on
workflows by testing out small modifications until the desired accuracy is achieved …

HET-GMP: A graph-based system approach to scaling large embedding model training

X Miao, Y Shi, H Zhang, X Zhang, X Nie… - Proceedings of the …, 2022 - dl.acm.org
Embedding models have been recognized as an effective learning paradigm for high-
dimensional data. However, a major embedding model training obstacle is that updating …