Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - ACM Computing …, 2023 - dl.acm.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Coannotating: Uncertainty-guided work allocation between human and large language models for data annotation

M Li, T Shi, C Ziems, MY Kan, NF Chen, Z Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Annotated data plays a critical role in Natural Language Processing (NLP) in training
models and evaluating their performance. Given recent developments in Large Language …
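
The title describes an uncertainty-guided split of annotation work between humans and LLMs. Below is a minimal, hypothetical Python sketch of that idea: instances where repeated LLM labels disagree (high entropy) are routed to human annotators, the rest are kept as LLM labels. The `query_llm_label` helper, the number of samples, and the threshold are illustrative assumptions, not the paper's actual implementation.

```python
import math
from collections import Counter
from typing import Callable, List, Optional, Tuple

def label_entropy(labels: List[str]) -> float:
    """Shannon entropy of the empirical label distribution over repeated LLM samples."""
    counts = Counter(labels)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def allocate(
    texts: List[str],
    query_llm_label: Callable[[str], str],  # assumed helper: one sampled LLM label per call
    n_samples: int = 5,
    threshold: float = 0.8,
) -> Tuple[List[Tuple[str, str]], List[str]]:
    """Split texts into (LLM-labeled pairs, texts routed to human annotators)."""
    llm_labeled, to_human = [], []
    for text in texts:
        samples = [query_llm_label(text) for _ in range(n_samples)]
        if label_entropy(samples) <= threshold:
            # Low disagreement: keep the LLM's majority label.
            llm_labeled.append((text, Counter(samples).most_common(1)[0][0]))
        else:
            # High disagreement: defer to a human annotator.
            to_human.append(text)
    return llm_labeled, to_human
```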

PRBoost: Prompt-based rule discovery and boosting for interactive weakly-supervised learning

R Zhang, Y Yu, P Shetty, L Song, C Zhang - arXiv preprint arXiv …, 2022 - arxiv.org
Weakly-supervised learning (WSL) has shown promising results in addressing label scarcity
on many NLP tasks, but manually designing a comprehensive, high-quality labeling rule set …

Language models in the loop: Incorporating prompting into weak supervision

R Smith, JA Fries, B Hancock, SH Bach - ACM/IMS Journal of Data …, 2024 - dl.acm.org
We propose a new strategy for applying large pre-trained language models to novel tasks
when labeled training data is limited. Rather than apply the model in a typical zero-shot or …
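
The title and snippet point to using prompted language models as sources inside a weak-supervision pipeline rather than as direct zero-shot predictors. The sketch below is a hypothetical illustration of that pattern: each prompt template is wrapped as a labeling function that may abstain, and the votes are aggregated (here by simple majority vote as a stand-in for a learned label model). The `ask_llm` helper and the yes/no parsing are assumptions for illustration.

```python
from collections import Counter
from typing import Callable, List, Optional

ABSTAIN = None

def make_prompt_lf(template: str, ask_llm: Callable[[str], str]) -> Callable[[str], Optional[str]]:
    """Wrap one prompt template as a labeling function that may abstain."""
    def lf(text: str) -> Optional[str]:
        answer = ask_llm(template.format(text=text)).strip().lower()
        if answer.startswith("yes"):
            return "positive"
        if answer.startswith("no"):
            return "negative"
        return ABSTAIN  # unclear answer: abstain rather than guess
    return lf

def weak_label(text: str, lfs: List[Callable[[str], Optional[str]]]) -> Optional[str]:
    """Aggregate labeling-function votes; a learned label model could replace this vote."""
    votes = [v for v in (lf(text) for lf in lfs) if v is not ABSTAIN]
    return Counter(votes).most_common(1)[0][0] if votes else ABSTAIN
```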

Improved active multi-task representation learning via lasso

Y Wang, Y Chen, K Jamieson… - … Conference on Machine …, 2023 - proceedings.mlr.press
To leverage the copious amount of data from source tasks and overcome the scarcity of the
target task samples, representation learning based on multi-task pretraining has become a …

Cold-start data selection for better few-shot language model fine-tuning: A prompt-based uncertainty propagation approach

Y Yu, R Zhang, R Xu, J Zhang, J Shen… - Proceedings of the 61st …, 2023 - aclanthology.org
We present PATRON, a prompt-based data selection method for pre-trained language
model fine-tuning under cold-start scenarios, i.e., no initial labeled data are available. In …
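
PATRON's snippet names prompt-based uncertainty as the selection signal for cold-start annotation. The following is a minimal, hypothetical sketch of that core step: score each unlabeled example by the entropy of a prompted model's label-word distribution and pick the top-k for labeling. The `prompt_label_probs` helper and k are assumptions; the paper's full method additionally propagates uncertainty across neighboring examples and enforces diversity, which is omitted here.

```python
import math
from typing import Callable, Dict, List

def predictive_entropy(label_probs: Dict[str, float]) -> float:
    """Entropy of the prompted model's distribution over label words for one example."""
    return -sum(p * math.log2(p) for p in label_probs.values() if p > 0)

def select_for_annotation(
    texts: List[str],
    prompt_label_probs: Callable[[str], Dict[str, float]],  # assumed helper
    k: int = 32,
) -> List[str]:
    """Return the k unlabeled examples with the highest prompt-based uncertainty."""
    scored = [(predictive_entropy(prompt_label_probs(t)), t) for t in texts]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [t for _, t in scored[:k]]
```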

Characterizing the Impacts of Semi-supervised Learning for Weak Supervision

J Li, J Zhang, L Schmidt… - Advances in Neural …, 2024 - proceedings.neurips.cc
Labeling training data is a critical and expensive step in producing high-accuracy ML
models, whether training from scratch or fine-tuning. To make labeling more efficient, two …

VideoPro: A Visual Analytics Approach for Interactive Video Programming

J He, X Wang, KK Wong, X Huang… - … on Visualization and …, 2023 - ieeexplore.ieee.org
Constructing supervised machine learning models for real-world video analysis requires
substantial labeled data, which is costly to acquire due to scarce domain expertise and …

Automatic calibration and error correction for large language models via pareto optimal self-supervision

T Zhao, M Wei, JS Preston, H Poon - arXiv preprint arXiv:2306.16564, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable capabilities out of the box for a
wide range of applications, yet accuracy still remains a major growth area, especially in …

Cold-start data selection for few-shot language model fine-tuning: A prompt-based uncertainty propagation approach

Y Yu, R Zhang, R Xu, J Zhang, J Shen… - arXiv preprint arXiv …, 2022 - arxiv.org
Large Language Models have demonstrated remarkable few-shot performance, but the
performance can be sensitive to the selection of few-shot instances. We propose PATRON, a …