A survey of controllable text generation using transformer-based pre-trained language models

H Zhang, H Song, S Li, M Zhou, D Song - ACM Computing Surveys, 2023 - dl.acm.org
Controllable Text Generation (CTG) is an emerging area in the field of natural language
generation (NLG). It is regarded as crucial for the development of advanced text generation …

A survey on task assignment in crowdsourcing

D Hettiachchi, V Kostakos, J Goncalves - ACM Computing Surveys …, 2022 - dl.acm.org
Quality improvement methods are essential to gathering high-quality crowdsourced data,
both for research and industry applications. A popular and broadly applicable method is task …

All that's' human'is not gold: Evaluating human evaluation of generated text

E Clark, T August, S Serrano, N Haduong… - arXiv preprint arXiv …, 2021 - arxiv.org
Human evaluations are typically considered the gold standard in natural language
generation, but as models' fluency improves, how well can evaluators detect and judge …

Evaluation of text generation: A survey

A Celikyilmaz, E Clark, J Gao - arXiv preprint arXiv:2006.14799, 2020 - arxiv.org
The paper surveys evaluation methods of natural language generation (NLG) systems that
have been developed in the last few years. We group NLG evaluation methods into three …

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

G Stein, J Cresswell, R Hosseinzadeh… - Advances in …, 2024 - proceedings.neurips.cc
We systematically study a wide variety of generative models spanning semantically-diverse
image datasets to understand and improve the feature extractors and metrics used to …

Empath: Understanding topic signals in large-scale text

E Fast, B Chen, MS Bernstein - Proceedings of the 2016 CHI conference …, 2016 - dl.acm.org
Human language is colored by a broad range of topics, but existing text analysis tools only
focus on a small number of them. We present Empath, a tool that can generate and validate …

Revolt: Collaborative crowdsourcing for labeling machine learning datasets

JC Chang, S Amershi, E Kamar - … of the 2017 CHI conference on human …, 2017 - dl.acm.org
Crowdsourcing provides a scalable and efficient way to construct labeled datasets for
training machine learning systems. However, creating comprehensive label guidelines for …

Credbank: A large-scale social media corpus with associated credibility annotations

T Mitra, E Gilbert - Proceedings of the international AAAI conference on …, 2015 - ojs.aaai.org
Social media has quickly risen to prominence as a news source, yet lingering doubts remain
about its ability to spread rumor and misinformation. Systematically studying this …

Street-level algorithms: A theory at the gaps between policy and decisions

A Alkhatib, M Bernstein - Proceedings of the 2019 CHI Conference on …, 2019 - dl.acm.org
Errors and biases are earning algorithms increasingly malignant reputations in society. A
central challenge is that algorithms must bridge the gap between high-level policy and on …

A comprehensive study of image classification model sensitivity to foregrounds, backgrounds, and visual attributes

M Moayeri, P Pope, Y Balaji… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
While datasets with single-label supervision have propelled rapid advances in image
classification, additional annotations are necessary in order to quantitatively assess how …