Evallm: Interactive evaluation of large language model prompts on user-defined criteria

TS Kim, Y Lee, J Shin, YH Kim, J Kim - … of the CHI Conference on Human …, 2024 - dl.acm.org
By simply composing prompts, developers can prototype novel generative applications with
Large Language Models (LLMs). To refine prototypes into products, however, developers …

Farsight: Fostering Responsible AI Awareness During AI Application Prototyping

ZJ Wang, C Kulkarni, L Wilcox, M Terry… - Proceedings of the CHI …, 2024 - dl.acm.org
Prompt-based interfaces for Large Language Models (LLMs) have made prototyping and
building AI-powered applications easier than ever before. However, identifying potential …

Canvil: Designerly Adaptation for LLM-Powered User Experiences

KJ Feng, QV Liao, Z Xiao, JW Vaughan… - arXiv preprint arXiv …, 2024 - arxiv.org
Advancements in large language models (LLMs) are poised to spark a proliferation of LLM-
powered user experiences. In product teams, designers are often tasked with crafting user …

Wizmap: Scalable interactive visualization for exploring large machine learning embeddings

ZJ Wang, F Hohman, DH Chau - arXiv preprint arXiv:2306.09328, 2023 - arxiv.org
Machine learning models often learn latent embedding representations that capture the
domain semantics of their training data. These embedding representations are valuable for …

Only diff is not enough: Generating commit messages leveraging reasoning and action of large language model

J Li, D Faragó, C Petrov, I Ahmed - Proceedings of the ACM on Software …, 2024 - dl.acm.org
Commit messages play a vital role in software development and maintenance. While
previous research has introduced various Commit Message Generation (CMG) approaches …

Interactive reweighting for mitigating label quality issues

W Yang, Y Guo, J Wu, Z Wang, LZ Guo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Label quality issues, such as noisy labels and imbalanced class distributions, have negative
effects on model performance. Automatic reweighting methods identify problematic samples …

Towards a Non-Ideal Methodological Framework for Responsible ML

RK Mothilal, S Guha, SI Ahmed - arXiv preprint arXiv:2401.11131, 2024 - arxiv.org
Though ML practitioners increasingly employ various Responsible ML (RML) strategies,
their methodological approach in practice is still unclear. In particular, the constraints …

Towards a Non-Ideal Methodological Framework for Responsible ML

R Kommiya Mothilal, S Guha, SI Ahmed - Proceedings of the CHI …, 2024 - dl.acm.org
Though ML practitioners increasingly employ various Responsible ML (RML) strategies,
their methodological approach in practice is still unclear. In particular, the constraints …

Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments

A Boggust, V Sivaraman, Y Assogba… - … on Visualization and …, 2024 - ieeexplore.ieee.org
To deploy machine learning models on-device, practitioners use compression algorithms to
shrink and speed up models while maintaining their high-quality output. A critical aspect of …

A survey of visual analytics research for improving training data quality

W Yang, C Chen, J Zhu, L Li, P Liu, S Liu - Journal of Computer-Aided …, 2023 - jcad.cn
In the applications of machine learning, it is difficult to ensure the quality of training data due
to the various sources of training data and the inexperience of some annotators. By tightly …