AI4VIS: Survey on artificial intelligence approaches for data visualization

A Wu, Y Wang, X Shu, D Moritz, W Cui… - … on Visualization and …, 2021 - ieeexplore.ieee.org
Visualizations themselves have become a data format. Akin to other data formats such as
text and images, visualizations are increasingly created, stored, shared, and (re-)used with …

Natural language to visualization by neural machine translation

Y Luo, N Tang, G Li, J Tang, C Chai… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Supporting the translation from natural language (NL) query to visualization (NL2VIS) can
simplify the creation of data visualizations because if successful, anyone can generate …

A survey on ML4VIS: Applying machine learning advances to data visualization

Q Wang, Z Chen, Y Wang, H Qu - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Inspired by the great success of machine learning (ML), researchers have applied ML
techniques to visualizations to achieve a better design, development, and evaluation of …

Synthesizing natural language to visualization (NL2VIS) benchmarks from NL2SQL benchmarks

Y Luo, N Tang, G Li, C Chai, W Li, X Qin - Proceedings of the 2021 …, 2021 - dl.acm.org
Natural language (NL) is a promising interaction paradigm for data visualization (VIS).
However, there are not any NL to VIS (NL2VIS) benchmarks available. Our goal is to provide …

Data management for machine learning: A survey

C Chai, J Wang, Y Luo, Z Niu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Machine learning (ML) has widespread applications and has revolutionized many
industries, but suffers from several challenges. First, sufficient high-quality training data is …

Selective data acquisition in the wild for model charging

C Chai, J Liu, N Tang, G Li, Y Luo - Proceedings of the VLDB …, 2022 - dl.acm.org
The lack of sufficient labeled data is a key bottleneck for practitioners in many real-world
supervised machine learning (ML) tasks. In this paper, we study a new problem, namely …

VerifAI: verified generative AI

N Tang, C Yang, J Fan, L Cao, Y Luo… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI has made significant strides, yet concerns about the accuracy and reliability of
its outputs continue to grow. Such inaccuracies can have serious consequences such as …

Human-in-the-loop outlier detection

C Chai, L Cao, G Li, J Li, Y Luo, S Madden - Proceedings of the 2020 …, 2020 - dl.acm.org
Outlier detection is critical to a large number of applications from finance fraud detection to
health care. Although numerous approaches have been proposed to automatically detect …

GoodCore: Data-effective and data-efficient machine learning through coreset selection over incomplete data

C Chai, J Liu, N Tang, J Fan, D Miao, J Wang… - Proceedings of the …, 2023 - dl.acm.org
Given a dataset with incomplete data (e.g., missing values), training a machine learning
model over the incomplete data requires two steps. First, it requires a data-effective step that …

Beast: Scalable exploratory analytics on spatio-temporal data

A Eldawy, V Hristidis, S Ghosh, M Saeedan… - Proceedings of the 30th …, 2021 - dl.acm.org
This paper introduces the open-source Beast system for scalable exploratory data science
on big spatio-temporal data. Beast is based on well-established research and has been …