IDE: A System for Iterative Mislabel Detection Y Deng, D Qiyan, C Chai, L Cao, N Tang, J Fan, J Wang, Y Yuan, G Wang Companion of the 2024 International Conference on Management of Data, 500-503, 2024 | 2 | 2024 |
The Dawn of Natural Language to SQL: Are We Fully Ready? B Li, Y Luo, C Chai, G Li, N Tang arXiv preprint arXiv:2406.01265, 2024 | | 2024 |
MisDetect: Iterative Mislabel Detection using Early Loss Y Deng, C Chai, L Cao, N Tang, J Wang, J Fan, Y Yuan, G Wang Association for Computing Machinery (ACM), 2024 | 3 | 2024 |
LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes Y Deng, C Chai, L Cao, Q Yuan, S Chen, Y Yu, Z Sun, J Wang, J Li, Z Cao, ... Proceedings of the VLDB Endowment 17 (8), 1925-1938, 2024 | | 2024 |
PACE: Poisoning Attacks on Learned Cardinality Estimation J Zhang, C Zhang, G Li, C Chai Proceedings of the ACM on Management of Data 2 (1), 1-27, 2024 | 2 | 2024 |
Cardinality estimation using normalizing flow J Wang, C Chai, J Liu, G Li The VLDB Journal 33 (2), 323-348, 2024 | 2 | 2024 |
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration M Fan, X Han, J Fan, C Chai, N Tang, G Li, X Du arXiv preprint arXiv:2312.03987, 2023 | 2 | 2023 |
Efficient coreset selection with cluster-based methods C Chai, J Wang, N Tang, Y Yuan, J Liu, Y Deng, G Wang Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 5 | 2023 |
Goodcore: Data-effective and data-efficient machine learning through coreset selection over incomplete data C Chai, J Liu, N Tang, J Fan, D Miao, J Wang, Y Luo, G Li Proceedings of the ACM on Management of Data 1 (2), 1-27, 2023 | 14 | 2023 |
Demystifying Artificial Intelligence for Data Preparation C Chai, N Tang, J Fan, Y Luo Companion of the 2023 International Conference on Management of Data, 13-20, 2023 | 5 | 2023 |
Learned data-aware image representations of line charts for similarity search Y Luo, Y Zhou, N Tang, G Li, C Chai, L Shen Proceedings of the ACM on Management of Data 1 (1), 1-29, 2023 | 9 | 2023 |
Haipipe: Combining human-generated and machine-generated pipelines for data preparation S Chen, N Tang, J Fan, X Yan, C Chai, G Li, X Du Proceedings of the ACM on Management of Data 1 (1), 1-26, 2023 | 5 | 2023 |
A Topic-Aware Data Generation Framework for Math Word Problems T Zhao, C Chai, J Liu, G Li, J Feng, Z Liu International Conference on Database Systems for Advanced Applications, 286-302, 2023 | | 2023 |
Autoce: An accurate and efficient model advisor for learned cardinality estimation J Zhang, C Zhang, G Li, C Chai 2023 IEEE 39th International Conference on Data Engineering (ICDE), 2621-2633, 2023 | 7 | 2023 |
HOFD: An Outdated Fact Detector for Knowledge Bases S Hao, C Chai, G Li, N Tang, N Wang, X Yu IEEE Transactions on Knowledge and Data Engineering 35 (10), 10775-10789, 2023 | 1 | 2023 |
Cost-based or learning-based? A hybrid query optimizer for query plan selection X Yu, C Chai, G Li, J Liu Proceedings of the VLDB Endowment 15 (13), 3924-3936, 2022 | 32 | 2022 |
Coresets over multiple tables for feature-rich and data-efficient machine learning J Wang, C Chai, N Tang, J Liu, G Li Proceedings of the VLDB Endowment 16 (1), 64-76, 2022 | 10 | 2022 |
Dader: hands-off entity resolution with domain adaptation J Tu, X Han, J Fan, N Tang, C Chai, G Li, X Du Proceedings of the VLDB Endowment 15 (12), 3666-3669, 2022 | 9 | 2022 |
Interactively discovering and ranking desired tuples by data exploration X Qin, C Chai, Y Luo, T Zhao, N Tang, G Li, J Feng, X Yu, M Ouzzani The VLDB Journal 31 (4), 753-777, 2022 | 10 | 2022 |
Learnedsqlgen: Constraint-aware sql generation using reinforcement learning L Zhang, C Chai, X Zhou, G Li Proceedings of the 2022 International Conference on Management of Data, 945-958, 2022 | 17 | 2022 |