Data management for machine learning: A survey C Chai, J Wang, Y Luo, Z Niu, G Li IEEE Transactions on Knowledge and Data Engineering 35 (5), 4646-4667, 2022 | 73 | 2022 |
FACE: a normalizing flow based cardinality estimator J Wang, C Chai, J Liu, G Li Proceedings of the VLDB Endowment 15 (1), 72-84, 2021 | 69 | 2021 |
GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data C Chai, J Liu, N Tang, J Fan, D Miao, J Wang, Y Luo, G Li Proceedings of the ACM on Management of Data 1 (2), 1-27, 2023 | 22 | 2023 |
Coresets over multiple tables for feature-rich and data-efficient machine learning J Wang, C Chai, N Tang, J Liu, G Li Proceedings of the VLDB Endowment 16 (1), 64-76, 2022 | 18 | 2022 |
Efficient Coreset Selection with Cluster-based Methods C Chai, J Wang, N Tang, Y Yuan, J Liu, Y Deng, G Wang Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 9 | 2023 |
Cardinality estimation using normalizing flow J Wang, C Chai, J Liu, G Li The VLDB Journal 33 (2), 323-348, 2024 | 7 | 2024 |
MisDetect: Iterative Mislabel Detection using Early Loss Y Deng, C Chai, L Cao, N Tang, J Wang, J Fan, Y Yuan, G Wang Proceedings of the VLDB Endowment, 2024 | 5 | 2024 |
IDE: A System for Iterative Mislabel Detection Y Deng, Q Deng, C Chai, L Cao, N Tang, J Fan, J Wang, Y Yuan, G Wang Companion of the 2024 International Conference on Management of Data, 500-503, 2024 | 2 | 2024 |
AOP: Automated and Interactive LLM Pipeline Orchestration for Answering Complex Queries J Wang, G Li | | |