关注
Chengliang Chai
Chengliang Chai
在 bit.edu.cn 的电子邮件经过验证
标题
引用次数
年份
IDE: A System for Iterative Mislabel Detection
Y Deng, D Qiyan, C Chai, L Cao, N Tang, J Fan, J Wang, Y Yuan, G Wang
Companion of the 2024 International Conference on Management of Data, 500-503, 2024
22024
The Dawn of Natural Language to SQL: Are We Fully Ready?
B Li, Y Luo, C Chai, G Li, N Tang
arXiv preprint arXiv:2406.01265, 2024
2024
MisDetect: Iterative Mislabel Detection using Early Loss
Y Deng, C Chai, L Cao, N Tang, J Wang, J Fan, Y Yuan, G Wang
Association for Computing Machinery (ACM), 2024
32024
LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes
Y Deng, C Chai, L Cao, Q Yuan, S Chen, Y Yu, Z Sun, J Wang, J Li, Z Cao, ...
Proceedings of the VLDB Endowment 17 (8), 1925-1938, 2024
2024
PACE: Poisoning Attacks on Learned Cardinality Estimation
J Zhang, C Zhang, G Li, C Chai
Proceedings of the ACM on Management of Data 2 (1), 1-27, 2024
22024
Cardinality estimation using normalizing flow
J Wang, C Chai, J Liu, G Li
The VLDB Journal 33 (2), 323-348, 2024
22024
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration
M Fan, X Han, J Fan, C Chai, N Tang, G Li, X Du
arXiv preprint arXiv:2312.03987, 2023
22023
Efficient coreset selection with cluster-based methods
C Chai, J Wang, N Tang, Y Yuan, J Liu, Y Deng, G Wang
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
52023
Goodcore: Data-effective and data-efficient machine learning through coreset selection over incomplete data
C Chai, J Liu, N Tang, J Fan, D Miao, J Wang, Y Luo, G Li
Proceedings of the ACM on Management of Data 1 (2), 1-27, 2023
142023
Demystifying Artificial Intelligence for Data Preparation
C Chai, N Tang, J Fan, Y Luo
Companion of the 2023 International Conference on Management of Data, 13-20, 2023
52023
Learned data-aware image representations of line charts for similarity search
Y Luo, Y Zhou, N Tang, G Li, C Chai, L Shen
Proceedings of the ACM on Management of Data 1 (1), 1-29, 2023
92023
Haipipe: Combining human-generated and machine-generated pipelines for data preparation
S Chen, N Tang, J Fan, X Yan, C Chai, G Li, X Du
Proceedings of the ACM on Management of Data 1 (1), 1-26, 2023
52023
A Topic-Aware Data Generation Framework for Math Word Problems
T Zhao, C Chai, J Liu, G Li, J Feng, Z Liu
International Conference on Database Systems for Advanced Applications, 286-302, 2023
2023
Autoce: An accurate and efficient model advisor for learned cardinality estimation
J Zhang, C Zhang, G Li, C Chai
2023 IEEE 39th International Conference on Data Engineering (ICDE), 2621-2633, 2023
72023
HOFD: An Outdated Fact Detector for Knowledge Bases
S Hao, C Chai, G Li, N Tang, N Wang, X Yu
IEEE Transactions on Knowledge and Data Engineering 35 (10), 10775-10789, 2023
12023
Cost-based or learning-based? A hybrid query optimizer for query plan selection
X Yu, C Chai, G Li, J Liu
Proceedings of the VLDB Endowment 15 (13), 3924-3936, 2022
322022
Coresets over multiple tables for feature-rich and data-efficient machine learning
J Wang, C Chai, N Tang, J Liu, G Li
Proceedings of the VLDB Endowment 16 (1), 64-76, 2022
102022
Dader: hands-off entity resolution with domain adaptation
J Tu, X Han, J Fan, N Tang, C Chai, G Li, X Du
Proceedings of the VLDB Endowment 15 (12), 3666-3669, 2022
92022
Interactively discovering and ranking desired tuples by data exploration
X Qin, C Chai, Y Luo, T Zhao, N Tang, G Li, J Feng, X Yu, M Ouzzani
The VLDB Journal 31 (4), 753-777, 2022
102022
Learnedsqlgen: Constraint-aware sql generation using reinforcement learning
L Zhang, C Chai, X Zhou, G Li
Proceedings of the 2022 International Conference on Management of Data, 945-958, 2022
172022
系统目前无法执行此操作,请稍后再试。
文章 1–20