作者
Jiabin Liu, Chengliang Chai, Yuyu Luo, Yin Lou, Jianhua Feng, Nan Tang
发表日期
2022/5/9
研讨会论文
2022 IEEE 38th International Conference on Data Engineering (ICDE)
页码范围
3360-3372
出版商
IEEE
简介
Sufficient good features are indispensable to train well-performed machine learning models. However, it is com-mon that good features are not always enough, where feature augmentation is necessary to enrich high-quality features by joining with other tables. There are two main challenges for the problem. Given a set of tables where we can augment features from, the first challenge is that there are a lot of ways of joining multiple tables and deciding which features (or attributes) to use - selecting the best set of features to augment is hard. Moreover, we may need to materialize the join results for different join options, doing full materialization might be time consuming - efficient but approximate methods are needed. In this paper, we first introduce the design space of the feature augmentation problem. Then, to address the above challenges, we propose a reinforcement learning based framework, namely …
引用总数
学术搜索中的文章
J Liu, C Chai, Y Luo, Y Lou, J Feng, N Tang - 2022 IEEE 38th International Conference on Data …, 2022