ConDA: state-based data augmentation for context-dependent text-to-SQL

D Wang, L Dou, W Che, J Wang, J Liu, L Li… - International Journal of …, 2024 - Springer
D Wang, L Dou, W Che, J Wang, J Liu, L Li, J Shang, L Tao, J Zhang, C Fu, X Song
International Journal of Machine Learning and Cybernetics, 2024Springer
The context-dependent text-to-SQL task has profound real-world implications, as it facilitates
users in extracting knowledge from vast databases, which allows users to acquire the
information interactively for better accuracy. Unfortunately, current models struggle to
address this task effectively due to the scarcity of data led by the high annotation overhead.
The most straightforward method for addressing this problem is data augmentation, which
aims at scaling up the parsing corpus. However, the naive methods suffer from the low …
Abstract
The context-dependent text-to-SQL task has profound real-world implications, as it facilitates users in extracting knowledge from vast databases, which allows users to acquire the information interactively for better accuracy. Unfortunately, current models struggle to address this task effectively due to the scarcity of data led by the high annotation overhead. The most straightforward method for addressing this problem is data augmentation, which aims at scaling up the parsing corpus. However, the naive methods suffer from the low diversity of the augmented data. To address this limitation, we propose the state-based CONtext-dependent text-to-SQL Data Augmentation (ConDA), which generate and filter augmented data based on the dialogue state, which has higher diversity. Experimental results show that ConDA yields performance improvement on all experimental datasets with an average boosting of , proving the effectiveness of our method.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果