Authors
Minh-Tien Nguyen, Dung Tien Le, Nguyen Hong Son, Bui Cong Minh
Publication date
2020/10
Conference
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Pages
478-487
Description
Transformers have recently achieved promising results in many natural language processing tasks; however, how well transformers perform information extraction (IE) in business scenarios is still an open question. This paper addresses that gap by investigating the behavior of transformers when extracting information from domain-specific business documents. To do so, we take transformers pre-trained on a large amount of general data and fine-tune them on our downstream IE task via transfer learning. Experimental results on three Japanese datasets show that the F-score margins among transformers are small, but some models achieve high accuracy with only a small amount of training data.
Scholar articles
MT Nguyen, DT Le, NH Son, BC Minh - Proceedings of the 34th Pacific Asia Conference on …, 2020