Text categorization by learning predominant sense of words as auxiliary task

K Shimura, J Li, F Fukumoto - … of the 57th Annual Meeting of the …, 2019 - aclanthology.org
K Shimura, J Li, F Fukumoto
Proceedings of the 57th Annual Meeting of the Association for …, 2019aclanthology.org
Distributions of the senses of words are often highly skewed and give a strong influence of
the domain of a document. This paper follows the assumption and presents a method for text
categorization by leveraging the predominant sense of words depending on the domain, ie,
domain-specific senses. The key idea is that the features learned from predominant senses
are possible to discriminate the domain of the document and thus improve the overall
performance of text categorization. We propose multi-task learning framework based on the …
Abstract
Distributions of the senses of words are often highly skewed and give a strong influence of the domain of a document. This paper follows the assumption and presents a method for text categorization by leveraging the predominant sense of words depending on the domain, ie, domain-specific senses. The key idea is that the features learned from predominant senses are possible to discriminate the domain of the document and thus improve the overall performance of text categorization. We propose multi-task learning framework based on the neural network model, transformer, which trains a model to simultaneously categorize documents and predicts a predominant sense for each word. The experimental results using four benchmark datasets show that our method is comparable to the state-of-the-art categorization approach, especially our model works well for categorization of multi-label documents.
aclanthology.org
以上显示的是最相近的搜索结果。 查看全部搜索结果