Word embeddings for code-mixed language processing- 学术资源搜索

文章

学术资源搜索

Word embeddings for code-mixed language processing

A Pratapa, M Choudhury, S Sitaram - Proceedings of the 2018 …, 2018 - aclanthology.org

Proceedings of the 2018 conference on empirical methods in natural …, 2018•aclanthology.org

We compare three existing bilingual word embedding approaches, and a novel approach of
training skip-grams on synthetic code-mixed text generated through linguistic models of
code-mixing, on two tasks-sentiment analysis and POS tagging for code-mixed text. Our
results show that while CVM and CCA based embeddings perform as well as the proposed
embedding technique on semantic and syntactic tasks respectively, the proposed approach
provides the best performance for both tasks overall. Thus, this study demonstrates that …

Abstract

We compare three existing bilingual word embedding approaches, and a novel approach of training skip-grams on synthetic code-mixed text generated through linguistic models of code-mixing, on two tasks-sentiment analysis and POS tagging for code-mixed text. Our results show that while CVM and CCA based embeddings perform as well as the proposed embedding technique on semantic and syntactic tasks respectively, the proposed approach provides the best performance for both tasks overall. Thus, this study demonstrates that existing bilingual embedding techniques are not ideal for code-mixed text processing and there is a need for learning multilingual word embedding from the code-mixed text.

aclanthology.org

展开收起

被引用次数：69 相关文章所有 2 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

Word embeddings for code-mixed language processing

引用