Deep analysis of an Arabic sentiment classification system based on lexical resource expansion and custom approaches building

I Touahri, A Mazroui - International Journal of Speech Technology, 2021 - Springer
Abstract
Sentiment analysis aims to extract emotions from a broad set of data. This paper studies the impact of lexical resource enrichment on Arabic sentiment analysis. Because Arabic lexical resources for sentiment analysis are scarce, we first build new resources using several lexicon construction methods. The first method is manual and consists of extracting sentiment words from a selected dataset; the second is semi-automatic and is based on translating an English lexicon into Arabic, followed by a manual check. Both methods generate terms in word form. In addition to these resources, the paper enriches an existing resource, which contains terms related to four specific domains, by creating its equivalent lemmatized version. By following these various methods, we created lexicons with different morphologies to enrich the existing Arabic resources. These resources are then used to develop a polarity classifier. The paper explains the steps followed to construct the different lexical resources, defines the pre-processing levels, and gives statistics for each lexicon. We then present the classification approaches used to determine the polarity of new data. To analyze the results in depth with respect to the extracted features, we opt for unsupervised and supervised approaches that give a clear view of their internal architecture and process. The experiments vary the features used and also apply a feature selection approach in order to keep the most pertinent features and reduce the size of the feature vector. Moreover, we perform an in-depth analysis of the feature vectors and the nature of the corpus, and we explain the main causes behind improvements and degradations in the results. The results of the tests carried out show the relevance of each component of the system.
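
As an illustration of the unsupervised, lexicon-based side of such a system, the following is a minimal sketch and not the authors' implementation. The lexicon entries, the whitespace tokenization, and the names TOY_LEXICON and polarity are all assumptions introduced here; the abstract does not specify the scoring rule, so a simple sum of word polarities is assumed.

```python
# Hypothetical toy lexicon: these entries are illustrative only and are
# not taken from the lexical resources described in the paper.
TOY_LEXICON = {
    "جيد": 1,     # "good"
    "ممتاز": 1,   # "excellent"
    "سيء": -1,    # "bad"
    "رديء": -1,   # "poor"
}


def polarity(text, lexicon=TOY_LEXICON):
    """Classify text by summing the polarity of each token found in the lexicon.

    Assumption: tokens are separated by whitespace and matched in word form,
    with no lemmatization or other pre-processing.
    """
    score = sum(lexicon.get(token, 0) for token in text.split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    print(polarity("الفندق جيد"))       # contains one positive lexicon word
    print(polarity("الطعام سيء رديء"))  # contains two negative lexicon words
```

A lemmatized lexicon, as described in the abstract, would be matched against lemmatized tokens instead of raw word forms; that step is omitted here because it depends on an external Arabic lemmatizer.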

example.edu/paper.pdf