Authorship identification using recurrent neural networks

STP Gupta, JK Sahoo, RK Roul - … on Information System and Data Mining, 2019 - dl.acm.org
Proceedings of the 2019 3rd International Conference on Information System …, 2019dl.acm.org
Authorship identification is the process of revealing the hidden identity of authors from a
corpus of literary data based on a stylometric analysis of the text. It has essential
applications in various fields, such as cyber-forensics, plagiarism detection, and political
socialization. This paper aims to use a deep learning approach for the task of authorship
identification by defining a suitable characterization of texts to capture the distinctive style of
an author. The proposed model uses an index based word embedding for the C50 and the …
Authorship identification is the process of revealing the hidden identity of authors from a corpus of literary data based on a stylometric analysis of the text. It has essential applications in various fields, such as cyber-forensics, plagiarism detection, and political socialization. This paper aims to use a deep learning approach for the task of authorship identification by defining a suitable characterization of texts to capture the distinctive style of an author. The proposed model uses an index based word embedding for the C50 and the BBC datasets, applied to the input data of article level Long Short Term Memory (LSTM) network and Gated Recurrent Unit (GRU) network models. A comparative study of this new variant of embeddings is done with the standard approach of pre-trained word embeddings.
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果