Academic plagiarism detection: a systematic literature review

T Foltýnek, N Meuschke, B Gipp - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
This article summarizes the research on computational methods to detect academic
plagiarism by systematically reviewing 239 research papers published between 2013 and …

[HTML][HTML] Psychographic traits identification based on political ideology: An author analysis study on spanish politicians' tweets posted in 2020

JA García-Díaz, R Colomo-Palacios… - Future Generation …, 2022 - Elsevier
In general, people are usually more reluctant to follow advice and directions from politicians
who do not have their ideology. In extreme cases, people can be heavily biased in favour of …

Towards robust and privacy-preserving text representations

Y Li, T Baldwin, T Cohn - arXiv preprint arXiv:1805.06093, 2018 - arxiv.org
Written text often provides sufficient clues to identify the author, their gender, age, and other
important attributes. Consequently, the authorship of training and evaluation corpora can …

Privacy-preservation in the context of natural language processing

D Mahendran, C Luo, BT Mcinnes - IEEE Access, 2021 - ieeexplore.ieee.org
Data privacy is one of the highly discussed issues in recent years as we encounter data
breaches and privacy scandals often. This raises a lot of concerns about the ways the data is …

Differentially private representation for nlp: Formal guarantee and an empirical study on privacy and fairness

L Lyu, X He, Y Li - arXiv preprint arXiv:2010.01285, 2020 - arxiv.org
It has been demonstrated that hidden representation learned by a deep model can encode
private information of the input, hence can be exploited to recover such information with …

Machine learning methods for stylometry

J Savoy - Cham: Springer, 2020 - Springer
With the recent progress made in network and computing technology, the ubiquity of data,
and textual repositories freely available, the scientific practice evolves towards a more data …

[PDF][PDF] Generalised differential privacy for text document processing

N Fernandes, M Dras, A McIver - … , POST 2019, Held as Part of the …, 2019 - library.oapen.org
We address the problem of how to “obfuscate” texts by removing stylistic clues which can
identify authorship, whilst preserving (as much as possible) the content of the text. In this …

N-gram: New groningen author-profiling model

A Basile, G Dwyer, M Medvedeva, J Rawee… - arXiv preprint arXiv …, 2017 - arxiv.org
We describe our participation in the PAN 2017 shared task on Author Profiling, identifying
authors' gender and language variety for English, Spanish, Arabic and Portuguese. We …

Privacy preserving text representation learning

G Beigi, K Shu, R Guo, S Wang, H Liu - … of the 30th ACM Conference on …, 2019 - dl.acm.org
Online users generate tremendous amounts of textual information by participating in
different online activities. This data provides opportunities for researchers and business …

Listening between the lines: Learning personal attributes from conversations

A Tigunova, A Yates, P Mirza, G Weikum - The World Wide Web …, 2019 - dl.acm.org
Open-domain dialogue agents must be able to converse about many topics while
incorporating knowledge about the user into the conversation. In this work we address the …