Overview of PAN’17: author identification, author profiling, and author obfuscation

T Foltýnek, N Meuschke, B Gipp - ACM Computing Surveys (CSUR), 2019 - dl.acm.org

This article summarizes the research on computational methods to detect academic
plagiarism by systematically reviewing 239 research papers published between 2013 and …

被引用次数：284 相关文章所有 5 个版本

[HTML] sciencedirect.com

[HTML][HTML] Psychographic traits identification based on political ideology: An author analysis study on spanish politicians' tweets posted in 2020

JA García-Díaz, R Colomo-Palacios… - Future Generation …, 2022 - Elsevier

In general, people are usually more reluctant to follow advice and directions from politicians
who do not have their ideology. In extreme cases, people can be heavily biased in favour of …

被引用次数：72 相关文章所有 3 个版本

[PDF] arxiv.org

Towards robust and privacy-preserving text representations

Y Li, T Baldwin, T Cohn - arXiv preprint arXiv:1805.06093, 2018 - arxiv.org

Written text often provides sufficient clues to identify the author, their gender, age, and other
important attributes. Consequently, the authorship of training and evaluation corpora can …

被引用次数：213 相关文章所有 4 个版本

[PDF] ieee.org

Privacy-preservation in the context of natural language processing

D Mahendran, C Luo, BT Mcinnes - IEEE Access, 2021 - ieeexplore.ieee.org

Data privacy is one of the highly discussed issues in recent years as we encounter data
breaches and privacy scandals often. This raises a lot of concerns about the ways the data is …

被引用次数：23 相关文章所有 3 个版本

[PDF] arxiv.org

Differentially private representation for nlp: Formal guarantee and an empirical study on privacy and fairness

L Lyu, X He, Y Li - arXiv preprint arXiv:2010.01285, 2020 - arxiv.org

It has been demonstrated that hidden representation learned by a deep model can encode
private information of the input, hence can be exploited to recover such information with …

被引用次数：92 相关文章所有 5 个版本

Machine learning methods for stylometry

J Savoy - Cham: Springer, 2020 - Springer

With the recent progress made in network and computing technology, the ubiquity of data,
and textual repositories freely available, the scientific practice evolves towards a more data …

被引用次数：88 相关文章所有 4 个版本

[PDF] oapen.org

[PDF][PDF] Generalised differential privacy for text document processing

N Fernandes, M Dras, A McIver - … , POST 2019, Held as Part of the …, 2019 - library.oapen.org

We address the problem of how to “obfuscate” texts by removing stylistic clues which can
identify authorship, whilst preserving (as much as possible) the content of the text. In this …

被引用次数：135 相关文章所有 15 个版本

[PDF] arxiv.org

N-gram: New groningen author-profiling model

A Basile, G Dwyer, M Medvedeva, J Rawee… - arXiv preprint arXiv …, 2017 - arxiv.org

We describe our participation in the PAN 2017 shared task on Author Profiling, identifying
authors' gender and language variety for English, Spanish, Arabic and Portuguese. We …

被引用次数：95 相关文章所有 13 个版本

[PDF] arxiv.org

Privacy preserving text representation learning

G Beigi, K Shu, R Guo, S Wang, H Liu - … of the 30th ACM Conference on …, 2019 - dl.acm.org

Online users generate tremendous amounts of textual information by participating in
different online activities. This data provides opportunities for researchers and business …

被引用次数：52 相关文章所有 9 个版本

[PDF] arxiv.org

Listening between the lines: Learning personal attributes from conversations

A Tigunova, A Yates, P Mirza, G Weikum - The World Wide Web …, 2019 - dl.acm.org

Open-domain dialogue agents must be able to converse about many topics while
incorporating knowledge about the user into the conversation. In this work we address the …

被引用次数：48 相关文章所有 8 个版本

高级搜索

QQ 群