Large language models on graphs: A comprehensive survey

B Jin, G Liu, C Han, M Jiang, H Ji… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Large language models (LLMs), such as GPT4 and LLaMA, are creating significant
advancements in natural language processing, due to their strong text encoding/decoding …

Surveying stylometry techniques and applications

T Neal, K Sundararajan, A Fatima, Y Yan… - ACM Computing …, 2017 - dl.acm.org
The analysis of authorial style, termed stylometry, assumes that style is quantifiably
measurable for evaluation of distinctive qualities. Stylometry research has yielded several …

A survey of modern authorship attribution methods

E Stamatatos - Journal of the American Society for Information …, 2009 - Wiley Online Library
Authorship attribution supported by statistical or computational methods has a long history
starting from the 19th century and is marked by the seminal study of Mosteller and Wallace …

Computational methods in authorship attribution

M Koppel, J Schler, S Argamon - Journal of the American …, 2009 - Wiley Online Library
Statistical authorship attribution has a long history, culminating in the use of modern
machine learning classification methods. Nevertheless, most of this work suffers from the …

Large-scale Bayesian logistic regression for text categorization

A Genkin, DD Lewis, D Madigan - Technometrics, 2007 - Taylor & Francis
Logistic regression analysis of high-dimensional data, such as natural language text, poses
computational and statistical challenges. Maximum likelihood estimation often fails in these …

On the feasibility of internet-scale author identification

A Narayanan, H Paskov, NZ Gong… - … IEEE Symposium on …, 2012 - ieeexplore.ieee.org
We study techniques for identifying an anonymous author via linguistic stylometry, i.e.,
comparing the writing style against a corpus of texts of known authorship. We experimentally …

Determining if two documents are written by the same author

M Koppel, Y Winter - Journal of the Association for Information …, 2014 - Wiley Online Library
Almost any conceivable authorship attribution problem can be reduced to one fundamental
problem: whether a pair of (possibly short) documents were written by the same author. In …

Not all character n-grams are created equal: A study in authorship attribution

U Sapkota, S Bethard, M Montes… - Proceedings of the 2015 …, 2015 - aclanthology.org
Character n-grams have been identified as the most successful feature in both single-domain
and cross-domain Authorship Attribution (AA), but the reasons for their discriminative value …

Authorship attribution in the wild

M Koppel, J Schler, S Argamon - Language Resources and Evaluation, 2011 - Springer
Most previous work on authorship attribution has focused on the case in which we need to
attribute an anonymous document to one of a small set of candidate authors. In this paper …

N-gram feature selection for authorship identification

J Houvardas, E Stamatatos - International conference on artificial …, 2006 - Springer
Automatic authorship identification offers a valuable tool for supporting crime investigation
and security. It can be seen as a multi-class, single-label text categorization task. Character …