Evaluation metrics for unsupervised learning algorithms

JO Palacio-Niño, F Berzal - arXiv preprint arXiv:1905.05667, 2019 - arxiv.org
… for clustering, and describe a taxonomy of evaluation criteria for unsupervised machine
learning. We also survey many of the evaluation metrics that have been proposed in the literature…

Unsupervised learning of linguistic structure: an empirical evaluation

D Powers - International Journal of Corpus Linguistics, 1997 - jbe-platform.com
… We conclude with an evaluation of the relative utility of a large array of different metrics and
… is a report on a language learning project, our approach to evaluating metrics is in no way …

A new distance metric for unsupervised learning of categorical data

H Jia, Y Cheung, J Liu - … on neural networks and learning …, 2015 - ieeexplore.ieee.org
… a metric to quantify the distance between categorical data for unsupervised learning well. In
this … The evaluations of clustering outcomes obtained by the k-modes algorithm with different …

How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation

CW Liu, R Lowe, IV Serban, M Noseworthy… - arXiv preprint arXiv …, 2016 - arxiv.org
… We investigate evaluation metrics for dialogue … metrics from machine translation to
compare a model’s generated response to a single target response. We show that these metrics

Unsupervised evaluation metrics and learning criteria for non-parallel textual transfer

RY Pang, K Gimpel - arXiv preprint arXiv:1810.11878, 2018 - arxiv.org
… 2018), but all either lack certain categories of unsupervised metric or lack human validation
of them, which we contribute. Moreover, the textual transfer community lacks discussion of …

SUPERT: Towards new frontiers in unsupervised evaluation metrics for multi-document summarization

Y Gao, W Zhao, S Eger - arXiv preprint arXiv:2005.03724, 2020 - arxiv.org
… for evaluating multidocument summaries, we investigate unsupervised evaluation methods,
which … In particular, we focus on evaluating the relevance (Peyrard, 2019) of multi-document …

A sober look at the unsupervised learning of disentangled representations and their evaluation

F Locatello, S Bauer, M Lucic, G Rätsch, S Gelly… - … of Machine Learning …, 2020 - jmlr.org
… As robustly capturing these statistical dependencies is a crucial step of the evaluation
metrics that do not rely on interventions, we argue that future work on disentanglement scores …

Uscore: An effective approach to fully unsupervised evaluation metrics for machine translation

J Belouadi, S Eger - arXiv preprint arXiv:2202.10062, 2022 - arxiv.org
unsupervised evaluation metrics. To do so, we leverage similarities and synergies between
evaluation metric … In particular, we use an unsupervised evaluation metric to mine pseudo-…

Unsupervised learning and clustering

D Greene, P Cunningham, R Mayer - … learning techniques for multimedia …, 2008 - Springer
… in unsupervised learning makes the question of evaluation and cluster quality assessment
more complicated than in supervised learning. So … of the features and metrics used during the …

Answers unite! unsupervised metrics for reinforced summarization models

T Scialom, S Lamprier, B Piwowarski… - arXiv preprint arXiv …, 2019 - arxiv.org
… the model parameters through Reinforcement Learning (RL) techniques. This makes the
choice of a good evaluation metric even more important. Unfortunately, ROUGE is known to …