On the impossible safety of large AI models

EM El-Mhamdi, S Farhadkhani, R Guerraoui… - arXiv preprint arXiv …, 2022 - arxiv.org
Large AI Models (LAIMs), of which large language models are the most prominent recent
example, showcase some impressive performance. However they have been empirically …

On the strategyproofness of the geometric median

EM El-Mhamdi, S Farhadkhani… - International …, 2023 - proceedings.mlr.press
The geometric median, an instrumental component of the secure machine learning toolbox,
is known to be effective when robustly aggregating models (or gradients), gathered from …

Identify ambiguous tasks combining crowdsourced labels by weighting Areas Under the Margin

T Lefort, B Charlier, A Joly, J Salmon - arXiv preprint arXiv:2209.15380, 2022 - arxiv.org
In supervised learning-for instance in image classification-modern massive datasets are
commonly labeled by a crowd of workers. The obtained labels in this crowdsourcing setting …

Robust sparse voting

Y Allouah, R Guerraoui, LN Hoang… - International …, 2024 - proceedings.mlr.press
Many applications, such as content moderation and recommendation, require reviewing and
scoring a large number of alternatives. Doing so robustly is however very challenging …

Should YouTube make recommendations for the climate?

M Gibert, LN Hoang, M Lambrecht - Ethics and Information Technology, 2024 - Springer
In this article, we argue that YouTube's algorithm should be programmed to make a modest
but significant percentage (eg 2%) of recommendations for the climate. Just as a librarian …

Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons

J Fageot, S Farhadkhani, LN Hoang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Many applications, eg in content recommendation, sports, or recruitment, leverage the
comparisons of alternatives to score those alternatives. The classical Bradley-Terry model …

Volition Learning: What Would You Prefer to Prefer?

M Lechiakh, A Maurer - International Conference on Human-Computer …, 2023 - Springer
When queried, humans do not always give their utmost attention to provide their best
possible answer. In particular, they may give different answers to the same question …

Tournesol: Permissionless collaborative algorithmic governance with security guarantees

LN Hoang, R Beylerian, B Colbois, J Fageot… - arXiv preprint arXiv …, 2022 - arxiv.org
Recommendation algorithms play an increasingly central role in our information ecosystem.
Yet, so far, they are mostly designed, parameterized and updated unilaterally by private …

Malicious website detection based on URL classification: A comparative analysis

S Maurya, A Jain - Proceedings of Third International Conference on …, 2022 - Springer
Phishing has been one of the most frequent cyber threats in the recent decade, prompting
an increase in anti-phishing research and the development of numerous solutions for …

Improve learning combining crowdsourced labels by weighting Areas Under the Margin

T Lefort, B Charlier, A Joly, J Salmon - 2022 - openreview.net
In supervised learning--for instance in image classification--modern massive datasets are
commonly labelled by a crowd of workers. The obtained labels in this crowdsourcing setting …