Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, R Guo, H Cheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Online decision transformer

Q Zheng, A Zhang, A Grover - international conference on …, 2022 - proceedings.mlr.press
Recent work has shown that offline reinforcement learning (RL) can be formulated as a
sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via …

Outside the echo chamber: Optimizing the performative risk

JP Miller, JC Perdomo, T Zrnic - International Conference on …, 2021 - proceedings.mlr.press
In performative prediction, predictions guide decision-making and hence can influence the
distribution of future data. To date, work on performative prediction has focused on finding …

Performative power

M Hardt, M Jagadeesan… - Advances in Neural …, 2022 - proceedings.neurips.cc
We introduce the notion of performative power, which measures the ability of a firm
operating an algorithmic system, such as a digital content recommendation platform, to …

Anticipating performativity by predicting from predictions

C Mendler-Dünner, F Ding… - Advances in neural …, 2022 - proceedings.neurips.cc
Predictions about people, such as their expected educational achievement or their credit
risk, can be performative and shape the outcome that they are designed to predict …

How to learn when data reacts to your model: performative gradient descent

Z Izzo, L Ying, J Zou - International Conference on Machine …, 2021 - proceedings.mlr.press
Performative distribution shift captures the setting where the choice of which ML model is
deployed changes the data distribution. For example, a bank which uses the number of …

Stochastic optimization with decision-dependent distributions

D Drusvyatskiy, L Xiao - Mathematics of Operations …, 2023 - pubsonline.informs.org
Stochastic optimization problems often involve data distributions that change in reaction to
the decision variables. This is the case, for example, when members of the population …

Performative prediction in a stateful world

G Brown, S Hod, I Kalemaj - International conference on …, 2022 - proceedings.mlr.press
Deployed supervised machine learning models make predictions that interact with and
influence the world. This phenomenon is called performative prediction by Perdomo et …

Multiplayer performative prediction: Learning in decision-dependent games

A Narang, E Faulkner, D Drusvyatskiy, M Fazel… - Journal of Machine …, 2023 - jmlr.org
Learning problems commonly exhibit an interesting feedback mechanism wherein the
population data reacts to competing decision makers' actions. This paper formulates a new …

Alternative microfoundations for strategic classification

M Jagadeesan, C Mendler-Dünner… - … on Machine Learning, 2021 - proceedings.mlr.press
When reasoning about strategic behavior in a machine learning context it is tempting to
combine standard microfoundations of rational agents with the statistical decision theory …