Modeling uncertainty as ambiguity: A review

CL Ilut, M Schneider - 2022 - nber.org
We survey literature on ambiguity with an emphasis on recent applications in
macroeconomics and finance. Like risk, ambiguity leads to cautious behavior and …

Learning and self-confirming long-run biases

P Battigalli, A Francetich, G Lanzani… - Journal of Economic …, 2019 - Elsevier
We consider an ambiguity averse, sophisticated decision maker facing a recurrent decision
problem where information is generated endogenously. In this context, we study self …

Optimal learning under robustness and time-consistency

LG Epstein, S Ji - Operations Research, 2022 - pubsonline.informs.org
We model learning in a continuous-time Brownian setting where there is prior ambiguity.
The associated model of preference values robustness and is time-consistent. It is applied to …

Robust experimentation in the continuous time bandit problem

F Pourbabaee - Economic Theory, 2020 - Springer
We study the experimentation dynamics of a decision maker (DM) in a two-armed bandit
setup (Bolton and Harris in Econometrica 67 (2): 349–374, 1999), where the agent holds …

Gittins' theorem under uncertainty

SN Cohen, T Treetanthiploet - Electronic Journal of Probability, 2022 - projecteuclid.org
We study dynamic allocation problems for discrete time multi-armed bandits under
uncertainty, based on the the theory of nonlinear expectations. We show that, under …

High dimensional decision making, upper and lower bounds

F Pourbabaee - Economics Letters, 2021 - Elsevier
A decision maker's utility depends on her action a∈ A⊂ R d and the payoff relevant state of
the world θ∈ Θ. One can define the value of acquiring new information as the difference …

On the (non-) reliance on algorithms—A decision-theoretic account

B Sinclair-Desgagné - Journal of Mathematical Psychology, 2024 - Elsevier
A wealth of empirical evidence shows that people display opposite behaviors when deciding
whether to rely on an algorithm, even if it is inexpensive to do so and using the algorithm …

A Robust Exploration Strategy in Reinforcement Learning Based on Temporal Difference Error

MS Hajar, H Kalutarage, MO Al-Kadri - Australasian Joint Conference on …, 2022 - Springer
Exploration is a critical component in reinforcement learning algorithms. Exploration
exploitation trade-off is still a fundamental dilemma in reinforcement learning. The learning …

Stochastic control approach to the multi-armed bandit problems

T Treetanthiploet - 2021 - ora.ox.ac.uk
A multi-armed bandit is the simplest problem to study learning under uncertainty when
decisions affect information. A standard approach to the multi-armed bandit often gives a …

Decision making with dynamic uncertain continuous information

S Reches, M Kalech - Expert Systems with Applications, 2020 - Elsevier
Decision making is the ability to select the best alternative from a set of candidates based on
their respective values. When the value depends on uncertain future events, this task …