Vertex-reinforced random walk on Z has finite range

R Pemantle - 2007 - projecteuclid.org

The models surveyed include generalized Pólya urns, reinforced random walks, interacting
urn models, and continuous reinforced processes. Emphasis is on methods and results, with …

被引用次数：635 相关文章所有 22 个版本

[PDF] ox.ac.uk

On the convergence of reinforcement learning

AW Beggs - Journal of economic theory, 2005 - Elsevier

This paper examines the convergence of payoffs and strategies in Erev and Roth's model of
reinforcement learning. When all players use this rule it eliminates iteratively dominated …

被引用次数：269 相关文章所有 10 个版本

[PDF] springer.com

Limit theorems for triangular urn schemes

S Janson - Probability Theory and Related Fields, 2006 - Springer

We study a generalized Pólya urn with balls of two colours and a triangular replacement
matrix; the urn is not required to be balanced. We prove limit theorems describing the …

被引用次数：185 相关文章所有 14 个版本

[PDF] psu.edu

Generalizations of Polya's urn problem

F Chung, S Handjani, D Jungreis - Annals of combinatorics, 2003 - Springer

We consider generalizations of the classical Polya urn problem: Given finitely many bins
each containing one ball, suppose that additional balls arrive one at a time. For each new …

被引用次数：138 相关文章所有 14 个版本

[HTML] sciencedirect.com

[HTML][HTML] Learning to signal: Analysis of a micro-level reinforcement model

R Argiento, R Pemantle, B Skyrms, S Volkov - Stochastic processes and …, 2009 - Elsevier

We consider the following signaling game. Nature plays first from the set {1, 2}. Player 1 (the
Sender) sees this and plays from the set {A, B}. Player 2 (the Receiver) sees only Player 1's …

被引用次数：124 相关文章所有 19 个版本

[PDF] springer.com

Self-interacting diffusions

M Benaïm, M Ledoux, O Raimond - Probability theory and related fields, 2002 - Springer

This paper is concerned with a general class of self-interacting diffusions {X t} t≥ 0 living on
a compact Riemannian manifold M. These are solutions to stochastic differential equations …

被引用次数：102 相关文章所有 15 个版本

[PDF] ed.ac.uk

Attainability of boundary points under reinforcement learning

E Hopkins, M Posch - Games and Economic Behavior, 2005 - Elsevier

This paper investigates the properties of the most common form of reinforcement learning
(the “basic model” of Erev and Roth)[Amer. Econ. Rev. 88 (1998) 848–881]. Stochastic …

被引用次数：107 相关文章所有 16 个版本

[PDF] arxiv.org

Range-controlled random walks

L Régnier, O Bénichou, PL Krapivsky - Physical Review Letters, 2023 - APS

We introduce range-controlled random walks with hopping rates depending on the range N,
that is, the total number of previously distinct visited sites. We analyze a one-parameter class …

被引用次数：5 相关文章所有 12 个版本

[PDF] projecteuclid.org

Asymptotics in response-adaptive designs generated by a two-color, randomly reinforced urn

C May, N Flournoy - The Annals of Statistics, 2009 - projecteuclid.org

This paper illustrates asymptotic properties for a response-adaptive design generated by a
two-color, randomly reinforced urn model. The design considered is optimal in the sense …

被引用次数：74 相关文章所有 20 个版本

[PDF] projecteuclid.org

Vertex-reinforced random walk on arbitrary graphs

S Volkov - The Annals of Probability, 2001 - projecteuclid.org

Vertex-reinforced random walk (VRRW), defined by Pemantle, is a random process in a
continuously changing environment which is more likely to visit states it has visited before …

被引用次数：75 相关文章所有 9 个版本

高级搜索

QQ 群