The adaptive k-meteorologists problem and its application to structure learning and feature...

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org

A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …

被引用次数：352 相关文章所有 18 个版本

[HTML] sciencedirect.com

[HTML][HTML] On the necessity of abstraction

G Konidaris - Current opinion in behavioral sciences, 2019 - Elsevier

A generally intelligent agent faces a dilemma: it requires a complex sensorimotor space to
be capable of solving a wide range of problems, but many tasks are only feasible given the …

被引用次数：95 相关文章所有 10 个版本

[PDF] mlr.press

Model-based rl in contextual decision processes: Pac bounds and exponential improvements over model-free approaches

W Sun, N Jiang, A Krishnamurthy… - … on learning theory, 2019 - proceedings.mlr.press

We study the sample complexity of model-based reinforcement learning (henceforth RL) in
general contextual decision processes that require strategic exploration to find a near …

被引用次数：241 相关文章所有 6 个版本

[PDF] academia.edu

[图书][B] Simulation-based optimization

A Gosavi - 2015 - Springer

This book is written for students and researchers in the field of industrial engineering,
computer science, operations research, management science, electrical engineering, and …

被引用次数：804 相关文章所有 11 个版本

[PDF] jmlr.org

[PDF][PDF] Reinforcement Learning in Finite MDPs: PAC Analysis.

AL Strehl, L Li, ML Littman - Journal of Machine Learning Research, 2009 - jmlr.org

We study the problem of learning near-optimal behavior in finite Markov Decision Processes
(MDPs) with a polynomial number of samples. These “PAC-MDP” algorithms include the …

被引用次数：374 相关文章所有 14 个版本

[PDF] springer.com

Knows what it knows: a framework for self-aware learning

L Li, ML Littman, TJ Walsh - … of the 25th international conference on …, 2008 - dl.acm.org

We introduce a learning framework that combines elements of the well-known PAC and
mistake-bound models. The KWIK (knows what it knows) framework was designed …

被引用次数：295 相关文章所有 32 个版本

[PDF] mlr.press

Sample-efficient reinforcement learning in the presence of exogenous information

Y Efroni, DJ Foster, D Misra… - … on Learning Theory, 2022 - proceedings.mlr.press

In real-world reinforcement learning applications the learner's observation space is
ubiquitously high-dimensional with both relevant and irrelevant information about the task at …

被引用次数：22 相关文章所有 3 个版本

[PDF] springer.com

Texplore: real-time sample-efficient reinforcement learning for robots

T Hester, P Stone - Machine learning, 2013 - Springer

The use of robots in society could be expanded by using reinforcement learning (RL) to
allow robots to learn and adapt to new situations online. RL is a paradigm for learning …

被引用次数：181 相关文章所有 19 个版本

[PDF] arxiv.org

Sample complexity of multi-task reinforcement learning

E Brunskill, L Li - arXiv preprint arXiv:1309.6821, 2013 - arxiv.org

Transferring knowledge across a sequence of reinforcement-learning tasks is challenging,
and has a number of important applications. Though there is encouraging empirical …

被引用次数：143 相关文章所有 14 个版本

[PDF] aaai.org

Learning probably approximately complete and safe action models for stochastic worlds

B Juba, R Stern - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org

We consider the problem of learning action models for planning in unknown stochastic
environments that can be defined using the Probabilistic Planning Domain Description …

被引用次数：23 相关文章所有 8 个版本

高级搜索

QQ 群