- Academic resource search

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Cited by 332 · Related articles · All 9 versions

[PDF] arxiv.org

Deep learning in mobile and wireless networking: A survey

C Zhang, P Patras, H Haddadi - IEEE Communications surveys …, 2019 - ieeexplore.ieee.org

The rapid uptake of mobile devices and the rising popularity of mobile applications and
services pose unprecedented demands on mobile and wireless networking infrastructure …

Cited by 1934 · Related articles · All 8 versions

[PDF] arxiv.org

Describe, explain, plan and select: Interactive planning with large language models enables open-world multi-task agents

Z Wang, S Cai, G Chen, A Liu, X Ma, Y Liang - arXiv preprint arXiv …, 2023 - arxiv.org

We investigate the challenge of task planning for multi-task embodied agents in open-world
environments. Two main difficulties are identified: 1) executing plans in an open-world …

Cited by 294 · Related articles · All 3 versions

[PDF] arxiv.org

Do as I can, not as I say: Grounding language in robotic affordances

M Ahn, A Brohan, N Brown, Y Chebotar… - arXiv preprint arXiv …, 2022 - arxiv.org

Large language models can encode a wealth of semantic knowledge about the world. Such
knowledge could be extremely useful to robots aiming to act upon high-level, temporally …

Cited by 1465 · Related articles · All 2 versions

[PDF] neurips.cc

Video pretraining (VPT): Learning to act by watching unlabeled online videos

B Baker, I Akkaya, P Zhokov… - Advances in …, 2022 - proceedings.neurips.cc

Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for
training models with broad, general capabilities for text, images, and other modalities …

Cited by 284 · Related articles · All 6 versions

[PDF] neurips.cc

Describe, explain, plan and select: Interactive planning with LLMs enables open-world multi-task agents

Z Wang, S Cai, G Chen, A Liu… - Advances in Neural …, 2023 - proceedings.neurips.cc

In this paper, we study the problem of planning in Minecraft, a popular, democratized yet
challenging open-ended environment for developing multi-task embodied agents. We've …

Cited by 73 · Related articles · All 3 versions

[PDF] arxiv.org

JARVIS-1: Open-world multi-task agents with memory-augmented multimodal language models

Z Wang, S Cai, A Liu, Y Jin, J Hou… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Achieving human-like planning and control with multimodal observations in an open world is
a key milestone for more functional generalist agents. Existing approaches can handle …

Cited by 78 · Related articles · All 6 versions

[PDF] arxiv.org

The neuro-symbolic concept learner: Interpreting scenes, words, and sentences from natural supervision

J Mao, C Gan, P Kohli, JB Tenenbaum, J Wu - arXiv preprint arXiv …, 2019 - arxiv.org

We propose the Neuro-Symbolic Concept Learner (NS-CL), a model that learns visual
concepts, words, and semantic parsing of sentences without explicit supervision on any of …

Cited by 863 · Related articles · All 6 versions

A survey of zero-shot learning: Settings, methods, and applications

W Wang, VW Zheng, H Yu, C Miao - ACM Transactions on Intelligent …, 2019 - dl.acm.org

Most machine-learning methods focus on classifying instances whose classes have already
been seen in training. In practice, many applications require classifying instances whose …

Cited by 764 · Related articles · All 2 versions

[PDF] arxiv.org

CALVIN: A benchmark for language-conditioned policy learning for long-horizon robot manipulation tasks

O Mees, L Hermann, E Rosete-Beas… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org

General-purpose robots coexisting with humans in their environment must learn to relate
human language to their perceptions and actions to be useful in a range of daily tasks …

Cited by 205 · Related articles · All 5 versions
