Integrating knowledge compilation with reinforcement learning for routes

J Brahmanage, J Ling, A Kumar - Advances in Neural …, 2024 - proceedings.neurips.cc

Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-
critical and resource-allocation related decision making problems. A major challenge in …

被引用次数：3 相关文章所有 5 个版本

[PDF] arxiv.org

Tractable Boolean and arithmetic circuits

A Darwiche - Neuro-Symbolic Artificial Intelligence: The State of …, 2021 - ebooks.iospress.nl

Tractable Boolean and arithmetic circuits have been studied extensively in AI for over two
decades now. These circuits were initially proposed as “compiled objects,” meant to facilitate …

被引用次数：19 相关文章所有 5 个版本

[PDF] smu.edu.sg

Knowledge compilation for constrained combinatorial action spaces in reinforcement learning

J Ling, ML Schuler, A Kumar, P Varakantham - 2023 - ink.library.smu.edu.sg

Action-constrained reinforcement learning (ACRL), where any action taken in a state must
satisfy given constraints, has several practical applications such as resource allocation in …

被引用次数：4 相关文章所有 5 个版本

[PDF] spiedigitallibrary.org Full View

Graph neural network based agent in Google Research Football

J Liu, Y Niu, Y Shi, J Zhu - 2nd International Conference on …, 2022 - spiedigitallibrary.org

Deep neural networks (DNN) can approximate value functions or policies for reinforcement
learning, which makes the reinforcement learning algorithms more powerful. However, some …

被引用次数：4 相关文章所有 7 个版本

多智能体路径规划综述.

刘志飞，舄曹，蔦赖，陈希亮… - Journal of Computer …, 2022 - search.ebscohost.com

多智能体路径规划(multi-agent path finding, MAPF) 是为多个智能体规划路径的问题,
关键约束是多个智能体同时沿着规划路径行进而不会发生冲突. MAPF 在物流, 军事 …

被引用次数：2 相关文章

[HTML] jku.at

[HTML][HTML] Formalizing Methods for Propositional Model Counting and Enumeration/submitted by Dipl.-Inform. Sibylle Rosa Natalina Möhle-Rotondi

SRN Möhle-Rotondi - 2022 - epub.jku.at

Many real-world problems, such as probabilistic reasoning, can be formulated as the task of
counting the models of a propositional formula, called# SAT. A model of a formula is an …

FlowPG: Action-constrained policy gradient with normalizing flows

BJC THILAKARATHNA, J LING, A KUMAR - 2023 - ink.library.smu.edu.sg

Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-
critical and resource-allocation related decision making problems. A major challenge in …

[PDF] smu.edu.sg

[PDF][PDF] FlowPG: Action-constrained policy gradient with normalizing flows.(2023)

BJC THILAKARATHNA, J LING… - Proceedings of the 37th … - ink.library.smu.edu.sg

Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-
critical and resource-allocation related decision making problems. A major challenge in …

高级搜索

QQ 群