FlowPG: action-constrained policy gradient with normalizing flows

J Brahmanage, J Ling, A Kumar - Advances in Neural …, 2024 - proceedings.neurips.cc
Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-
critical and resource-allocation related decision making problems. A major challenge in …

Tractable Boolean and arithmetic circuits

A Darwiche - Neuro-Symbolic Artificial Intelligence: The State of …, 2021 - ebooks.iospress.nl
Tractable Boolean and arithmetic circuits have been studied extensively in AI for over two
decades now. These circuits were initially proposed as “compiled objects,” meant to facilitate …

Knowledge compilation for constrained combinatorial action spaces in reinforcement learning

J Ling, ML Schuler, A Kumar, P Varakantham - 2023 - ink.library.smu.edu.sg
Action-constrained reinforcement learning (ACRL), where any action taken in a state must
satisfy given constraints, has several practical applications such as resource allocation in …

Graph neural network based agent in Google Research Football

J Liu, Y Niu, Y Shi, J Zhu - 2nd International Conference on …, 2022 - spiedigitallibrary.org
Deep neural networks (DNN) can approximate value functions or policies for reinforcement
learning, which makes the reinforcement learning algorithms more powerful. However, some …

多智能体路径规划综述.

刘志飞, 舄曹, 蔦赖, 陈希亮… - Journal of Computer …, 2022 - search.ebscohost.com
多智能体路径规划(multi-agent path finding, MAPF) 是为多个智能体规划路径的问题,
关键约束是多个智能体同时沿着规划路径行进而不会发生冲突. MAPF 在物流, 军事 …

[HTML][HTML] Formalizing Methods for Propositional Model Counting and Enumeration/submitted by Dipl.-Inform. Sibylle Rosa Natalina Möhle-Rotondi

SRN Möhle-Rotondi - 2022 - epub.jku.at
Many real-world problems, such as probabilistic reasoning, can be formulated as the task of
counting the models of a propositional formula, called# SAT. A model of a formula is an …

FlowPG: Action-constrained policy gradient with normalizing flows

BJC THILAKARATHNA, J LING, A KUMAR - 2023 - ink.library.smu.edu.sg
Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-
critical and resource-allocation related decision making problems. A major challenge in …

[PDF][PDF] FlowPG: Action-constrained policy gradient with normalizing flows.(2023)

BJC THILAKARATHNA, J LING… - Proceedings of the 37th … - ink.library.smu.edu.sg
Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-
critical and resource-allocation related decision making problems. A major challenge in …