Lifelong learning with a changing action set

G Theocharous, Y Chandak - US Patent 11,501,207, 2022 - Google Patents
Systems and methods are described for a decision-making process that includes an
increasing set of actions, compute a policy function for a Markov decision process (MDP) for …

Adversarial Reinforcement Learning for Procedural Content Generation and Improved Generalization

LM Gisslén, AJ Eakins - US Patent App. 18/474,863, 2024 - Google Patents

Adversarial reinforcement learning for procedural content generation and improved generalization

LM Gisslén, AJ Eakins - US Patent 11,883,746, 2024 - Google Patents
Methods, apparatus and systems are provided for training a first reinforcement-learning (RL)
agent and a second RL agent coupled to a computer game environment using RL …