Incentive control for multi-agent systems

D Mguni, S Ceppi, S Macua… - US Patent App. 17 …, 2021 - Google Patents
(57) ABSTRACT A machine learning system comprises: a set of agents, each having
associated processing circuitry and associated memory circuitry, the associated memory …

Reinforcement learning by sharing individual data within dynamic groups

CY Ma, ZH Wang, S Zhao, L Zhang - US Patent 12,026,610, 2024 - Google Patents
Methods and systems for reinforcement learning with dynamic agent grouping include
gathering information at a first agent using one or more sensors. Shared information is …

Determining control policies by minimizing the impact of delusion

T Lu, DE Schuurmans, CE Boutilier - US Patent App. 17/289,514, 2021 - Google Patents
BACKGROUND [0002] This specification relates to reinforcement learn ing.[0003] In a
reinforcement learning system, an agent inter acts with an environment by performing …

Trainable agent for traversing user interface

B Daei, PR Ghita, I Doumenc, J Gill - US Patent App. 16/940,854, 2022 - Google Patents
An example method of traversing a user interface of an interactive video game by a trainable
agent includes: iden tifying a current observable state of an interactive video game; …

Reinforcement learning using obfuscated environment models

JVW Reynders III - US Patent 11,144,847, 2021 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for training an action selection system used to select actions to be …

Method and system of personalized blending for content recommendation

R Shen, K Tsioutsiouliklis, D Kim, Y Ma… - US Patent App. 17 …, 2022 - Google Patents
The present teaching relates to personalized content recommendation. A webpage is
contrasted for a user having a plurality of slots each of which is to be allocated with a content …

Determining action selection policies of an execution device

H Li, L Song - US Patent 11,204,803, 2021 - Google Patents
Johanson et al.,“Efficient Nash Equilibrium Approximation through Monte Carlo
Counterfacutal Regret Minimization,” Conference: Autonomous Agents and Multiagent …

Method and apparatus for reinforcement machine learning

H Kim, ME Kim, S Kim, YS Son… - US Patent …, 2024 - Google Patents
A method and an apparatus for exclusive reinforcement learning are provided, comprising:
collecting information of states of an environment through the communication interface and …

Environment prediction using reinforcement learning

D Silver, T Schaul, M Hessel… - US Patent 12,141,677, 2024 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for prediction of an outcome related to an environment. In one aspect, a …

Systems and methods for accelerating model training in machine learning

FAT Abad, J Goodsitt, A Walters, R Farivar… - US Patent …, 2022 - Google Patents
Abstract Systems and methods are provided for training a model using machine learning. An
exemplary method may include providing, by the model in a training session, an action to an …