CY Ma, ZH Wang, S Zhao, L Zhang - US Patent 12,026,610, 2024 - Google Patents
Methods and systems for reinforcement learning with dynamic agent grouping include gathering information at a first agent using one or more sensors. Shared information is …
BACKGROUND [0002] This specification relates to reinforcement learn ing.[0003] In a reinforcement learning system, an agent inter acts with an environment by performing …
B Daei, PR Ghita, I Doumenc, J Gill - US Patent App. 16/940,854, 2022 - Google Patents
An example method of traversing a user interface of an interactive video game by a trainable agent includes: iden tifying a current observable state of an interactive video game; …
JVW Reynders III - US Patent 11,144,847, 2021 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection system used to select actions to be …
R Shen, K Tsioutsiouliklis, D Kim, Y Ma… - US Patent App. 17 …, 2022 - Google Patents
The present teaching relates to personalized content recommendation. A webpage is contrasted for a user having a plurality of slots each of which is to be allocated with a content …
H Li, L Song - US Patent 11,204,803, 2021 - Google Patents
Johanson et al.,“Efficient Nash Equilibrium Approximation through Monte Carlo Counterfacutal Regret Minimization,” Conference: Autonomous Agents and Multiagent …
H Kim, ME Kim, S Kim, YS Son… - US Patent …, 2024 - Google Patents
A method and an apparatus for exclusive reinforcement learning are provided, comprising: collecting information of states of an environment through the communication interface and …
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for prediction of an outcome related to an environment. In one aspect, a …
Abstract Systems and methods are provided for training a model using machine learning. An exemplary method may include providing, by the model in a training session, an action to an …