Training action selection neural networks using leave-one-out-updates

D Mguni, S Ceppi, S Macua… - US Patent App. 17 …, 2021 - Google Patents

(57) ABSTRACT A machine learning system comprises: a set of agents, each having
associated processing circuitry and associated memory circuitry, the associated memory …

被引用次数：8 相关文章所有 2 个版本

Reinforcement learning by sharing individual data within dynamic groups

CY Ma, ZH Wang, S Zhao, L Zhang - US Patent 12,026,610, 2024 - Google Patents

Methods and systems for reinforcement learning with dynamic agent grouping include
gathering information at a first agent using one or more sensors. Shared information is …

被引用次数：2 相关文章所有 2 个版本

[PDF] googleapis.com

Determining control policies by minimizing the impact of delusion

T Lu, DE Schuurmans, CE Boutilier - US Patent App. 17/289,514, 2021 - Google Patents

BACKGROUND [0002] This specification relates to reinforcement learn ing.[0003] In a
reinforcement learning system, an agent inter acts with an environment by performing …

被引用次数：2 相关文章所有 2 个版本

[PDF] googleapis.com

Trainable agent for traversing user interface

B Daei, PR Ghita, I Doumenc, J Gill - US Patent App. 16/940,854, 2022 - Google Patents

An example method of traversing a user interface of an interactive video game by a trainable
agent includes: iden tifying a current observable state of an interactive video game; …

被引用次数：1 相关文章所有 2 个版本

[PDF] googleapis.com

Reinforcement learning using obfuscated environment models

JVW Reynders III - US Patent 11,144,847, 2021 - Google Patents

Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for training an action selection system used to select actions to be …

被引用次数：1 相关文章所有 2 个版本

[PDF] googleapis.com

Method and system of personalized blending for content recommendation

R Shen, K Tsioutsiouliklis, D Kim, Y Ma… - US Patent App. 17 …, 2022 - Google Patents

The present teaching relates to personalized content recommendation. A webpage is
contrasted for a user having a plurality of slots each of which is to be allocated with a content …

Determining action selection policies of an execution device

H Li, L Song - US Patent 11,204,803, 2021 - Google Patents

Johanson et al.,“Efficient Nash Equilibrium Approximation through Monte Carlo
Counterfacutal Regret Minimization,” Conference: Autonomous Agents and Multiagent …

Method and apparatus for reinforcement machine learning

H Kim, ME Kim, S Kim, YS Son… - US Patent …, 2024 - Google Patents

A method and an apparatus for exclusive reinforcement learning are provided, comprising:
collecting information of states of an environment through the communication interface and …

被引用次数：1 相关文章所有 4 个版本

[PDF] googleapis.com

Environment prediction using reinforcement learning

D Silver, T Schaul, M Hessel… - US Patent 12,141,677, 2024 - Google Patents

Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for prediction of an outcome related to an environment. In one aspect, a …

Systems and methods for accelerating model training in machine learning

FAT Abad, J Goodsitt, A Walters, R Farivar… - US Patent …, 2022 - Google Patents

Abstract Systems and methods are provided for training a model using machine learning. An
exemplary method may include providing, by the model in a training session, an action to an …

高级搜索

QQ 群