Action selection by reinforcement learning and numerical optimization

US Patent 11,551,165, 2023 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for selecting actions to be performed by an agent interacting with an …