FIELD: neural networks.
SUBSTANCE: group of inventions relates to neural networks and can be used to learn a neural network executor used to select actions to be performed by an agent interacting with the medium. Method includes obtaining a mini-packet of experimental tuples and updating current values of parameters of the neural network executing, containing for each experimental tuple in a mini-packet: processing the training observation and training action in the experimental tuple using the neural network critic to determine the neural network output for the experimental tuple and determining the predicted output of the neural network for the experimental tuple; updating current values of parameters of a neural network-critic using errors between predicted outputs of a neural network and outputs of a neural network and updating current values of parameters of a neural network executor using a neural network-critic.
EFFECT: high efficiency of learning.
13 cl, 4 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD FOR CREATING CONTROLLERS FOR CONTROLLING WALKING ROBOTS BASED ON REINFORCEMENT LEARNING | 2022 |
|
RU2816639C1 |
MODIFIED INTELLIGENT CONTROLLER WITH ADAPTIVE CRITIC | 2013 |
|
RU2523218C1 |
SYSTEM FOR ASSISTANCE IN SETTING OF INSTALLATION OPERATING MODE, TRAINING DEVICE, AND DEVICE FOR ASSISTANCE IN SETTING OF OPERATING MODE | 2019 |
|
RU2780340C2 |
MODIFIED INTELLIGENT CONTROLLER WITH ADAPTIVE CRITICAL ELEMENT | 2020 |
|
RU2755339C1 |
MODIFIED INTELLIGENT CONTROLLER WITH ADAPTIVE CRITIC | 2011 |
|
RU2450336C1 |
REGISTRATION OF MEDICAL MAP | 2017 |
|
RU2745400C2 |
METHOD OF X-RAY EXAMINATION OF SAMPLE | 2023 |
|
RU2812088C1 |
METHOD FOR ADAPTIVE ROUTE SELECTION IN A NODE OF A WIRELESS CELLULAR COMMUNICATION NETWORK, ASSOCIATED APPARATUS FOR IMPLEMENTING THE METHOD FOR ADAPTIVE ROUTE SELECTION, AND ASSOCIATED COMPUTER PROGRAM | 2018 |
|
RU2757663C1 |
PROACTIVE USER INTERFACE CONTAINING EVOLVING AGENT | 2004 |
|
RU2331918C2 |
POWER MANAGEMENT SYSTEM | 2022 |
|
RU2821067C2 |
Authors
Dates
2019-04-23—Published
2016-07-22—Filed