CONTINUOUS CONTROL BY DEEP LEARNING AND REINFORCEMENT Russian patent published in 2019 - IPC G06N3/08 

Abstract RU 2686030 C1

FIELD: neural networks.

SUBSTANCE: group of inventions relates to neural networks and can be used to learn a neural network executor used to select actions to be performed by an agent interacting with the medium. Method includes obtaining a mini-packet of experimental tuples and updating current values of parameters of the neural network executing, containing for each experimental tuple in a mini-packet: processing the training observation and training action in the experimental tuple using the neural network critic to determine the neural network output for the experimental tuple and determining the predicted output of the neural network for the experimental tuple; updating current values of parameters of a neural network-critic using errors between predicted outputs of a neural network and outputs of a neural network and updating current values of parameters of a neural network executor using a neural network-critic.

EFFECT: high efficiency of learning.

13 cl, 4 dwg

Similar patents RU2686030C1

Title Year Author Number
METHOD FOR CREATING CONTROLLERS FOR CONTROLLING WALKING ROBOTS BASED ON REINFORCEMENT LEARNING 2022
  • Babaev Azer Kakhraman Ogly
  • Volchenkov Andrej Valerevich
  • Gorbachev Roman Aleksandrovich
  • Davydenko Egor Viktorovich
  • Dorzhieva Ekaterina Matveevna
  • Litvinenko Vladimir Viktorovich
  • Minashina Inna Konstantinovna
  • Sytnik Kirill Igorevich
RU2816639C1
MODIFIED INTELLIGENT CONTROLLER WITH ADAPTIVE CRITIC 2013
  • Kljuchko Vladimir Ignat'Evich
  • Shumkov Evgenij Aleksandrovich
  • Karniz'Jan Roman Oganesovich
RU2523218C1
SYSTEM FOR ASSISTANCE IN SETTING OF INSTALLATION OPERATING MODE, TRAINING DEVICE, AND DEVICE FOR ASSISTANCE IN SETTING OF OPERATING MODE 2019
  • Yasui Takehito
  • Ikawa Shizuka
  • Toki Akifumi
  • Sogabe Masaru
  • Suwa Yusuke
RU2780340C2
MODIFIED INTELLIGENT CONTROLLER WITH ADAPTIVE CRITICAL ELEMENT 2020
  • Shumkov Evgenij Aleksandrovich
RU2755339C1
MODIFIED INTELLIGENT CONTROLLER WITH ADAPTIVE CRITIC 2011
  • Kljuchko Vladimir Ignat'Evich
  • Shumkov Evgenij Aleksandrovich
RU2450336C1
REGISTRATION OF MEDICAL MAP 2017
  • Bresh Erik
  • Menkovski Vlado
RU2745400C2
METHOD OF X-RAY EXAMINATION OF SAMPLE 2023
  • Mareev Evgenij Igorevich
  • Minaev Nikita Vladimirovich
  • Garmatina Alena Andreevna
  • Dymshits Yurij Meerovich
  • Dyachkova Irina Gennadevna
  • Koldaev Vladimir Valerevich
  • Arsenev Andrej Sergeevich
  • Asadchikov Viktor Evgenevich
RU2812088C1
METHOD FOR ADAPTIVE ROUTE SELECTION IN A NODE OF A WIRELESS CELLULAR COMMUNICATION NETWORK, ASSOCIATED APPARATUS FOR IMPLEMENTING THE METHOD FOR ADAPTIVE ROUTE SELECTION, AND ASSOCIATED COMPUTER PROGRAM 2018
  • Dugaev, Dmitrii
  • Siemens, Eduard
RU2757663C1
PROACTIVE USER INTERFACE CONTAINING EVOLVING AGENT 2004
  • Li Dzong-Goo
  • Toledano Ehjal
  • Linder Natan
  • Ben-Jair Ran
  • Ehjzenberg Jariv
RU2331918C2
POWER MANAGEMENT SYSTEM 2022
  • Akhundov Fuad Namikovich
  • Kazakov Grigorij Borisovich
  • Semenov Sergej Sergeevich
  • Kireev Sergej Vyacheslavovich
  • Ivanov Vyacheslav Aleksandrovich
RU2821067C2

RU 2 686 030 C1

Authors

Lillicrap, Timothy Paul

Hunt, Jonathan James

Pritzel, Alexander

Heess, Nicolas Manfred Otto

Erez, Tom

Tassa, Yuval

Silver, David

Wierstra, Daniel Pieter

Dates

2019-04-23Published

2016-07-22Filed