FIELD: machine learning.
SUBSTANCE: invention relates to a method and system for machine learning with support, i. e. the formation of an algorithm for the purposeful behavior of the system with the maximum expected long-term gain based on external supporting signals. A method for step-by-step learning of increasingly complex and time-consuming behavioral skills and their use for drawing up and correcting long-term plans is proposed. Purposeful behavior is formed by a hierarchical learning system, in which each hierarchical level is responsible for its own time scale of behavior.
EFFECT: technical result is the reduction in training time of the system.
8 cl, 5 dwg
Title | Year | Author | Number |
---|---|---|---|
COMPUTER SYSTEM AND METHOD FOR DETECTING MALWARE USING MACHINE LEARNING | 2021 |
|
RU2802860C1 |
METHOD AND APPARATUS FOR ADAPTIVE AUTOMATED CONTROL OF A HEATING, VENTILATION AND AIR CONDITIONING SYSTEM | 2021 |
|
RU2784191C1 |
OPTICAL CHARACTER RECOGNITION BY MEANS OF COMBINATION OF NEURAL NETWORK MODELS | 2020 |
|
RU2768211C1 |
METHOD FOR CREATING CONTROLLERS FOR CONTROLLING WALKING ROBOTS BASED ON REINFORCEMENT LEARNING | 2022 |
|
RU2816639C1 |
METHOD AND SYSTEM FOR TRAINING CHATBOT SYSTEM | 2023 |
|
RU2820264C1 |
METHOD AND DEVICE FOR ENTROPY CODING USING HIERARCHICAL DATA UNIT AND METHOD AND APPARATUS FOR DECODING | 2012 |
|
RU2597494C2 |
LAYER ALIGNMENT METHOD IN ENCODED VIDEO STREAM | 2020 |
|
RU2803890C1 |
METHOD AND DEVICE FOR ENTROPIC CODING USING HIERARCHICAL DATA UNIT AND METHOD AND DEVICE FOR DECODING | 2012 |
|
RU2635893C1 |
PROTECTION OF WEB APPLICATIONS WITH INTELLIGENT NETWORK SCREEN WITH AUTOMATIC APPLICATION MODELING | 2017 |
|
RU2659482C1 |
DEVICE AND METHOD FOR HIP JOINT DIAGNOSIS | 2022 |
|
RU2795658C1 |
Authors
Dates
2021-09-23—Published
2019-06-20—Filed