TRAINING NEURAL NETWORKS USING LOSS FUNCTIONS REFLECTING RELATIONSHIPS BETWEEN NEIGHBOURING TOKENS

Russian patent published in 2020 - IPC G06N3/02

Abstract RU 2721190 C1

FIELD: data processing.

SUBSTANCE: the invention relates to methods and computer-readable storage media for training neural networks. The method comprises: determining a first tag associated with a current token processed by a neural network, a second tag associated with a previous token, which the neural network processed before the current token, and a third tag associated with a next token, which the neural network is to process after the current token; calculating, for a training data sample, a loss function value reflecting a first, a second and a third loss value, represented respectively by a first difference between the first tag and a first label associated with the current token of the training data sample, a second difference between the second tag and a second label associated with the previous token of the training data sample, and a third difference between the third tag and a third label associated with the next token of the training data sample; and adjusting one or more parameters of the neural network depending on the loss function value.

EFFECT: the technical result is an improvement in the quality of the mark-up of input sequences performed by the neural network.

20 claims, 5 drawings
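
The loss described in the abstract can be illustrated with a minimal sketch. The abstract does not say how each "difference" is measured or how the three loss values are combined, so the sketch makes several assumptions that are not taken from the patent: each difference is a softmax cross-entropy, the three values are combined as a weighted sum, the terms for a missing neighbour at the sequence boundaries are skipped, and the result is averaged over positions. The names `cross_entropy`, `neighbour_loss`, and `weights` are hypothetical.

```python
import math

def cross_entropy(scores, gold):
    """Softmax cross-entropy: -log p(gold) for one vector of raw tag scores."""
    top = max(scores)  # subtract the max for numerical stability
    log_norm = top + math.log(sum(math.exp(s - top) for s in scores))
    return log_norm - scores[gold]

def neighbour_loss(prev_scores, cur_scores, next_scores, labels,
                   weights=(1.0, 1.0, 1.0)):
    """Mean over positions of a weighted sum of the three loss values.

    prev_scores[t], cur_scores[t], next_scores[t] are the score vectors
    emitted at position t for the tags of tokens t-1, t and t+1;
    labels[t] is the gold label index of token t.
    """
    w_prev, w_cur, w_next = weights
    total = 0.0
    for t, gold in enumerate(labels):
        # First loss value: predicted tag of the current token vs. its label.
        total += w_cur * cross_entropy(cur_scores[t], gold)
        # Second loss value: tag of the previous token (assumption: skipped at t = 0).
        if t > 0:
            total += w_prev * cross_entropy(prev_scores[t], labels[t - 1])
        # Third loss value: tag of the next token (assumption: skipped at the end).
        if t + 1 < len(labels):
            total += w_next * cross_entropy(next_scores[t], labels[t + 1])
    return total / len(labels)

# Two tokens, two possible tags, gold labels 0 and 1.
labels = [0, 1]
uniform = [[0.0, 0.0], [0.0, 0.0]]
baseline = neighbour_loss(uniform, uniform, uniform, labels)

# Scores that favour the gold tags of the current and neighbouring tokens.
cur = [[5.0, 0.0], [0.0, 5.0]]
prev = [[0.0, 0.0], [5.0, 0.0]]
nxt = [[0.0, 5.0], [0.0, 0.0]]
informed = neighbour_loss(prev, cur, nxt, labels)
```

In this sketch `informed` is smaller than `baseline`, since every score vector that enters the sum favours the correct tag. The final step of the method, adjusting the network parameters depending on the loss value, would in practice be left to a gradient-based optimiser in an automatic-differentiation framework and is not shown here.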

Similar patents RU2721190C1

Title | Year | Authors | Number
METHOD FOR CONTROLLING A DIALOGUE AND NATURAL LANGUAGE RECOGNITION SYSTEM IN A PLATFORM OF VIRTUAL ASSISTANTS | 2020 | Ashmanov Stanislav Igorevich; Sukhachev Pavel Sergeevich; Zorkij Fedor Kirillovich | RU2759090C1
METHOD AND SYSTEM FOR RETRIEVING NAMED ENTITIES | 2020 | Emelyanov Anton Aleksandrovich | RU2760637C1
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION IN THE TEXT | 2019 | Terenin Aleksej Alekseevich; Kotova Margarita Aleksandrovna | RU2755606C2
SYSTEM AND METHOD FOR AUGMENTATION OF THE TRAINING SAMPLE FOR MACHINE LEARNING ALGORITHMS | 2020 | Shavrina Tatyana Olegovna | RU2758683C2
METHOD FOR PREDICTION OF DIAGNOSIS BASED ON DATA PROCESSING CONTAINING MEDICAL KNOWLEDGE | 2019 | Tarasov Denis Stanislavovich | RU2723674C1
METHOD OF CALCULATING CLIENT CREDIT RATING | 2019 | Babaev Dmitrij Leonidovich; Umerenkov Dmitrij Evgenevich; Savchenko Maksim Sergeevich | RU2723448C1
METHOD OF TRAINED RECURRENT NEURAL NETWORK DEBUGGING | 2019 | Zharov Yaroslav Maksimovich; Korzhenkov Denis Mikhajlovich | RU2715024C1
METHOD AND SYSTEM FOR ELIMINATING VULNERABILITIES IN PROGRAM CODE | 2023 | Vyshegorodtsev Kirill Evgenevich; Kuzmin Aleksandr Mikhajlovich | RU2821220C1
METHOD OF CREATING MODEL FOR ANALYSING DIALOGUES BASED ON ARTIFICIAL INTELLIGENCE FOR PROCESSING USER REQUESTS AND SYSTEM USING SUCH MODEL | 2019 | Antyukhov Denis Olegovich; Pugachev Leonid Petrovich | RU2730449C2
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS | 2018 | Indenbom Evgenij Mikhajlovich | RU2686000C1

RU 2 721 190 C1

Authors

Eugene Indenbom

Daniil Anastasiev

Dates

Published: 2020-05-18

Filed: 2018-12-25