TRAINING NEURAL NETWORKS USING LOSS FUNCTIONS REFLECTING RELATIONSHIPS BETWEEN NEIGHBOURING TOKENS

Russian patent published in 2020 - IPC G06N3/02

Abstract RU 2721190 C1

FIELD: data processing.

SUBSTANCE: invention relates to methods and computer-readable data media for training neural networks. Method comprises: determining a first tag associated with a current token being processed by a neural network, a second tag associated with a previous token, processed by the neural network before the current token, and a third tag associated with a next token, to be processed by the neural network after the current token; calculating, for a training data sample, a loss function value reflecting a first, a second and a third loss value, where the first loss value is the difference between the first tag and a first label associated with the current word of the training data sample, the second loss value is the difference between the second tag and a second label associated with the previous word of the training data sample, and the third loss value is the difference between the third tag and a third label associated with the next token of the training data sample; and adjusting one or more parameters of the neural network depending on the loss function value.
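
The combined loss described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how each "difference" between a tag and its label is measured or how the three loss values are combined, so this sketch assumes cross-entropy over predicted tag probabilities and a weighted sum. The names `cross_entropy`, `neighbour_aware_loss` and `weights` are hypothetical.

```python
import math


def cross_entropy(probs, label):
    """Negative log-probability that the model assigns to the correct tag.

    Assumption: the 'difference' between a predicted tag and its label
    is measured with cross-entropy; the abstract does not specify this.
    """
    return -math.log(max(probs[label], 1e-12))


def neighbour_aware_loss(pred_prev, pred_cur, pred_next,
                         label_prev, label_cur, label_next,
                         weights=(1.0, 1.0, 1.0)):
    """Combine the losses for the previous, current and next token.

    Each pred_* is a list of tag probabilities predicted while the
    network processes the current token; each label_* is the index of
    the correct tag taken from the training data sample.
    Assumption: the three loss values are combined as a weighted sum.
    """
    w_prev, w_cur, w_next = weights
    return (w_cur * cross_entropy(pred_cur, label_cur)
            + w_prev * cross_entropy(pred_prev, label_prev)
            + w_next * cross_entropy(pred_next, label_next))


# Hypothetical predictions over three possible tags (indices 0, 1, 2).
loss = neighbour_aware_loss(
    pred_prev=[0.7, 0.2, 0.1],
    pred_cur=[0.1, 0.8, 0.1],
    pred_next=[0.3, 0.3, 0.4],
    label_prev=0, label_cur=1, label_next=2,
)
print(loss)
```

In a real training loop, the parameters of the network would then be adjusted to reduce this value, for example by gradient descent.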

EFFECT: technical result is improved quality of the mark-up of input sequences performed by the neural network.

20 cl, 5 dwg

Similar patents RU2721190C1

Title	Year	Author	Number
METHOD FOR CONTROLLING A DIALOGUE AND NATURAL LANGUAGE RECOGNITION SYSTEM IN A PLATFORM OF VIRTUAL ASSISTANTS	2020	Ashmanov Stanislav Igorevich, Sukhachev Pavel Sergeevich, Zorkij Fedor Kirillovich	RU2759090C1
METHOD AND SYSTEM FOR RETRIEVING NAMED ENTITIES	2020	Emelyanov Anton Aleksandrovich	RU2760637C1
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION IN THE TEXT	2019	Terenin Aleksej Alekseevich, Kotova Margarita Aleksandrovna	RU2755606C2
METHOD AND SYSTEM FOR RECOGNIZING INFORMATION CONSTITUTING TRADE SECRET	2024	Babak Nikita Grigorevich, Belorybkin Leonid Yurevich, Garbuzov Georgij Valerevich, Denisov Vitalij Igorevich, Terenin Aleksej Alekseevich, Shabrova Anastasiya Igorevna	RU2841161C1
METHOD AND SYSTEM FOR OBTAINING VECTOR PRESENTATIONS OF DATA IN TABLE TAKING INTO ACCOUNT STRUCTURE OF TABLE AND ITS CONTENT	2024	Volkov Maksim Aleksandrovich	RU2839037C1
METHOD OF CALCULATING CLIENT CREDIT RATING	2019	Babaev Dmitrij Leonidovich, Umerenkov Dmitrij Evgenevich, Savchenko Maksim Sergeevich	RU2723448C1
SYSTEM AND METHOD FOR AUGMENTATION OF THE TRAINING SAMPLE FOR MACHINE LEARNING ALGORITHMS	2020	Shavrina Tatyana Olegovna	RU2758683C2
METHOD FOR PREDICTION OF DIAGNOSIS BASED ON DATA PROCESSING CONTAINING MEDICAL KNOWLEDGE	2019	Tarasov Denis Stanislavovich	RU2723674C1
METHOD OF TRAINED RECURRENT NEURAL NETWORK DEBUGGING	2019	Zharov Yaroslav Maksimovich, Korzhenkov Denis Mikhajlovich	RU2715024C1
METHOD AND SYSTEM FOR ELIMINATING VULNERABILITIES IN PROGRAM CODE	2023	Vyshegorodtsev Kirill Evgenevich, Kuzmin Aleksandr Mikhajlovich	RU2821220C1

Authors

Eugene Indenbom

Daniil Anastasiev

Dates

Published: 2020-05-18

Filed: 2018-12-25