FIELD: data processing.
SUBSTANCE: invention relates to methods and a computer-readable data medium for training neural networks. The method comprises: determining a first tag associated with a current token processed by a neural network, a second tag associated with a previous token processed by the neural network before the current token, and a third tag associated with a next token to be processed by the neural network after the current token; calculating, for a training data sample, a loss function value reflecting first, second and third loss values, represented respectively by a first difference between the first tag and a first label associated with the current word of the training data sample, a second difference between the second tag and a second label associated with the previous word of the training data sample, and a third difference between the third tag and a third label associated with the next word of the training data sample; and tuning one or more parameters of the neural network depending on the loss function value.
EFFECT: improved quality of the markup of input sequences performed by the neural network.
20 cl, 5 dwg
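
The training step summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how the three loss values are combined or how each "difference" is measured, so the weighted sum, the cross-entropy per tag, the finite-difference gradient, and every name below (`tag_loss`, `combined_loss`, `tune`, `NUM_TAGS`) are assumptions made purely for illustration.

```python
import math

NUM_TAGS = 3  # hypothetical tag-set size, chosen only for this sketch


def softmax(logits):
    """Turn raw scores into a probability distribution over tags."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def tag_loss(logits, label):
    """Cross-entropy between a predicted tag distribution and a gold label.

    Assumption: the abstract only speaks of a "difference" between tag and
    label; cross-entropy is one common way to measure it.
    """
    return -math.log(softmax(logits)[label])


def combined_loss(cur_logits, prev_logits, next_logits,
                  cur_label, prev_label, next_label,
                  weights=(1.0, 1.0, 1.0)):
    """Loss reflecting the current, previous and next tag losses.

    Assumption: a weighted sum; the abstract does not fix the combination.
    """
    losses = (
        tag_loss(cur_logits, cur_label),    # first loss: current token
        tag_loss(prev_logits, prev_label),  # second loss: previous token
        tag_loss(next_logits, next_label),  # third loss: next token
    )
    return sum(w * l for w, l in zip(weights, losses))


def loss_from_params(params, labels):
    """Toy 'network': the parameters are simply the three logit vectors."""
    cur, prev, nxt = (params[i * NUM_TAGS:(i + 1) * NUM_TAGS] for i in range(3))
    return combined_loss(cur, prev, nxt, *labels)


def tune(params, labels, lr=0.5, eps=1e-6):
    """One gradient-descent step using forward finite differences."""
    base = loss_from_params(params, labels)
    grads = []
    for i in range(len(params)):
        bumped = list(params)
        bumped[i] += eps
        grads.append((loss_from_params(bumped, labels) - base) / eps)
    return [p - lr * g for p, g in zip(params, grads)]


labels = (1, 0, 2)                # gold labels: current, previous, next
params = [0.0] * (3 * NUM_TAGS)   # uniform predictions for all three tags
loss_before = loss_from_params(params, labels)  # equals 3 * ln(3) here
params = tune(params, labels)
loss_after = loss_from_params(params, labels)
print(loss_before, loss_after)
```

In this toy setting the loss after the update is lower than before it, which is the behaviour the "tuning" step is meant to produce. A real tagger would compute the three tag predictions from shared network parameters and use automatic differentiation rather than finite differences.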
Title | Year | Author | Number |
---|---|---|---|
METHOD FOR CONTROLLING A DIALOGUE AND NATURAL LANGUAGE RECOGNITION SYSTEM IN A PLATFORM OF VIRTUAL ASSISTANTS | 2020 | | RU2759090C1 |
METHOD AND SYSTEM FOR RETRIEVING NAMED ENTITIES | 2020 | | RU2760637C1 |
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION IN THE TEXT | 2019 | | RU2755606C2 |
METHOD AND SYSTEM FOR RECOGNIZING INFORMATION CONSTITUTING TRADE SECRET | 2024 | | RU2841161C1 |
METHOD AND SYSTEM FOR OBTAINING VECTOR PRESENTATIONS OF DATA IN TABLE TAKING INTO ACCOUNT STRUCTURE OF TABLE AND ITS CONTENT | 2024 | | RU2839037C1 |
METHOD OF CALCULATING CLIENT CREDIT RATING | 2019 | | RU2723448C1 |
SYSTEM AND METHOD FOR AUGMENTATION OF THE TRAINING SAMPLE FOR MACHINE LEARNING ALGORITHMS | 2020 | | RU2758683C2 |
METHOD FOR PREDICTION OF DIAGNOSIS BASED ON DATA PROCESSING CONTAINING MEDICAL KNOWLEDGE | 2019 | | RU2723674C1 |
METHOD OF TRAINED RECURRENT NEURAL NETWORK DEBUGGING | 2019 | | RU2715024C1 |
METHOD AND SYSTEM FOR ELIMINATING VULNERABILITIES IN PROGRAM CODE | 2023 | | RU2821220C1 |
Dates
2020-05-18—Published
2018-12-25—Filed