FIELD: neural networks.
SUBSTANCE: the invention relates to a method and a server for training an attention-based neural network (ANN) to generate a text output sequence. The technical result is the ability to generate a relevant text output sequence that summarizes the content of a plurality of content resources. In the method, the server inputs a training request and a training text input sequence into the encoder sub-network. The training text input sequence (a) is formed as a sequence of training content snippets and (b) is divided into a sequence of input groups, where a given input group is associated with a corresponding training content snippet and contains words from that snippet. Using the encoder sub-network, the server generates an encoded representation of the training text input sequence; this includes generating attention outputs for the respective words of the training text input sequence by applying an attention restriction mask to the sequence, the attention outputs being used to form the encoded representation. When an attention output is generated for a word from a given input group, the attention restriction mask allows attending only to words from that input group, so that the attention output is based on the context of that input group and not on the contexts of the other input groups in the training text input sequence. Using the decoder sub-network, the server generates a decoded representation of the training text input sequence, which corresponds to a predicted text output sequence. The server then generates a penalty score for the training iteration by comparing the predicted text output sequence with a predetermined text output sequence that represents a predetermined response to the training request, and adjusts the ANN based on the penalty score.
EFFECT: ability to generate a relevant text output sequence that summarizes the content of a plurality of content resources.
32 cl, 7 dwg
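The attention restriction mask described in the abstract can be illustrated with a minimal NumPy sketch. It is not the patented implementation: the function names `build_group_mask` and `masked_self_attention`, the toy group assignment, and the use of a single attention head without learned query/key/value projections are all assumptions made for illustration. The training request, which the abstract also feeds into the encoder, is left out because the abstract does not say how the mask treats it.

```python
import numpy as np

def build_group_mask(group_ids):
    """Boolean (n, n) mask: entry [i, j] is True when word i may attend to word j,
    i.e. when both words belong to the same input group (same content snippet)."""
    g = np.asarray(group_ids)
    return g[:, None] == g[None, :]

def masked_self_attention(x, mask):
    """Single-head scaled dot-product self-attention restricted by `mask`.
    Assumption: identity projections, so queries, keys and values all equal x."""
    d = x.shape[-1]
    scores = x @ x.T / np.sqrt(d)
    # Positions outside the word's own input group get -inf, hence zero weight.
    scores = np.where(mask, scores, -np.inf)
    # Numerically stable softmax; each row keeps at least its diagonal entry.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ x, weights

# Hypothetical input: 8 words split into three input groups (snippets 0, 1, 2).
group_ids = [0, 0, 0, 1, 1, 2, 2, 2]
rng = np.random.default_rng(0)
x = rng.normal(size=(len(group_ids), 4))  # toy word embeddings

mask = build_group_mask(group_ids)
out, weights = masked_self_attention(x, mask)
```

The mask is block-diagonal, so the attention output for a word depends only on words from its own input group. Changing the embeddings of one snippet leaves the outputs for the other snippets unchanged, which is the context isolation the abstract describes.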
Title | Year | Author | Number |
---|---|---|---|
METHOD AND SYSTEM FOR CHECKING MEDIA CONTENT | 2022 | | RU2815896C2 |
METHOD AND SERVER FOR TRAINING MACHINE LEARNING ALGORITHM IN TRANSLATION | 2020 | | RU2770569C2 |
METHOD AND SERVER FOR PROCESSING TEXT SEQUENCE IN MACHINE PROCESSING TASK | 2020 | | RU2775820C2 |
METHOD AND SERVER FOR DETERMINING TRAINING SET FOR MACHINE LEARNING ALGORITHM (MLA) TRAINING | 2020 | | RU2817726C2 |
METHOD AND SERVER FOR GENERATING EXTENDED REQUEST | 2021 | | RU2813582C2 |
METHODS AND SERVERS FOR DETERMINING METRIC-DEPENDENT THRESHOLDS USED WITH MULTIPLE NESTED METRICS FOR BINARY CLASSIFICATION OF A DIGITAL OBJECT | 2020 | | RU2795202C2 |
METHOD AND A COMPUTER DEVICE FOR SELECTING A CURRENT CONTEXT-DEPENDENT RESPONSE FOR THE CURRENT USER REQUEST | 2017 | | RU2693332C1 |
METHOD AND SERVER FOR REPEATED TRAINING OF MACHINE LEARNING ALGORITHM | 2019 | | RU2743932C2 |
METHOD AND SERVER FOR CONVERTING TEXT TO SPEECH | 2020 | | RU2775821C2 |
METHOD AND SERVER FOR RANKING DIGITAL DOCUMENTS IN RESPONSE TO REQUEST | 2020 | | RU2818279C2 |
Dates
2023-06-21—Published
2020-10-06—Filed