FIELD: computer technology.
SUBSTANCE: invention relates to a system, a data carrier, and a method for converting a sequence based on attention. The method receives an input sequence having a respective input at each of multiple input positions in the order of input; the input sequence is processed by an encoder neural network to generate a corresponding encoded representation of each of the inputs in the input sequence, with the encoder neural network containing a sequence of one or more encoder subnets, while each encoder subnet is designed to receive a respective encoder subnet input for each of the multiple input positions and generate a corresponding subnet output for each of the multiple input positions, with each encoder subnet comprising an encoder self-attention sublayer that is designed to receive a subnet input for each from multiple input positions, and for each particular input position in the order of input, apply a self-attention mechanism to the encoder subnet inputs at multiple input positions to generate the appropriate output for that particular input position, at the same time, the application of the self-attention mechanism comprises the steps at which: the request from the subnet input is determined at the specified specific input position, the keys are determined that are obtained from the subnet inputs at the multiple input positions, the values obtained from the subnet inputs at the multiple input positions are determined, and the said determined request, keys and values are used for generating the said respective output for a specific input position; and the encoded representations are processed by the decoder neural network to generate an output sequence having a respective output at each of the multiple output positions in the output order.
EFFECT: improving the speed of sequence conversion.
30 cl, 3 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD OF SYNTHESIZING VIDEO FROM INPUT FRAME USING AUTOREGRESSIVE METHOD, USER ELECTRONIC DEVICE AND COMPUTER-READABLE MEDIUM FOR REALIZING SAID METHOD | 2023 |
|
RU2829010C1 |
METHOD FOR GENERATING THREE-DIMENSIONAL POINT CLOUDS | 2020 |
|
RU2745445C1 |
TEXT RECOGNITION USING ARTIFICIAL INTELLIGENCE | 2017 |
|
RU2691214C1 |
METHOD AND DEVICE FOR GENERATING VIDEO CLIP FROM TEXT DESCRIPTION AND SEQUENCE OF KEY POINTS SYNTHESIZED BY DIFFUSION MODEL | 2024 |
|
RU2823216C1 |
IMAGE RECOGNITION SYSTEM: BEORG SMART VISION | 2020 |
|
RU2777354C2 |
ELIMINATING IMAGE BLUR | 2020 |
|
RU2742346C1 |
METHOD AND SERVER FOR PERFORMING CONTEXT-SENSITIVE TRANSLATION | 2021 |
|
RU2812301C2 |
ANALOGUE HARDWARE IMPLEMENTATION OF NEURAL NETWORKS | 2020 |
|
RU2796649C2 |
METHOD OF CONTROLLING ROBOTIC VEHICLE | 2021 |
|
RU2808469C2 |
METHOD FOR IDENTIFYING THE USER WHO REQUESTED AN ELECTRONIC DOCUMENT BY THE RECEIVED FRAGMENT OF THE MARKED VERSION OF THE ELECTRONIC DOCUMENT AND THE STAGE OF MARKING THE ORIGINAL VERSION OF THE ELECTRONIC DOCUMENT REQUESTED BY THE USER | 2021 |
|
RU2771747C1 |
Authors
Dates
2021-06-21—Published
2018-05-23—Filed