FIELD: computing technology.
SUBSTANCE: disclosed is a method for retrieving named entities from textual information, implemented by at least one computing apparatus, comprising the stages wherein: the textual information is obtained; the text is divided into words; the text is tokenised to obtain a token sequence; a set of vectors is formed by means of a neural network for the obtained token sequence; a vector representation of the token sequence is formed based on the obtained set of vectors; by comparing the indicators of the obtained vector representation of the token sequence with the predefined vector indicators obtained as a result of training of the neural network, named entities are predicted for the vector representation of the token sequence; the named entities obtained at the previous stage are identified by selecting the word mark.
EFFECT: increase in the accuracy of prediction of named entities.
8 cl, 2 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD AND SYSTEM FOR EXTRACTING NAMED ENTITIES | 2021 |
|
RU2823914C2 |
METHOD AND SYSTEM FOR DIGITAL ASSISTANT TEXT GENERATION | 2022 |
|
RU2796208C1 |
METHOD AND SYSTEM FOR GENERATING TEXT | 2023 |
|
RU2817524C1 |
METHOD AND SYSTEM FOR PARAPHRASING TEXT | 2023 |
|
RU2814808C1 |
SYSTEM AND METHOD FOR AUGMENTATION OF THE TRAINING SAMPLE FOR MACHINE LEARNING ALGORITHMS | 2020 |
|
RU2758683C2 |
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION IN THE TEXT | 2019 |
|
RU2755606C2 |
METHOD AND SYSTEM FOR DEPERSONALIZATION OF CONFIDENTIAL DATA | 2022 |
|
RU2804747C1 |
TEXT CLASSIFICATION METHOD AND SYSTEM | 2022 |
|
RU2818693C2 |
METHOD AND SYSTEM FOR DEPERSONALIZATION OF CONFIDENTIAL DATA | 2022 |
|
RU2802549C1 |
TRAINING NEURAL NETWORKS USING LOSS FUNCTIONS REFLECTING RELATIONSHIPS BETWEEN NEIGHBOURING TOKENS | 2018 |
|
RU2721190C1 |
Authors
Dates
2021-11-29—Published
2020-08-31—Filed