IDENTIFICATION OF BLOCKS OF RELATED WORDS IN DOCUMENTS OF COMPLEX STRUCTURE

Russian patent published in 2022 - IPC G06K9/18, G06K9/62, G06N3/02

Abstract RU 2765884 C2

FIELD: computer technology.

SUBSTANCE: the invention relates to computer technology for identifying blocks of related words in documents using neural networks. A set of words of a document is obtained, where the document contains a first block of related words. A set of vectors representing the set of words is determined. The set of vectors is processed by a first neural network to obtain a set of recalculated vectors whose values are based on the set of vectors. A set of connectedness values is determined, each value corresponding to a connection between two words in the document. Using the set of recalculated vectors and the set of connectedness values, the first block of related words is identified.

EFFECT: more efficient detection of blocks of related words in documents of complex structure, thereby increasing both the accuracy of identification and the processing speed of the computing device.

20 claims, 13 drawings
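
The processing steps summarized in the abstract (word vectors, recalculation by a first neural network, pairwise connectedness values, block identification) can be illustrated with a toy sketch. The abstract does not specify what the vectors contain, the network architecture, how connectedness values are computed, or how blocks are derived from them, so everything below is an assumption made for illustration: the 2-D vectors stand in for word features, `recalculate` is a hypothetical attention-like mixing step rather than a trained network, `connectedness` is a simple distance-based score, and `identify_blocks` groups words by thresholding and union-find.

```python
import math


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def recalculate(vectors, tau=0.05):
    """Hypothetical stand-in for the first neural network (an assumption).

    Each output vector is a softmax-weighted average of all input vectors,
    so every recalculated vector depends on the whole set of vectors.
    """
    out = []
    for v in vectors:
        weights = [math.exp(-sq_dist(v, u) / tau) for u in vectors]
        total = sum(weights)
        out.append([
            sum(w * u[d] for w, u in zip(weights, vectors)) / total
            for d in range(len(v))
        ])
    return out


def connectedness(vectors, tau=0.05):
    """Assumed scoring: a value in (0, 1] for every pair of words."""
    n = len(vectors)
    return [
        [math.exp(-sq_dist(vectors[i], vectors[j]) / tau) for j in range(n)]
        for i in range(n)
    ]


def identify_blocks(words, conn, threshold=0.5):
    """Group words whose pairwise connectedness reaches the threshold."""
    parent = list(range(len(words)))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(words)):
        for j in range(i + 1, len(words)):
            if conn[i][j] >= threshold:
                parent[find(j)] = find(i)

    blocks = {}
    for i, word in enumerate(words):
        blocks.setdefault(find(i), []).append(word)
    return list(blocks.values())


# Made-up example data: two visually separate groups of words on a page.
words = ["Name:", "John", "Smith", "Total", "42.00"]
vectors = [[0.10, 0.1], [0.15, 0.1], [0.20, 0.1], [0.80, 0.9], [0.85, 0.9]]

recalculated = recalculate(vectors)
blocks = identify_blocks(words, connectedness(recalculated))
print(blocks)
```

With this made-up data the first three words end up in one block and the last two in another. A real system would presumably learn both the recalculation and the connectedness scoring from training documents instead of using fixed distance formulas.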

Similar patents RU2765884C2

Title | Year | Author | Number
DETECTING SECTIONS OF TABLES IN DOCUMENTS BY NEURAL NETWORKS USING GLOBAL DOCUMENT CONTEXT | 2019 | Stanislav Semenov | RU2721189C1
IDENTIFICATION OF FIELDS AND TABLES IN DOCUMENTS USING NEURAL NETWORKS USING GLOBAL DOCUMENT CONTEXT | 2019 | Stanislav Semenov | RU2723293C1
RETRIEVING FIELDS USING NEURAL NETWORKS WITHOUT USING TEMPLATES | 2019 | Stanislav Semenov | RU2737720C1
DETECTING AND IDENTIFYING OBJECTS ON IMAGES | 2020 | Ivan Zagaynov; Andrew Zharkov | RU2726185C1
HANDWRITING RECOGNITION USING NEURAL NETWORKS | 2020 | Andrey Upshinskiy | RU2757713C1
DETECTING TEXT FIELDS USING NEURAL NETWORKS | 2018 | Zuev, Konstantin Alekseevich; Senkevich, Oleg Evgenyevich; Golubev, Sergei Vladimirovich | RU2699687C1
NEURAL NETWORK TRAINING BY MEANS OF SPECIALIZED LOSS FUNCTIONS | 2018 | Aleksey Alekseevich Zhuravlev | RU2707147C1
OPTICAL CHARACTER RECOGNITION BY MEANS OF COMBINATION OF NEURAL NETWORK MODELS | 2020 | Konstantin Anisimovich; Alexey Zhuravlev | RU2768211C1
IDENTIFICATION OF FIELDS ON AN IMAGE USING ARTIFICIAL INTELLIGENCE | 2018 | Kalenkov Maksim Petrovich | RU2695489C1
FUZZY SEARCH USING WORD FORMS FOR WORKING WITH BIG DATA | 2021 | Stanislav Semenov | RU2768233C1

RU 2 765 884 C2

Authors

Stanislav Semenov

Dates

Published: 2022-02-04

Filed: 2019-12-17