IDENTIFICATION OF FIELDS AND TABLES IN DOCUMENTS USING NEURAL NETWORKS USING GLOBAL DOCUMENT CONTEXT
Russian patent published in 2020 - IPC G06N3/02 G06F17/00

Abstract RU 2723293 C1

FIELD: data processing.

SUBSTANCE: the invention relates to means for detecting text fields in documents using neural networks. A plurality of sequences of symbols is obtained from a document containing a plurality of text fields. A plurality of vectors is determined, each vector belonging to one of the sequences of symbols. The vectors are processed by a first neural network to obtain a plurality of recalculated vectors, each derived from the values of the plurality of vectors. An association is determined between a first recalculated vector, which belongs to a first sequence of symbols, and a first text field of the plurality of text fields. An association between the first sequence of symbols and the first text field is then determined based on the association between the first recalculated vector and the first text field.
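
The flow described in the abstract (symbol sequences, vectors, recalculated vectors, field associations) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy character-statistics embedding in `embed`, the attention-style mixing in `recalculate` that stands in for the first neural network, the hand-set `FIELD_WEIGHTS`, and the field names "name" and "amount". The only point it shows is that each recalculated vector depends on all vectors of the document, and that a symbol sequence inherits the field associated with its recalculated vector.

```python
import math

def embed(token):
    """Toy vector for one symbol sequence (assumption): digit share, letter share, length."""
    n = max(len(token), 1)
    digits = sum(c.isdigit() for c in token) / n
    letters = sum(c.isalpha() for c in token) / n
    return [digits, letters, min(n, 10) / 10]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def recalculate(vectors):
    """Stand-in for the first neural network (assumption): append to every
    vector an attention-weighted mix of ALL vectors in the document."""
    out = []
    for q in vectors:
        weights = softmax([dot(q, k) for k in vectors])
        context = [sum(w * v[i] for w, v in zip(weights, vectors))
                   for i in range(len(q))]
        out.append(q + context)  # own features followed by document context
    return out

# Hypothetical hand-set linear scorers, one per text field (not trained).
FIELD_WEIGHTS = {
    "amount": [1.0, 0.0, 0.0, 0.1, 0.0, 0.0],
    "name":   [0.0, 1.0, 0.0, 0.0, 0.1, 0.0],
}

def associate(tokens, recalculated, field_weights):
    """Link each recalculated vector to its best-scoring field, then
    carry that link over to the symbol sequence the vector belongs to."""
    links = []
    for token, vec in zip(tokens, recalculated):
        best = max(field_weights, key=lambda f: dot(vec, field_weights[f]))
        links.append((token, best))
    return links

if __name__ == "__main__":
    tokens = ["Invoice", "1250"]
    recalculated = recalculate([embed(t) for t in tokens])
    print(associate(tokens, recalculated, FIELD_WEIGHTS))
```

In a real system the embedding and both scoring stages would be learned from labelled documents. The fixed weights above only keep the sketch deterministic.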

EFFECT: improved efficiency of data processing.

20 cl, 9 dwg

Similar patents RU2723293C1

Title | Year | Author | Number
DETECTING SECTIONS OF TABLES IN DOCUMENTS BY NEURAL NETWORKS USING GLOBAL DOCUMENT CONTEXT | 2019 | Stanislav Semenov | RU2721189C1
IDENTIFICATION OF BLOCKS OF RELATED WORDS IN DOCUMENTS OF COMPLEX STRUCTURE | 2019 | Stanislav Semenov | RU2765884C2
RETRIEVING FIELDS USING NEURAL NETWORKS WITHOUT USING TEMPLATES | 2019 | Stanislav Semenov | RU2737720C1
DETECTING AND IDENTIFYING OBJECTS ON IMAGES | 2020 | Ivan Zagaynov; Andrew Zharkov | RU2726185C1
DETECTING TEXT FIELDS USING NEURAL NETWORKS | 2018 | Zuev, Konstantin Alekseevich; Senkevich, Oleg Evgenyevich; Golubev, Sergei Vladimirovich | RU2699687C1
IDENTIFICATION OF FIELDS ON AN IMAGE USING ARTIFICIAL INTELLIGENCE | 2018 | Kalenkov Maksim Petrovich | RU2695489C1
METHODS AND SYSTEMS FOR IDENTIFYING FIELDS IN A DOCUMENT | 2020 | Semenov Stanislav Vladimirovich; Lanin Mikhail Olegovich | RU2760471C1
METHODS AND SYSTEMS FOR IDENTIFYING FIELDS IN A DOCUMENT | 2021 | Stanislav Semenov | RU2774653C1
NEURAL NETWORK TRAINING BY MEANS OF SPECIALIZED LOSS FUNCTIONS | 2018 | Aleksey Alekseevich Zhuravlev | RU2707147C1
OPTICAL CHARACTER RECOGNITION USING SPECIALIZED CONFIDENCE FUNCTIONS, IMPLEMENTED ON THE BASIS OF NEURAL NETWORKS | 2018 | Aleksey Alekseevich Zhuravlev | RU2703270C1

RU 2 723 293 C1

Authors

Stanislav Semenov

Dates

Published: 2020-06-09

Filed: 2019-08-29