IDENTIFICATION OF WRITING SYSTEMS USED IN DOCUMENTS Russian patent published in 2023 - IPC G06N3/02 G06F17/16 G06F40/20 G06V30/10 

Abstract RU 2792743 C1

FIELD: computer systems.

SUBSTANCE: present invention relates primarily to computer systems and, in particular, to systems and methods for identifying writing systems used in documents. The technical result consists in increasing the efficiency of optical character recognition by applying a neural network to image fragments, reducing each image fragment to the size of the input layer of the network, in order to reduce the requirements for computing resources. The technical result is achieved due to the following. The method includes: obtaining an image of a document; splitting the image into fragments; generating probability vectors by the means of a neural network containing a plurality of numerical elements, and each numerical element reflects the probability of an image fragment containing text associated with the corresponding writing system; calculating an aggregated probability vector, each numeric element of the aggregated probability vector reflecting a probability of an image containing text associated with a writing system; and given the determination that the maximum numerical element of the aggregated probability vector exceeds a certain threshold value, it is concluded that the document image contains one or more characters associated with the corresponding writing system.

EFFECT: increase in the efficiency of optical character recognition by applying a neural network to image fragments.

20 cl, 5 dwg

Similar patents RU2792743C1

Title Year Author Number
SYMBOLS RECOGNITION WITH THE USE OF ARTIFICIAL INTELLIGENCE 2017
  • Chulinin Yurij Georgievich
RU2661750C1
HANDWRITING RECOGNITION USING NEURAL NETWORKS 2020
  • Andrey Upshinskiy
RU2757713C1
OPTICAL CHARACTER RECOGNITION USING SPECIALIZED CONFIDENCE FUNCTIONS, IMPLEMENTED ON THE BASIS OF NEURAL NETWORKS 2018
  • Aleksey Alekseevich Zhuravlev
RU2703270C1
NEURAL NETWORK TRAINING BY MEANS OF SPECIALIZED LOSS FUNCTIONS 2018
  • Aleksey Alekseevich Zhuravlev
RU2707147C1
TEXT RECOGNITION USING ARTIFICIAL INTELLIGENCE 2017
  • Orlov Nikita Konstantinovich
  • Rybkin Vladimir Yurevich
  • Anisimovich Konstantin Vladimirovich
  • Davletshin Azat Ajdarovich
RU2691214C1
IDENTIFICATION OF FIELDS ON AN IMAGE USING ARTIFICIAL INTELLIGENCE 2018
  • Kalenkov Maksim Petrovich
RU2695489C1
OPTICAL CHARACTER RECOGNITION BY MEANS OF COMBINATION OF NEURAL NETWORK MODELS 2020
  • Konstantin Anisimovich
  • Alexey Zhuravlev
RU2768211C1
METHOD FOR RECOGNITION OF TEXT IN IMAGES OF DOCUMENTS 2021
  • Gordeev Dmitrij Vladimirovich
  • Kondratev Kirill Andreevich
  • Ostrovskij Konstantin Igorevich
RU2768544C1
PARENTAL CONTROL SYSTEMS AND METHODS FOR DETECTING THE DISCLOSURE OF CONFIDENTIAL INFORMATION 2020
  • Minea Cristian
  • Ion Cristian
  • Miron Adrian
  • Zavoiu Viorel
  • Holban Liviu A.
  • Bugoiu Bogdan
RU2796490C2
IDENTIFICATION OF BLOCKS OF RELATED WORDS IN DOCUMENTS OF COMPLEX STRUCTURE 2019
  • Stanislav Semenov
RU2765884C2

RU 2 792 743 C1

Authors

Ivan Zagaynov

Stanislav Semenov

Dmitriy Solntsev

Alexey Kalyuzhniy

Dates

2023-03-23Published

2021-11-23Filed