OPTICAL CHARACTER RECOGNITION BY MEANS OF COMBINATION OF NEURAL NETWORK MODELS

Russian patent published in 2022 - IPC G06V10/40 G06V30/18 G06N3/02

Abstract RU 2768211 C1

FIELD: physics.

SUBSTANCE: the invention relates to a text recognition method and system. In the method, a computer system obtains an image containing text; extracts a plurality of features from the image using a feature extraction unit; applies a first decoder to the plurality of features to generate a first intermediate output, where the first intermediate output is a hypothesis of a sequence of text characters; applies a second decoder to the plurality of features to generate a second intermediate output, where the feature extraction unit is shared by the first and second decoders; determines, based on a language model, a first quality metric value for the first intermediate output and a second quality metric value for the second intermediate output; and, in response to determining that the first quality metric value is greater than the second quality metric value, selects the first intermediate output to represent the text.

EFFECT: high efficiency and accuracy of text recognition.

20 cl, 9 dwg
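
The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function `recognize`, the toy decoders, the vocabulary-based `toy_quality` metric, and the fallback to the second output when the first does not score higher are all assumptions made for illustration. The only points taken from the abstract are that features are extracted once and shared by both decoders, and that a language-model-based quality metric decides which hypothesis is selected.

```python
from typing import Callable, List

# Hypothetical type aliases; the abstract does not specify data formats.
Features = List[float]
Decoder = Callable[[Features], str]


def recognize(
    image: List[float],
    extract_features: Callable[[List[float]], Features],
    first_decoder: Decoder,
    second_decoder: Decoder,
    quality: Callable[[str], float],
) -> str:
    """Pick between two decoder hypotheses using a quality metric."""
    # Features are computed once and shared by both decoders.
    features = extract_features(image)
    first_output = first_decoder(features)
    second_output = second_decoder(features)
    # The abstract covers only the case where the first score is greater.
    # Returning the second output otherwise is an assumption of this sketch.
    if quality(first_output) > quality(second_output):
        return first_output
    return second_output


# Toy stand-ins (assumptions): a real system would use neural networks
# for the extractor and decoders, and a trained language model for scoring.
VOCABULARY = {"optical", "character", "recognition"}


def toy_extract(image: List[float]) -> Features:
    return [pixel / 255.0 for pixel in image]


def toy_first_decoder(features: Features) -> str:
    return "optical character recognition"


def toy_second_decoder(features: Features) -> str:
    return "optica1 character recogniti0n"


def toy_quality(text: str) -> float:
    """Fraction of words found in a small vocabulary."""
    words = text.lower().split()
    if not words:
        return 0.0
    return sum(word in VOCABULARY for word in words) / len(words)


result = recognize(
    [0.0, 128.0, 255.0],
    toy_extract,
    toy_first_decoder,
    toy_second_decoder,
    toy_quality,
)
```

With these toy components, the first hypothesis scores higher under `toy_quality` because all of its words are in the vocabulary, so `recognize` returns it. Swapping the two decoders gives the same text, since the choice depends on the scores and not on decoder order.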

Similar patents RU2768211C1

Title Year Author Number
TEXT RECOGNITION USING ARTIFICIAL INTELLIGENCE 2017
  • Orlov Nikita Konstantinovich
  • Rybkin Vladimir Yurevich
  • Anisimovich Konstantin Vladimirovich
  • Davletshin Azat Ajdarovich
RU2691214C1
IMAGE RECOGNITION SYSTEM: BEORG SMART VISION 2020
  • Zuev Georgij Alekseevich
  • Kolosov Anton Aleksandrovich
RU2777354C2
TRAINING NEURAL NETWORKS USING LOSS FUNCTIONS REFLECTING RELATIONSHIPS BETWEEN NEIGHBOURING TOKENS 2018
  • Eugene Indenbom
  • Daniil Anastasiev
RU2721190C1
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION IN THE TEXT 2019
  • Terenin Aleksej Alekseevich
  • Kotova Margarita Aleksandrovna
RU2755606C2
RECOGNITION OF EVENTS ON PHOTOGRAPHS WITH AUTOMATIC SELECTION OF ALBUMS 2020
  • Savchenko Andrey Vladimirovich
RU2742602C1
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION 2016
  • Li, Bo
  • Weiss, Ron J.
  • Bacchiani, Michiel A.U.
  • Sainath, Tara N.
  • Wilson, Kevin William
RU2698153C1
TEACHING LANGUAGE MODELS USING TEXT CORPUSES CONTAINING REALISTIC ERRORS OF OPTICAL CHARACTER RECOGNITION (OCR) 2019
  • Ivan Germanovich Zagaynov
RU2721187C1
METHOD OF CREATING MODEL FOR ANALYSING DIALOGUES BASED ON ARTIFICIAL INTELLIGENCE FOR PROCESSING USER REQUESTS AND SYSTEM USING SUCH MODEL 2019
  • Antyukhov Denis Olegovich
  • Pugachev Leonid Petrovich
RU2730449C2
DETECTING TEXT FIELDS USING NEURAL NETWORKS 2018
  • Zuev, Konstantin Alekseevich
  • Senkevich, Oleg Evgenyevich
  • Golubev, Sergei Vladimirovich
RU2699687C1
METHOD FOR FORMING BRAIN-COMPUTER CONTROL SYSTEM 2019
  • Bobe Anatolij Sergeevich
  • Rashkov Grigorij Vadimovich
  • Fastovets Dmitrij Vladislavovich
RU2704497C1

RU 2 768 211 C1

Authors

Konstantin Anisimovich

Alexey Zhuravlev

Dates

2022-03-23 Published

2020-11-23 Filed