METHOD FOR RECOGNIZING TEXT INFORMATION FROM GRAPHIC FILE WITH USAGE OF DICTIONARIES AND ADDITIONAL DATA
Russian patent published in 2007 - IPC G06K9/62 G06F17/21 G06F17/27

Abstract RU 2295154 C1

FIELD: recognition of text information from graphic files.

SUBSTANCE: according to the method, the order of access to additional information is set in advance, and a quality estimate is assigned to each type of additional information. Different variants of dividing the image of each selected row into fragments are constructed, and a linear division graph is built for each row fragment. Images of graphic elements are recognized using a classifier, and an estimate is assigned to each recognition variant. A transition is then made from recognition variants of graphic elements to variants of alphabet symbols. For each chain connecting the starting and ending vertices, chains are built that correspond to all recognition variants of the graphic elements and all variants of transition from recognized graphic elements to alphabet symbols. The resulting variants are ranked in decreasing order of recognition quality estimate and processed using information about the position of uppercase and lowercase letters. If recognition of a graphic element yields more than one symbol variant, the variants are processed using the types of additional information successively and/or, when necessary, all types simultaneously. A quality estimate is assigned to each resulting variant, symbol variants with an estimate below a predetermined value are discarded, the remaining variants are sorted by pair-wise comparison, and spaces recognized erroneously at previous stages are additionally corrected.
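
The chain-building and ranking steps can be illustrated with a minimal sketch. It is not the patented implementation: the graph contents, the function names `paths` and `variants`, the use of a mean as the combined estimate, and the fixed dictionary bonus are all assumptions made for illustration. Vertices stand for cut positions in a row image, edges for fragments, and each edge carries hypothetical classifier variants as (symbol, estimate) pairs.

```python
from itertools import product

# Hypothetical linear division graph (illustrative data, not from the patent).
# Key: (start_cut, end_cut); value: classifier variants as (symbol, estimate).
GRAPH = {
    (0, 1): [("c", 0.9), ("e", 0.4)],
    (1, 2): [("l", 0.8), ("i", 0.6)],
    (0, 2): [("d", 0.7)],            # alternative division: one wider fragment
    (2, 3): [("o", 0.9)],
    (3, 4): [("g", 0.85), ("q", 0.5)],
}


def paths(graph, start, end):
    """Yield every chain of edges connecting the start and end vertices."""
    if start == end:
        yield []
        return
    for (a, b) in graph:
        if a == start:
            for rest in paths(graph, b, end):
                yield [(a, b)] + rest


def variants(graph, start, end, min_estimate=0.3,
             dictionary=frozenset(), bonus=0.1):
    """Build text variants for all chains, score them, filter and rank.

    Assumptions: the estimate of a variant is the mean of its symbol
    estimates, and a dictionary hit adds a fixed bonus (capped at 1.0).
    """
    results = {}
    for chain in paths(graph, start, end):
        # Combine every recognition variant of every fragment in the chain.
        for combo in product(*(graph[edge] for edge in chain)):
            text = "".join(symbol for symbol, _ in combo)
            estimate = sum(score for _, score in combo) / len(combo)
            if text in dictionary:
                estimate = min(1.0, estimate + bonus)
            # Discard variants whose estimate is below the threshold.
            if estimate >= min_estimate:
                results[text] = max(estimate, results.get(text, 0.0))
    # Rank in decreasing order of quality estimate.
    return sorted(results.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for text, estimate in variants(GRAPH, 0, 4, dictionary={"dog"}):
        print(f"{text}: {estimate:.3f}")
```

In this toy graph the classifier estimates alone favour "clog", while supplying a dictionary that contains "dog" moves that variant to the top, which is the kind of re-ranking by additional information the abstract describes.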

EFFECT: higher text recognition accuracy and greater resistance of recognition to interference.

9 cl, 2 dwg

Similar patents RU2295154C1

Title Year Author Number
METHODS AND SYSTEMS FOR PROCESSING IMAGES OF MATHEMATICAL EXPRESSIONS 2014
  • Isupov Dmitry Sergeevich
  • Masalovitch Anton Andreevich
RU2596600C2
HANDWRITING RECOGNITION USING NEURAL NETWORKS 2020
  • Andrey Upshinskiy
RU2757713C1
METHOD FOR AUTOMATIC RECOGNITION OF LANGUAGE OF RECOGNIZED TEXT IN CASE OF MULTILINGUAL RECOGNITION 2002
  • Anisimovich K.V.
  • Tereshchenko V.V.
  • Rybkin V.Ju.
RU2251737C2
METHOD OF DETECTING NECESSITY OF STANDARD LEARNING FOR VERIFICATION OF RECOGNIZED TEXT 2014
  • Krivosheev Mikhail Viktorovich
  • Kolodkina Natalya Aleksandrovna
  • Makushev Aleksandr Sergeevich
RU2641225C2
IMAGE ANALYSIS METHOD, PARTICULARLY FOR MOBILE DEVICE 2008
  • Mosakovski Gerd
RU2454718C2
METHOD AND SYSTEM FOR EXTRACTING DATA FROM IMAGES OF SEMISTRUCTURED DOCUMENTS 2015
  • Kostyukov Mikhail Valerievich
RU2613846C2
OPTICAL CHARACTER RECOGNITION SYSTEM AND METHOD, REDUCING PROCESSING TIME FOR IMAGES POTENTIALLY NOT CONTAINING CHARACTERS 2014
  • Chulinin Yuri Georgievich
RU2571616C1
METHOD FOR TEXTUAL INFORMATION RECOGNITION AND ITS INTEGRITY EVALUATION IN INTERNET ELECTRONIC DOCUMENTS 2013
  • Molchanov Artem Nikolaevich
  • Skurnovich Aleksej Valentinovich
  • Stel'Makh Ehduard Petrovich
  • Molchanov Il'Ja Nikolaevich
RU2550543C1
COMPUTER EQUIPMENT FOR READING OF PRINTED TEXT 1996
  • Zolotov S.A.
  • Kalinin N.N.
  • Balakhontsev A.N.
RU2113726C1
DEVICES AND METHODS USING A HIERARCHICALLY ORDERED DATA STRUCTURE CONTAINING NONPARAMETRIC SYMBOLS FOR CONVERTING DOCUMENT IMAGES TO ELECTRONIC DOCUMENTS 2013
  • Chulinin Yurij Georgievich
RU2643465C2

RU 2 295 154 C1

Authors

Anisimovich Konstantin Vladimirovich

Rybkin Vladimir Jur'Evich

Shamis Aleksandr L'Vovich

Dates

2007-03-10 Published

2005-06-16 Filed