METHOD FOR RECOGNIZING TEXT INFORMATION IN VECTOR-RASTER IMAGE Russian patent published in 2007 - IPC G06K9/36 

Abstract RU 2309456 C2

FIELD: advance processing of vector-raster image of graphic file, containing image of text.

SUBSTANCE: in accordance to the invention, processing of text objects includes division onto separate symbols and groups of symbols based on supposed locations of spaces or other non-display symbols and analysis or combination of symbol groups into words, processing of vector objects includes detection of separators, background, processing of raster objects includes analysis to detect presence of text image in non-text objects, and/or analysis of presence of vector objects, different from separators, including those exiting the limits of objects, while it is additionally possible to perform encoding correctness analysis, and correct when necessary, to that end separate symbols are examined to determine association with given alphabet, and text words are examined to determine association with given vocabulary.

EFFECT: increased reliability of recognition of text, raster and vector objects, production of information about formatting of document and acceleration of processing process.

3 cl

Similar patents RU2309456C2

Title Year Author Number
METHOD OF RECOGNISING GRAPHIC FORMAT MESSAGE CONTENT 2011
  • Zamarin Aleksandr Ivanovich
  • Sazonov Konstantin Viktorovich
RU2479028C2
METHOD FOR TEXTUAL INFORMATION RECOGNITION AND ITS INTEGRITY EVALUATION IN INTERNET ELECTRONIC DOCUMENTS 2013
  • Molchanov Artem Nikolaevich
  • Skurnovich Aleksej Valentinovich
  • Stel'Makh Ehduard Petrovich
  • Molchanov Il'Ja Nikolaevich
RU2550543C1
EDITING THE CONTENT OF AN ELECTRONIC DOCUMENT 2014
  • Korneev Ivan Yurevich
RU2656581C2
DEVICES AND METHODS, WHICH PREPARE PARAMETERED SYMBOLS FOR TRANSFORMING IMAGES OF DOCUMENTS INTO ELECTRONIC DOCUMENTS 2013
  • Chulinin Yurij Georgievich
RU2625020C1
APPARATUS AND METHOD OF SEARCHING FOR DIFFERENCES IN DOCUMENTS 2013
  • Panferov Vasily Vladimirovich
  • Isaev Andrey Anatolievich
  • Bobrova Catherine Yurievna
  • Zhukovskaya Olga Anatolievna
RU2571378C2
DEVICES AND METHODS, WHICH BUILD THE HIERARCHIALLY ORDINARY DATA STRUCTURE, CONTAINING NONPARAMETERIZED SYMBOLS FOR DOCUMENTS IMAGES CONVERSION TO ELECTRONIC DOCUMENTS 2013
  • Chulinin Yurij Georgievich
RU2625533C1
DEVICES AND METHODS USING A HIERARCHIALLY ORDERED DATA STRUCTURE CONTAINING UNPARAMETRIC SYMBOLS FOR CONVERTING DOCUMENT IMAGES TO ELECTRONIC DOCUMENTS 2013
  • Chulinin Yurij Georgievich
RU2643465C2
METHODS AND DEVICES THAT CONVERT IMAGES OF DOCUMENTS TO ELECTRONIC DOCUMENTS USING TRIE-DATA STRUCTURES CONTAINING UNPARAMETERIZED SYMBOLS FOR DEFINITION OF WORD AND MORPHEMES ON DOCUMENT IMAGE 2013
  • Chulinin Yurij Georgievich
RU2631168C2
METHOD OF RECOGNITION OF CONTENT OF COMPRESSED IMMOBILE GRAPHIC MESSAGES IN JPEG FORMAT 2018
  • Ivanov Vladimir Alekseevich
  • Skurnovich Aleksej Valentinovich
  • Revyakin Andrej Mikhajlovich
RU2680358C1
METHOD FOR AUTOMATIC RECOGNITION OF LANGUAGE OF RECOGNIZED TEXT IN CASE OF MULTILINGUAL RECOGNITION 2002
  • Anisimovich K.V.
  • Tereshchenko V.V.
  • Rybkin V.Ju.
RU2251737C2

RU 2 309 456 C2

Authors

Derjagin Dmitrij Georgievich

Sapronenko Vjacheslav Mikhajlovich

Dates

2007-10-27Published

2005-12-08Filed