CONTENT-BASED DOCUMENT IMAGE CLASSIFICATION Russian patent published in 2015 - IPC G06K9/00 G06F17/30 

Abstract RU 2571545 C1

FIELD: physics.

SUBSTANCE: disclosed is a method of classifying one or more document images based on content thereof using a device with a processor. The method includes a step of obtaining a document image. Further, the method includes accessing a set of features stored in memory and analysing the document image to determine the arrangement of blocks. The method also includes recognising a document image using an optical symbol recognition technique to obtain digital content data representing textual content or potential graphical content.

EFFECT: high efficiency of classifying documents based on predetermined features.

27 cl, 3 dwg

Similar patents RU2571545C1

Title Year Author Number
METHOD TO PROCESS DATA OF OPTICAL CHARACTER RECOGNITION (OCR), WHERE OUTPUT DATA INCLUDES CHARACTER IMAGES WITH AFFECTED VISIBILITY 2008
  • Fosseide Knut Tarald
  • Mejer Gans Kristian
RU2445699C1
COMPARING DOCUMENTS USING RELIABLE SOURCE 2014
  • Khintsitskij Ivan Petrovich
  • Isaev Andrej Anatolevich
RU2597163C2
APPARATUS AND METHOD OF SEARCHING FOR DIFFERENCES IN DOCUMENTS 2013
  • Panferov Vasily Vladimirovich
  • Isaev Andrey Anatolievich
  • Bobrova Catherine Yurievna
  • Zhukovskaya Olga Anatolievna
RU2571378C2
METHOD AND SYSTEM FOR CLASSIFYING AND FILTERING PROHIBITED CONTENT IN A NETWORK 2020
  • Prudkovskij Nikolaj Sergeevich
RU2738335C1
METHOD AND SYSTEM FOR EXTRACTING DATA FROM IMAGES OF SEMISTRUCTURED DOCUMENTS 2015
  • Kostyukov Mikhail Valerievich
RU2613846C2
METHODS AND SYSTEMS FOR AUTOMATIC RECOGNITION OF CHARACTERS USING FOREST SOLUTIONS 2015
  • Chulinin Yuri Georgievich
  • Vatlin Yury Aleksandrovich
RU2598300C2
METHOD TO PROCESS OUTPUT DATA OF OPTICAL CHARACTER RECOGNITION (OCR), WHERE OUTPUT DATA CONTAINS IMAGES OF TWICE PRINTED CHARACTERS 2008
  • Fosseide Knut Tarald
  • Mejer Gans Kristian
RU2439700C1
METHODS AND SYSTEMS FOR EFFECTIVE AUTOMATIC RECOGNITION OF SYMBOLS USING FOREST SOLUTIONS 2014
  • Chulinin Yuri Georgievich
  • Senkevich Oleg Evgenievich
RU2582064C1
DETECTION OF IMAGES OF SCREEN ON IMAGES OF DOCUMENTS 2014
  • Deryagin Dmitrij Georgievich
RU2595557C2
CLASSIFICATION OF DOCUMENT IMAGES BASED ON PARAMETERS OF COLOUR LAYERS 2015
  • Smirnov Anatoly Anatolevich
RU2603495C1

RU 2 571 545 C1

Authors

Smirnov Anatoly Anatolyevich

Panferov Vasily Vladimirovich

Isaev Andrey Anatolyevich

Dates

2015-12-20Published

2014-09-30Filed