FIELD: physics.
SUBSTANCE: disclosed is a method of classifying one or more document images based on content thereof using a device with a processor. The method includes a step of obtaining a document image. Further, the method includes accessing a set of features stored in memory and analysing the document image to determine the arrangement of blocks. The method also includes recognising a document image using an optical symbol recognition technique to obtain digital content data representing textual content or potential graphical content.
EFFECT: high efficiency of classifying documents based on predetermined features.
27 cl, 3 dwg
Authors
Dates
2015-12-20—Published
2014-09-30—Filed