FIELD: information technology.
SUBSTANCE: group of inventions relates to techniques for recognition of electronic documents. Disclosed is a method for comparing document images, implemented by the computing device containing a processor. Method includes a step of obtaining an image of the first document from the standard document and a corresponding image of the second document from the compared document. Further, according to the method, marking of the obtained images of the first and second documents is determined. Method also includes the first procedure of optical character recognition of the obtained images of the first and second documents and forming a standard dictionary, the standard dictionary contains words from the text block from the image of the first document.
EFFECT: high accuracy of character recognition by converting the compared image of the document based on marking the standard document image.
21 cl, 6 dwg
Authors
Dates
2016-09-10—Published
2014-11-06—Filed