FIELD: information technology.
SUBSTANCE: in one embodiment of the invention, a relatively small set of orientation symbols that occur frequently in printed text is used. In this embodiment, each of two or more different orientations of a symbol-containing subregion of a text-containing region of a scanned document image is compared with each orientation symbol in at least one set of orientation symbols, in order to determine the orientation of each symbol-containing subregion relative to the initial orientation of the text-containing region. The orientations determined for the symbol-containing subregions are then used to determine the orientation of the text-containing region of the scanned document image.
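The comparison and aggregation described above can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes tiny square binary bitmaps, a pixel-mismatch count as the comparison metric, four candidate orientations in 90-degree steps, and a simple majority vote as the aggregation rule. All names (`rotate_cw`, `mismatch`, `subregion_orientation`, `region_orientation`) and both example templates are hypothetical.

```python
from collections import Counter

# A bitmap is a tuple of equal-length strings: '#' = ink, '.' = background.
# Two hypothetical "orientation symbols" with no rotational symmetry.
TEMPLATE_L = ("#..", "#..", "###")
TEMPLATE_T = ("###", ".#.", ".#.")
ORIENTATION_SYMBOLS = (TEMPLATE_L, TEMPLATE_T)


def rotate_cw(bitmap):
    """Rotate a bitmap 90 degrees clockwise."""
    width = len(bitmap[0])
    return tuple(
        "".join(row[col] for row in reversed(bitmap)) for col in range(width)
    )


def mismatch(a, b):
    """Count the pixels that differ between two same-sized bitmaps."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def subregion_orientation(subregion, orientation_symbols):
    """Return the clockwise rotation (degrees) that best aligns the
    subregion with any orientation symbol."""
    best_score, best_turns = None, 0
    candidate = subregion
    for quarter_turns in range(4):
        # Compare this orientation of the subregion with every symbol.
        for template in orientation_symbols:
            score = mismatch(candidate, template)
            if best_score is None or score < best_score:
                best_score, best_turns = score, quarter_turns
        candidate = rotate_cw(candidate)
    return best_turns * 90


def region_orientation(subregions, orientation_symbols):
    """Aggregate per-subregion orientations by majority vote (an assumption)."""
    votes = Counter(
        subregion_orientation(s, orientation_symbols) for s in subregions
    )
    return votes.most_common(1)[0][0]


# Usage: two symbols scanned with a 90-degree clockwise turn, one with 180.
scanned = [
    rotate_cw(TEMPLATE_L),
    rotate_cw(TEMPLATE_T),
    rotate_cw(rotate_cw(TEMPLATE_L)),
]
correction = region_orientation(scanned, ORIENTATION_SYMBOLS)
```

In this sketch the returned angle is the clockwise correction that brings the region upright. A practical system would presumably use a more robust comparison metric and a weighted aggregation, which the abstract does not specify.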
EFFECT: enabling conversion of printed documents containing text in non-alphabetic languages into corresponding electronic documents.
23 cl, 43 dwg
Dates
2017-07-31—Published
2015-12-02—Filed