FIELD: information technology.
SUBSTANCE: invention relates to detection of text in a bitmap image and a method of detecting spam, containing a bitmap image. The method of detecting text involves (a) identification of background colour in a two-colour bitmap image; (b) finding closed contours around each separate image of combined colour pixels, distinct from the said background colour; (c) exception from examination of contours which are too small or too big; (d) finding the presumed alpha characters on all closed contours, interpreted as contours of alpha characters; (e) division of lines of alpha characters found into sets, interpreted as probable words; (f) exception from examination of words in the lines which are too small or too big; (g) counting the number of pixels of non-background colour inside contours remaining after filtration; (h) determining on content of text in the image based on the ratio of the number of pixels of non-background colour inside contours remaining after filtration to the total number of pixels of non-background colour and under the condition that, at least one line remains.
EFFECT: simple and reliable method of detecting text and spam in bitmap images.
17 cl, 57 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD OF DETECTING AND LOCALIZING TEXT FORMS ON IMAGES | 2016 |
|
RU2697737C2 |
METHOD OF DETECTING SPAM IN BITMAP IMAGE | 2011 |
|
RU2453919C1 |
METHOD TO CONVERT BITMAPPED IMAGE INTO METAFILE | 2011 |
|
RU2469400C1 |
METHOD AND SYSTEM OF PREPARING TEXT-CONTAINING IMAGES TO OPTICAL RECOGNITION OF SYMBOLS | 2016 |
|
RU2636097C1 |
METHOD FOR RECOGNITION OF TEXT IN IMAGES OF DOCUMENTS | 2021 |
|
RU2768544C1 |
EDIT TEXT ON THE DOCUMENT IMAGE | 2016 |
|
RU2642409C1 |
METHOD AND DEVICE FOR EXTRACTING IMAGE AREA | 2015 |
|
RU2642404C2 |
METHOD AND SYSTEM FOR CONVERTING SCREENSHOT INTO METAFILE | 2013 |
|
RU2534005C2 |
DETECTING AND IDENTIFYING OBJECTS ON IMAGES | 2020 |
|
RU2726185C1 |
METHODS AND SYSTEMS FOR PROCESSING IMAGES OF MATHEMATICAL EXPRESSIONS | 2014 |
|
RU2596600C2 |
Authors
Dates
2009-07-27—Published
2007-10-31—Filed