FIELD: information technologies.
SUBSTANCE: method to detect text objects consists in the fact that: for each text object to be detected they generate a list of regular expressions, every of which describes this text object; a syntaxic analyser is created, designed for syntaxic analysis of regular expressions; an individual final automaton is generated on the basis of the syntaxic analyser for each regular expression; individual final automatons of all regular expressions are united into at least one search automaton, designed to search for text objects; search automatons are started on the text of the document to be verified to detect lines in it that represent text objects.
EFFECT: expanded arsenal of technical facilities due to creation of a comparatively fast method of detection of text objects.
7 cl
Title | Year | Author | Number |
---|---|---|---|
OBFUSCATION OF USER CONTENT IN STRUCTURED USER DATA FILES | 2018 |
|
RU2772300C2 |
CONFIGURED NOTES FOR HIGHLY CONFIDENTIAL USER CONTENT | 2018 |
|
RU2764393C2 |
SYSTEM, METHOD AND CONSTANT MACHINE-READABLE MEDIUM FOR VALIDATION OF WEB PAGES | 2015 |
|
RU2632149C2 |
COREFERENCE RESOLUTION IN AMBIGUITY-SENSITIVE NATURAL LANGUAGE PROCESSING SYSTEM | 2008 |
|
RU2480822C2 |
METHOD OF DATA TRANSFORMATION OF GEOINFORMATION SYSTEMS (GIS), SYSTEM FOR ITS IMPLEMENTATION AND METHOD OF SEARCH FOR THE DATA BASED ON THIS METHOD | 2017 |
|
RU2669143C1 |
COMPUTER SYSTEM AND METHOD FOR PREPARING TEXTS IN SOURCE LANGUAGE AND THEIR TRANSLATION INTO FOREIGN LANGUAGES | 1993 |
|
RU2136038C1 |
DEVICE TO PROCESS IMAGES, METHOD AND COMPUTER PROGRAMME TO PROCESS IMAGES | 2008 |
|
RU2437152C2 |
VOICE COMMUNICATION IN NATURAL LANGUAGE BETWEEN HUMAN AND DEVICE | 2014 |
|
RU2583150C1 |
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS | 2017 |
|
RU2665261C1 |
CLASSIFICATION OF DOCUMENTS BY LEVELS OF CONFIDENTIALITY | 2019 |
|
RU2732850C1 |
Authors
Dates
2013-11-10—Published
2012-02-14—Filed